Introduction to statistical pattern recognition (2nd ed.)
Introduction to statistical pattern recognition (2nd ed.)
C4.5: programs for machine learning
C4.5: programs for machine learning
Machine Learning
A Tutorial on Support Vector Machines for Pattern Recognition
Data Mining and Knowledge Discovery
Application of the GA/KNN method to SELDI proteomics data
Bioinformatics
Automatic Quality Assessment of Peptide Tandem Mass Spectra
Bioinformatics
Pattern Recognition Letters
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Markov blanket-embedded genetic algorithm for gene selection
Pattern Recognition
Functional proteomic pattern identification under low dose ionizing radiation
Artificial Intelligence in Medicine
IEEE Transactions on Information Technology in Biomedicine
Possibilistic nonlinear dynamical analysis for pattern recognition
Pattern Recognition
Hi-index | 0.00 |
High-resolution matrix-assisted laser desorption/ionization time-of-flight mass spectrometry has recently shown promise as a screening tool for detecting discriminatory peptide/protein patterns. The major computational obstacle in finding such patterns is the large number of mass/charge peaks (features, biomarkers, data points) in a spectrum. To tackle this problem, we have developed methods for data preprocessing and biomarker selection. The preprocessing consists of binning, baseline correction, and normalization. An algorithm, extended Markov blanket, is developed for biomarker detection, which combines redundant feature removal and discriminant feature selection. The biomarker selection couples with support vector machine to achieve sample prediction from high-resolution proteomic profiles. Our algorithm is applied to recurrent ovarian cancer study that contains platinum-sensitive and platinum-resistant samples after treatment. Experiments show that the proposed method performs better than other feature selection algorithms. In particular, our algorithm yields good performance in terms of both sensitivity and specificity as compared to other methods.