A Machine Learning Approach to Mass Spectra Classification with Unsupervised Feature Selection

  • Authors:
  • Michele Ceccarelli;Antonio D'Acierno;Angelo Facchiano

  • Affiliations:
  • Department of Biological and Environmental Sciences, University of Sannio, Benevento, Italy 82100 and Research Center on Software Technologies, University of Sannio, Benevento, Italy 82100 and Bio ...;Institute of Food Sciences, Italian National Research Council, Avellino, Italy;Institute of Food Sciences, Italian National Research Council, Avellino, Italy

  • Venue:
  • Computational Intelligence Methods for Bioinformatics and Biostatistics
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Mass spectrometry spectra are recognized as a screening tool for detecting discriminatory protein patterns. Mass spectra, however, are high dimensional data and a large number of local maxima (a.k.a. peaks ) have to be analyzed; to tackle this problem we have developed a three-step strategy. After data pre-processing we perform an unsupervised feature selection phase aimed at detecting salient parts of the spectra which could be useful for the subsequent classification phase. The main contribution of the paper is the development of this feature selection and extraction procedure grounded on the theory of multi-scale spaces. Then we use support vector machines for classification. Results obtained by the analysis of a data set of tumor/healthy samples allowed us to correctly classify more than 95% of samples. ROC analysis has been also performed.