Genetic algorithm-based feature selection in high-resolution NMR spectra

  • Authors:
  • Hyun-Woo Cho; Seoung Bum Kim; Myong K. Jeong; Youngja Park; Thomas R. Ziegler; Dean P. Jones

  • Affiliations:
  • Department of Industrial and Information Engineering, The University of Tennessee, Knoxville, TN 37996, USA; Department of Industrial and Manufacturing Systems Engineering, The University of Texas at Arlington, Arlington, TX 76019, USA; Department of Industrial and Information Engineering, The University of Tennessee, Knoxville, TN 37996, USA; Clinical Biomarkers Laboratory, Department of Medicine, Emory University, Atlanta, GA 30322, USA; Center for Clinical and Molecular Nutrition, Department of Medicine, Emory University, Atlanta, GA 30322, USA; Clinical Biomarkers Laboratory, Center for Clinical and Molecular Nutrition, Department of Medicine, Emory University, Atlanta, GA 30322, USA

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2008

Abstract

High-resolution nuclear magnetic resonance (NMR) spectroscopy has provided a new means of detecting and recognizing metabolic changes in biological systems in response to pathophysiological stimuli and to the intake of toxins or nutrients. To identify meaningful patterns in NMR spectra, various statistical pattern recognition methods have been applied to reduce their complexity and uncover implicit metabolic patterns. In this paper, we present a genetic algorithm (GA)-based feature selection method for determining the major metabolite features that play a significant role in discriminating samples from different conditions in high-resolution NMR spectra. In addition, an orthogonal signal filter was employed as a preprocessor of the NMR spectra to remove unwanted variation in the data that is unrelated to the discrimination of different conditions. Results from k-nearest neighbors and partial least squares discriminant analysis of experimental NMR spectra from human plasma showed the potential advantage of the features obtained from GA-based feature selection combined with an orthogonal signal filter.
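
To make the idea of GA-based feature selection concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not specify the GA operators or the fitness function, so everything here is an assumption chosen for illustration: binary masks as chromosomes, truncation selection with elitism, single-point crossover, bit-flip mutation, and a fitness equal to leave-one-out k-nearest-neighbor accuracy minus a small penalty per selected feature. The data are synthetic toy "spectra" generated by the hypothetical helper `make_toy_spectra`, and the orthogonal signal filter preprocessing step is omitted.

```python
import random

def loo_knn_accuracy(X, y, mask, k=3):
    """Leave-one-out k-NN accuracy using only the features where mask is 1."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    correct = 0
    for i in range(len(X)):
        # Squared Euclidean distance to every other sample, restricted to cols.
        dists = sorted(
            (sum((X[i][j] - X[m][j]) ** 2 for j in cols), y[m])
            for m in range(len(X)) if m != i
        )
        votes = [label for _, label in dists[:k]]
        if max(set(votes), key=votes.count) == y[i]:
            correct += 1
    return correct / len(X)

def fitness(X, y, mask, penalty=0.01):
    """Assumed fitness: accuracy minus a parsimony penalty per selected feature."""
    return loo_knn_accuracy(X, y, mask) - penalty * sum(mask)

def ga_select(X, y, n_pop=20, n_gen=15, p_mut=0.05, seed=0):
    """Evolve binary feature masks and return the fittest one found."""
    rng = random.Random(seed)
    n_feat = len(X[0])
    pop = [[rng.randint(0, 1) for _ in range(n_feat)] for _ in range(n_pop)]
    for _ in range(n_gen):
        scored = sorted(pop, key=lambda m: fitness(X, y, m), reverse=True)
        elite = scored[: n_pop // 2]           # truncation selection, elites survive
        children = []
        while len(elite) + len(children) < n_pop:
            a, b = rng.sample(elite, 2)
            cut = rng.randrange(1, n_feat)     # single-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ (rng.random() < p_mut) for bit in child]  # bit-flip mutation
            children.append(child)
        pop = elite + children
    return max(pop, key=lambda m: fitness(X, y, m))

def make_toy_spectra(n_per_class=10, n_feat=12, informative=(2, 7), seed=1):
    """Hypothetical toy data: Gaussian noise, with class 1 shifted on a few features."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            row = [rng.gauss(0, 1) for _ in range(n_feat)]
            for j in informative:
                row[j] += 5.0 * label
            X.append(row)
            y.append(label)
    return X, y

X, y = make_toy_spectra()
best = ga_select(X, y)
selected = [j for j, bit in enumerate(best) if bit]
print("selected feature indices:", selected)
print("LOO k-NN accuracy on selection:", loo_knn_accuracy(X, y, best))
```

In a setting like the one the abstract describes, each feature would correspond to a spectral variable, and the selected indices would point to candidate metabolite features. The parsimony penalty is one simple way to favor small subsets; other fitness definitions, such as one based on partial least squares discriminant analysis, would fit the same loop.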