Simultaneous genes and training samples selection by modified particle swarm optimization for gene expression data classification

  • Authors:
  • Qi Shen;Zhen Mei;Bao-Xian Ye

  • Affiliations:
  • Department of Chemistry, Zhengzhou University, Zhengzhou 450052, China;Department of Chemistry, Zhengzhou University, Zhengzhou 450052, China;Department of Chemistry, Zhengzhou University, Zhengzhou 450052, China

  • Venue:
  • Computers in Biology and Medicine
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Gene expression datasets is a means to classify and predict the diagnostic categories of a patient. Informative genes and representative samples selection are two important aspects for reducing gene expression data. Identifying and pruning redundant genes and samples simultaneously can improve the performance of classification and circumvent the local optima problem. In the present paper, the modified particle swarm optimization was applied to selecting optimal genes and samples simultaneously and support vector machine was used as an objective function to determine the optimum set of genes and samples. To evaluate the performance of the new proposed method, it was applied to three publicly available microarray datasets. It has been demonstrated that the proposed method for gene and sample selection is a useful tool for mining high dimension data.