Hybrid methods to select informative gene sets in microarray data classification

  • Authors:
  • Pengyi Yang; Zili Zhang

  • Affiliations:
  • Intelligent Software and Software Engineering Laboratory, Faculty of Computer and Information Science, Southwest University, Chongqing, China; Intelligent Software and Software Engineering Laboratory, Faculty of Computer and Information Science, Southwest University, Chongqing, China and School of Engineering and Information Technology, ...

  • Venue:
  • AI'07 Proceedings of the 20th Australian joint conference on Advances in artificial intelligence
  • Year:
  • 2007

Abstract

One of the key applications of microarray studies is to select and classify gene expression profiles of cancer and normal subjects. In this study, two hybrid approaches, genetic algorithm with decision tree (GADT) and genetic algorithm with neural network (GANN), are used to select the gene sets that yield the highest classification accuracy. Both were tested on two benchmark microarray datasets, and the most significant disease-related genes were identified. The selected gene sets achieved high sample classification accuracy (96.79% and 94.92% on the colon cancer dataset, 98.67% and 98.05% on the leukemia dataset), comparable to that obtained with the mRMR algorithm. The results indicate that these two hybrid methods can select disease-related genes and improve classification accuracy.
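The general idea of a genetic-algorithm wrapper for gene selection can be illustrated with a minimal sketch. This is not the authors' GADT or GANN implementation: a nearest-centroid classifier stands in for the decision tree and neural network so the example needs only the standard library, the data are synthetic, and every name and parameter here (`make_data`, `loo_accuracy`, `select_genes`, the population size, the mutation rate, the size penalty) is an assumption chosen for illustration.

```python
import random

def make_data(rng, n_samples=40, n_genes=30, informative=(0, 1, 2)):
    """Synthetic expression matrix: only the `informative` genes differ by class."""
    X, y = [], []
    for i in range(n_samples):
        label = i % 2
        row = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
        for g in informative:
            row[g] += 3.0 * label  # shift informative genes for class 1
        X.append(row)
        y.append(label)
    return X, y

def loo_accuracy(X, y, genes):
    """Leave-one-out accuracy of a nearest-centroid classifier using `genes` only."""
    if not genes:
        return 0.0
    correct = 0
    for i in range(len(X)):
        best_label, best_dist = None, float("inf")
        for label in set(y):
            # Class centroid computed without the held-out sample i.
            rows = [X[j] for j in range(len(X)) if j != i and y[j] == label]
            centroid = [sum(r[g] for r in rows) / len(rows) for g in genes]
            dist = sum((X[i][g] - c) ** 2 for g, c in zip(genes, centroid))
            if dist < best_dist:
                best_label, best_dist = label, dist
        correct += best_label == y[i]
    return correct / len(X)

def select_genes(X, y, rng, pop_size=20, generations=10, p_mut=0.05):
    """GA over bit masks; fitness is wrapper accuracy minus a small size penalty."""
    n_genes = len(X[0])

    def indices(mask):
        return [g for g, bit in enumerate(mask) if bit]

    def fitness(mask):
        genes = indices(mask)
        return loo_accuracy(X, y, genes) - 0.001 * len(genes)

    # Start from sparse random masks (about 20% of genes switched on).
    pop = [[int(rng.random() < 0.2) for _ in range(n_genes)]
           for _ in range(pop_size)]
    for _ in range(generations):
        elite = sorted(pop, key=fitness, reverse=True)[: pop_size // 2]
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            cut = rng.randrange(1, n_genes)          # one-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ (rng.random() < p_mut) for bit in child]  # bit-flip mutation
            children.append(child)
        pop = elite + children
    best = indices(max(pop, key=fitness))
    return best, loo_accuracy(X, y, best)

rng = random.Random(0)
X, y = make_data(rng)
genes, acc = select_genes(X, y, rng)
print("selected genes:", genes, "LOO accuracy:", acc)
```

Swapping `loo_accuracy` for the cross-validated accuracy of a decision tree or a neural network would bring the sketch closer to the hybrid schemes named in the abstract. The encoding, operators, and fitness definition the paper actually uses are not given here and may differ.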