An integrated approach of particle swarm optimization and support vector machine for gene signature selection and cancer prediction

  • Authors:
  • C. W. Yeung;F. H. F. Leung;K. Y. Chan;S. H. Ling

  • Affiliations:
  • Centre for Signal Processing, Dept of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong;Centre for Signal Processing, Dept of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong;Digital Ecosystems and Business Intelligence Institute, Curtin University of Technology, Perth, Australia;University of Technology, Sydney, NSW, Australia

  • Venue:
  • IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

To improve cancer diagnosis and drug development, the classification of tumor types based on genomic information is important. As DNA microarray studies produce a large amount of data, expression data are highly redundant and noisy, and most genes are believed to be uninformative with respect to the studied classes. Only a fraction of genes may present distinct profiles for different classes of samples. Classification tools to deal with these issues are thus important. These tools should learn to robustly identify a subset of informative genes embedded in a large dataset that is contaminated with high dimensional noises. In this paper, an integrated approach of support vector machine (SVM) and particle swarm optimization (PSO) is proposed for this purpose. The proposed approach can simultaneously optimize the selection of feature subset and the classifier through a common solution coding mechanism. As an illustration, the proposed approach is applied to search the combinational gene signatures for predicting histologic response to chemotherapy of osteosarcoma patients. Crossvalidation results show that the proposed approach outperforms other existing methods in terms of classification accuracy. Further validation using an independent dataset shows misclassification of only one out of fourteen patient samples, suggesting that the selected gene signatures can reflect the chemoresistance in osteosarcoma.