Performance comparison of tumor classification based on linear and non-linear dimensionality reduction methods

  • Authors:
  • Shu-Lin Wang;Hong-Zhu You;Ying-Ke Lei;Xue-Ling Li

  • Affiliations:
  • Intelligent Computing Laboratory, Hefei Institute of Intelligent Machines, Chinese Academy of Science, Hefei, China and School of Computer and Communication, Hunan University, Changsha, Hunan, Chi ...;Intelligent Computing Laboratory, Hefei Institute of Intelligent Machines, Chinese Academy of Science, Hefei, China;Intelligent Computing Laboratory, Hefei Institute of Intelligent Machines, Chinese Academy of Science, Hefei, China and Electronic Engineering Institute, Hefei, China;Intelligent Computing Laboratory, Hefei Institute of Intelligent Machines, Chinese Academy of Science, Hefei, China

  • Venue:
  • ICIC'10 Proceedings of the 6th international conference on Advanced intelligent computing theories and applications: intelligent computing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Gene expression profiles play more and more important roles in accurate tumor diagnosis and treatment. However, the curse of dimensionality that the number of genes far exceeds the number of samples issues the challenges to the traditional dimensionality reduction methods. Here based on two-stage dimensionality reduction model we design 18 tumor classification methods by combining two classical gene filters with three common dimensionality reduction methods: principal component analysis (PCA), linear discriminative analysis (LDA) and multidimensional scaling (MDS) method to extract discriminative features and use three common machine learning methods to evaluate the prediction accuracy of the extracted features on six tumor datasets, respectively. Although gene expression presents the non-linear characteristics, non-linear dimensionality reduction method MDS is not always the best in prediction accuracy among the three dimensionality reductions on all six tumor datasets. Moreover, the performance comparison indicates that no single dimensionality reduction is always superior to the others on all of the six tumor datasets. Our results also suggest that the prediction accuracy obtained depends strongly on the dataset, and less on the gene selection and classification methods.