Gene extraction for cancer diagnosis by support vector machines-An improvement

Authors:
Te Ming Huang;Vojislav Kecman
Affiliations:
School of Engineering, The University of Auckland, 20 Symonds Street, Private Box 92019, Auckland, New Zealand;School of Engineering, The University of Auckland, 20 Symonds Street, Private Box 92019, Auckland, New Zealand
Venue:
Artificial Intelligence in Medicine
Year:
2005

Citing 2
Cited 14

Gene Selection for Cancer Classification using Support Vector Machines

Machine Learning
Variable selection using svm based criteria

The Journal of Machine Learning Research

An integrated algorithm for gene selection and classification applied to microarray data of ovarian cancer

Artificial Intelligence in Medicine
Forward selection method with regression analysis for optimal gene selection in cancer classification

International Journal of Computer Mathematics - Bioinformatics
Wrapper filtering criteria via linear neuron and kernel approaches

Computers in Biology and Medicine
Efficient Feature Selection for PTR-MS Fingerprinting of Agroindustrial Products

ICANN '08 Proceedings of the 18th international conference on Artificial Neural Networks, Part II
Investigating the Efficacy of Nonlinear Dimensionality Reduction Schemes in Classifying Gene and Protein Expression Studies

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Microarray Design Using the Hilbert---Schmidt Independence Criterion

PRIB '08 Proceedings of the Third IAPR International Conference on Pattern Recognition in Bioinformatics
Comparison of feature selection and classification combinations for cancer classification using microarray data

International Journal of Bioinformatics Research and Applications
An integrated approach of particle swarm optimization and support vector machine for gene signature selection and cancer prediction

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Gene selection for cancer classification through ensemble of methods

ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
SVM-based multimodal classification of activities of daily living in health smart homes: sensors, algorithms, and first experimental results

IEEE Transactions on Information Technology in Biomedicine - Special section on affective and pervasive computing for healthcare
Combining support vector machines and the t-statistic for gene selection in DNA microarray data analysis

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
An efficient alternative to SVM based recursive feature elimination with applications in natural language processing and bioinformatics

AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Accurate Prediction of Coronary Artery Disease Using Reliable Diagnosis System

Journal of Medical Systems
Exploring correlations in gene expression microarray data for maximum predictive-minimum redundancy biomarker selection and classification

Computers in Biology and Medicine

Quantified Score

Hi-index	0.00

Visualization

Abstract

Objective:: To improve the performance of gene extraction for cancer diagnosis by recursive feature elimination with support vector machines (RFE-SVMs): A cancer diagnosis by using the DNA microarray data faces many challenges the most serious one being the presence of thousands of genes and only several dozens (at the best) of patient's samples. Thus, making any kind of classification in high-dimensional spaces from a limited number of data is both an extremely difficult and a prone to an error procedure. The improved RFE-SVMs is introduced and used here for an elimination of less relevant genes and just for a reduction of the overall number of genes used in a medical diagnostic. Methods:: The paper shows why and how the, usually neglected, penalty parameter C and some standard data preprocessing techniques (normalizing and scaling) influence classification results and the gene selection of RFE-SVMs. The gene selected by RFE-SVMs is compared with eight other gene selection algorithms implemented in the Rankgene software to investigate whether there is any consensus among the algorithms, so the scope of finding the right set of genes can be reduced. Results:: The improved RFE-SVMs is applied on the two benchmarking colon and lymphoma cancer data sets with various C parameters and different standard preprocessing techniques. Here, decreasing C leads to the smaller diagnosis error in comparisons to other known methods applied to the benchmarking data sets. With an appropriate parameter C and with a proper preprocessing procedure, the reduction in a diagnosis error is as high as 36%. Conclusions:: The results suggest that with a properly chosen parameter C, the extracted genes and the constructed classifier will ensure less overfitting of the training data leading to an increased accuracy in selecting relevant genes. Finally, comparison in gene ranking obtained by different algorithms shows that there is a significant consensus among the various algorithms as to which set of genes is relevant.