Hybrid huberized support vector machines for microarray classification and gene selection

Authors:
Li Wang;Ji Zhu;Hui Zou
Affiliations:
-;-;-
Venue:
Bioinformatics
Year:
2008

Citing 0
Cited 12

Efficient mining of multilevel gene association rules from microarray and gene ontology

Information Systems Frontiers
A new support vector machine for microarray classification and adaptive gene selection

ACC'09 Proceedings of the 2009 conference on American Control Conference
Exploiting the Accumulated Evidence for Gene Selection in Microarray Gene Expression Data

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
A Bayesian hybrid Huberized support vector machine and its applications in high-dimensional medical data

Computational Statistics & Data Analysis
Gene selection and prediction for cancer classification using support vector machines with a reject option

Computational Statistics & Data Analysis
Rough set based maximum relevance-maximum significance criterion and Gene selection from microarray data

International Journal of Approximate Reasoning
An experimental comparison of gene selection by Lasso and Dantzig selector for cancer classification

Computers in Biology and Medicine
Combined Feature Selection and Cancer Prognosis Using Support Vector Machine Regression

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Combining information theoretic kernels with generative embeddings for classification

Neurocomputing
An iterative SVM approach to feature selection and classification in high-dimensional datasets

Pattern Recognition
CODA: high dimensional copula discriminant analysis

The Journal of Machine Learning Research
ICP: A novel approach to predict prognosis of prostate cancer with inner-class clustering of gene expression data

Computers in Biology and Medicine

Quantified Score

Hi-index	3.84

Visualization

Abstract

Motivation: The standard L2-norm support vector machine (SVM) is a widely used tool for microarray classification. Previous studies have demonstrated its superior performance in terms of classification accuracy. However, a major limitation of the SVM is that it cannot automatically select relevant genes for the classification. The L1-norm SVM is a variant of the standard L2-norm SVM, that constrains the L1-norm of the fitted coefficients. Due to the singularity of the L1-norm, the L1-norm SVM has the property of automatically selecting relevant genes. On the other hand, the L1-norm SVM has two drawbacks: (1) the number of selected genes is upper bounded by the size of the training data; (2) when there are several highly correlated genes, the L1-norm SVM tends to pick only a few of them, and remove the rest. Results: We propose a hybrid huberized support vector machine (HHSVM). The HHSVM combines the huberized hinge loss function and the elastic-net penalty. By doing so, the HHSVM performs automatic gene selection in a way similar to the L1-norm SVM. In addition, the HHSVM encourages highly correlated genes to be selected (or removed) together. We also develop an efficient algorithm to compute the entire solution path of the HHSVM. Numerical results indicate that the HHSVM tends to provide better variable selection results than the L1-norm SVM, especially when variables are highly correlated. Availability: R code are available at http://www.stat.lsa.umich.edu/~jizhu/code/hhsvm/ Contact: jizhu@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online.