Partially supervised feature selection with regularized linear models

Authors:
Thibault Helleputte;Pierre Dupont
Affiliations:
University of Louvain, Louvain-la-Neuve, Belgium;University of Louvain, Louvain-la-Neuve, Belgium
Venue:
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Year:
2009

Citing 9
Cited 5

Choosing Multiple Parameters for Support Vector Machines

Machine Learning
Gene Selection for Cancer Classification using Support Vector Machines

Machine Learning
An introduction to variable and feature selection

The Journal of Machine Learning Research
Use of the zero norm with linear models and kernel methods

The Journal of Machine Learning Research
Hybrid huberized support vector machines for microarray classification

Proceedings of the 24th international conference on Machine learning
Hybrid huberized support vector machines for microarray classification

Proceedings of the 24th international conference on Machine learning
A stability index for feature selection

AIAP'07 Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: artificial intelligence and applications
A review of feature selection techniques in bioinformatics

Bioinformatics
The generalized LASSO

IEEE Transactions on Neural Networks

Feature Selection by Transfer Learning with Linear Regularized Models

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Review Article: Stable feature selection for biomarker discovery

Computational Biology and Chemistry
Modelling complex data by learning which variable to construct

DaWaK'10 Proceedings of the 12th international conference on Data warehousing and knowledge discovery
Improving accuracy of microarray classification by a simple multi-task feature selection filter

International Journal of Data Mining and Bioinformatics
Stable Gene Selection from Microarray Data via Sample Weighting

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper addresses feature selection techniques for classification of high dimensional data, such as those produced by microarray experiments. Some prior knowledge may be available in this context to bias the selection towards some dimensions (genes) a priori assumed to be more relevant. We propose a feature selection method making use of this partial supervision. It extends previous works on embedded feature selection with linear models including regularization to enforce sparsity. A practical approximation of this technique reduces to standard SVM learning with iterative rescaling of the inputs. The scaling factors depend here on the prior knowledge but the final selection may depart from it. Practical results on several microarray data sets show the benefits of the proposed approach in terms of the stability of the selected gene lists with improved classification performances.