Choosing Multiple Parameters for Support Vector Machines
Machine Learning
An introduction to variable and feature selection
The Journal of Machine Learning Research
Use of the zero norm with linear models and kernel methods
The Journal of Machine Learning Research
Hybrid huberized support vector machines for microarray classification
Proceedings of the 24th international conference on Machine learning
Hybrid huberized support vector machines for microarray classification
Proceedings of the 24th international conference on Machine learning
A stability index for feature selection
AIAP'07 Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: artificial intelligence and applications
A review of feature selection techniques in bioinformatics
Bioinformatics
IEEE Transactions on Neural Networks
Feature Selection by Transfer Learning with Linear Regularized Models
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Review Article: Stable feature selection for biomarker discovery
Computational Biology and Chemistry
Modelling complex data by learning which variable to construct
DaWaK'10 Proceedings of the 12th international conference on Data warehousing and knowledge discovery
Improving accuracy of microarray classification by a simple multi-task feature selection filter
International Journal of Data Mining and Bioinformatics
Stable Gene Selection from Microarray Data via Sample Weighting
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Hi-index | 0.00 |
This paper addresses feature selection techniques for classification of high dimensional data, such as those produced by microarray experiments. Some prior knowledge may be available in this context to bias the selection towards some dimensions (genes) a priori assumed to be more relevant. We propose a feature selection method making use of this partial supervision. It extends previous works on embedded feature selection with linear models including regularization to enforce sparsity. A practical approximation of this technique reduces to standard SVM learning with iterative rescaling of the inputs. The scaling factors depend here on the prior knowledge but the final selection may depart from it. Practical results on several microarray data sets show the benefits of the proposed approach in terms of the stability of the selected gene lists with improved classification performances.