With the proliferation of extremely high-dimensional data, feature selection algorithms have become indispensable components of the learning process. Strangely, despite extensive work on the stability of learning algorithms, the stability of feature selection algorithms has been relatively neglected. This study is an attempt to fill that gap by quantifying the sensitivity of feature selection algorithms to variations in the training set. We assess the stability of feature selection algorithms based on the stability of the feature preferences that they express in the form of weights or scores, ranks, or a selected feature subset. We examine a number of measures to quantify the stability of feature preferences and propose an empirical way to estimate them. We perform a series of experiments with several feature selection algorithms on a set of proteomics datasets. The experiments allow us to explore the merits of each stability measure and create stability profiles of the feature selection algorithms. Finally, we show how stability profiles can support the choice of a feature selection algorithm.
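The empirical procedure described above can be illustrated with a minimal sketch: resample the training set several times, run a feature selection algorithm on each resample, and score the agreement between the selected subsets. The code below is a hypothetical, self-contained example, not the authors' implementation: `select_top_k` is a toy univariate selector invented for illustration, and average pairwise Jaccard similarity stands in for the subset-based stability measures the paper examines.

```python
import random

def jaccard(a, b):
    """Similarity between two selected feature subsets (one illustrative measure)."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 1.0

def select_top_k(X, y, k):
    """Toy selector: rank features by absolute mean difference between the two classes."""
    n_features = len(X[0])
    mean = lambda v: sum(v) / len(v) if v else 0.0
    scores = []
    for j in range(n_features):
        pos = [row[j] for row, label in zip(X, y) if label == 1]
        neg = [row[j] for row, label in zip(X, y) if label == 0]
        scores.append(abs(mean(pos) - mean(neg)))
    ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return ranked[:k]

def subset_stability(X, y, k=5, n_resamples=10, seed=0):
    """Empirical stability estimate: average pairwise Jaccard similarity
    of the feature subsets selected on bootstrap resamples of the data."""
    rng = random.Random(seed)
    n = len(X)
    subsets = []
    for _ in range(n_resamples):
        idx = [rng.randrange(n) for _ in range(n)]  # bootstrap resample
        subsets.append(select_top_k([X[i] for i in idx], [y[i] for i in idx], k))
    sims = [jaccard(subsets[i], subsets[j])
            for i in range(len(subsets)) for j in range(i + 1, len(subsets))]
    return sum(sims) / len(sims)
```

A value near 1 means the selector picks nearly the same features regardless of the training sample; values near 0 indicate high sensitivity to training-set variation. Repeating this for several selectors on the same datasets yields the kind of stability profile the abstract refers to.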