Resampling methods for parameter-free and robust feature selection with mutual information

Authors:
D. François;F. Rossi;V. Wertz;M. Verleysen
Affiliations:
Université catholique de Louvain, Machine Learning Group, CESAME, Av. Georges Lemaitre, 4, B-1348 Louvain-la-Neuve, Belgium;Projet AxIS, INRIA, Domaine de Voluceau, Rocquencourt, B.P. 105, 78153 Le Chesnay Cedex, France;Université catholique de Louvain, Machine Learning Group, CESAME, Av. Georges Lemaitre, 4, B-1348 Louvain-la-Neuve, Belgium;Université catholique de Louvain, Machine Learning Group, DICE, Place du Levant, 3, B-1348 Louvain-la-Neuve, Belgium
Venue:
Neurocomputing
Year:
2007

Citing 13
Cited 17

Input Feature Selection by Mutual Information Based on Parzen Window

IEEE Transactions on Pattern Analysis and Machine Intelligence
Feature selection with neural networks

Pattern Recognition Letters
Using a Permutation Test for Attribute Selection in Decision Trees

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Generalized relevance learning vector quantization

Neural Networks - New developments in self-organizing maps
An introduction to variable and feature selection

The Journal of Machine Learning Research
On the Kernel Widths in Radial-Basis Function Networks

Neural Processing Letters
A Feature Selection Newton Method for Support Vector Machine Classification

Computational Optimization and Applications
Fast Binary Feature Selection with Conditional Mutual Information

The Journal of Machine Learning Research
Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing)

Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing)
Representation of functional data in neural networks

Neurocomputing
Speeding up the wrapper feature subset selection in regression by mutual information relevance and redundancy analysis

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part I
Input feature selection for classification problems

IEEE Transactions on Neural Networks
Using mutual information for selecting features in supervised neural net learning

IEEE Transactions on Neural Networks

Combined input variable selection and model complexity control for nonlinear regression

Pattern Recognition Letters
K nearest neighbours with mutual information for simultaneous classification and missing data imputation

Neurocomputing
Simultaneous input variable and basis function selection for RBF networks

Neurocomputing
Advances in Feature Selection with Mutual Information

Similarity-Based Clustering
Information-theoretic feature selection for functional data classification

Neurocomputing
Mineral identification using color spaces and artificial neural networks

Computers & Geosciences
Strengthening the Forward Variable Selection Stopping Criterion

ICANN '09 Proceedings of the 19th International Conference on Artificial Neural Networks: Part II
Sparse kernel density estimations and its application in variable selection based on quadratic Renyi entropy

Neurocomputing
Information-theoretic approaches to SVM feature selection for metagenome read classification

Computational Biology and Chemistry
Feature selection for multi-label classification problems

IWANN'11 Proceedings of the 11th international conference on Artificial neural networks conference on Advances in computational intelligence - Volume Part I
Feature selection with mutual information for uncertain data

DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Feature selection with missing data using mutual information estimators

Neurocomputing
Low bias histogram-based estimation of mutual information for feature selection

Pattern Recognition Letters
Theoretical and empirical study on the potential inadequacy of mutual information for feature selection in classification

Neurocomputing
A new histogram-based estimation technique of entropy and mutual information using mean squared error minimization

Computers and Electrical Engineering
Letters: Mutual information-based feature selection for multilabel classification

Neurocomputing
Estimating mutual information for feature selection in the presence of label noise

Computational Statistics & Data Analysis

Quantified Score

Hi-index	0.01

Visualization

Abstract

Combining the mutual information criterion with a forward feature selection strategy offers a good trade-off between optimality of the selected feature subset and computation time. However, it requires to set the parameter(s) of the mutual information estimator and to determine when to halt the forward procedure. These two choices are difficult to make because, as the dimensionality of the subset increases, the estimation of the mutual information becomes less and less reliable. This paper proposes to use resampling methods, a K-fold cross-validation and the permutation test, to address both issues. The resampling methods bring information about the variance of the estimator, information which can then be used to automatically set the parameter and to calculate a threshold to stop the forward procedure. The procedure is illustrated on a synthetic data set as well as on the real-world examples.