Machine Learning
Wrappers for feature subset selection
Artificial Intelligence - Special issue on relevance
Machine Learning
Ensemble Methods in Machine Learning
MCS '00 Proceedings of the First International Workshop on Multiple Classifier Systems
An introduction to variable and feature selection
The Journal of Machine Learning Research
Variable selection using svm based criteria
The Journal of Machine Learning Research
Use of the zero norm with linear models and kernel methods
The Journal of Machine Learning Research
Empirical characterization of random forest variable importance measures
Computational Statistics & Data Analysis
Consistency of Random Forests and Other Averaging Classifiers
The Journal of Machine Learning Research
Proceedings of the 13th annual conference companion on Genetic and evolutionary computation
Environmental Modelling & Software
A hybrid KMV model, random forests and rough set theory approach for credit rating
Knowledge-Based Systems
Analysis of a random forests model
The Journal of Machine Learning Research
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
A new variable selection approach using Random Forests
Computational Statistics & Data Analysis
Feature subset selection Filter-Wrapper based on low quality data
Expert Systems with Applications: An International Journal
Expert Systems with Applications: An International Journal
Hi-index | 0.10 |
This paper proposes, focusing on random forests, the increasingly used statistical method for classification and regression problems introduced by Leo Breiman in 2001, to investigate two classical issues of variable selection. The first one is to find important variables for interpretation and the second one is more restrictive and try to design a good parsimonious prediction model. The main contribution is twofold: to provide some experimental insights about the behavior of the variable importance index based on random forests and to propose a strategy involving a ranking of explanatory variables using the random forests score of importance and a stepwise ascending variable introduction strategy.