Analysis of a random forests model

Authors: Gérard Biau
Affiliations: LSTA & LPMA, Université Pierre et Marie Curie – Paris VI, Paris Cedex 05, France
Venue: The Journal of Machine Learning Research
Year: 2012


Abstract

Random forests are a scheme proposed by Leo Breiman in the 2000s for building a predictor ensemble from a set of decision trees grown in randomly selected subspaces of the data. Despite growing interest and practical use, there has been little exploration of the statistical properties of random forests, and little is known about the mathematical forces driving the algorithm. In this paper, we offer an in-depth analysis of a random forests model suggested by Breiman (2004), which is very close to the original algorithm. We show in particular that the procedure is consistent and adapts to sparsity, in the sense that its rate of convergence depends only on the number of strong features and not on how many noise variables are present.
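To make the idea of "trees that grow in randomly selected subspaces" concrete, here is a minimal Python sketch of a toy regression forest. It is an illustration only, not the model analyzed in the paper: the function names (`grow_tree`, `predict_tree`, `fit_forest`, `predict_forest`), the parameters (`n_trees`, `n_sub`, `depth`), the midpoint splitting rule, and the assumption that inputs lie in the unit cube are all choices made here for the example.

```python
import random


def grow_tree(X, y, features, depth, rng, lo, hi, fallback):
    """Grow one randomized regression tree on the cell [lo, hi]."""
    if not y:
        # Empty cell: fall back to the parent's mean response.
        return ("leaf", fallback)
    cell_mean = sum(y) / len(y)
    if depth == 0:
        return ("leaf", cell_mean)
    j = rng.choice(features)       # random coordinate from this tree's subspace
    mid = (lo[j] + hi[j]) / 2.0    # assumed rule: split the cell at its midpoint
    left = [(x, t) for x, t in zip(X, y) if x[j] < mid]
    right = [(x, t) for x, t in zip(X, y) if x[j] >= mid]
    hi_left = hi[:j] + [mid] + hi[j + 1:]
    lo_right = lo[:j] + [mid] + lo[j + 1:]
    return (
        "node", j, mid,
        grow_tree([x for x, _ in left], [t for _, t in left],
                  features, depth - 1, rng, lo, hi_left, cell_mean),
        grow_tree([x for x, _ in right], [t for _, t in right],
                  features, depth - 1, rng, lo_right, hi, cell_mean),
    )


def predict_tree(tree, x):
    """Route x down the tree and return the mean stored in its leaf."""
    while tree[0] == "node":
        _, j, mid, left, right = tree
        tree = left if x[j] < mid else right
    return tree[1]


def fit_forest(X, y, n_trees=50, n_sub=2, depth=3, seed=0):
    """Fit n_trees trees, each restricted to n_sub randomly chosen features."""
    rng = random.Random(seed)
    d = len(X[0])
    overall_mean = sum(y) / len(y)
    forest = []
    for _ in range(n_trees):
        features = rng.sample(range(d), n_sub)  # this tree's random subspace
        forest.append(grow_tree(X, y, features, depth, rng,
                                [0.0] * d, [1.0] * d, overall_mean))
    return forest


def predict_forest(forest, x):
    """Average the individual tree predictions."""
    preds = [predict_tree(tree, x) for tree in forest]
    return sum(preds) / len(preds)
```

A typical call would be `forest = fit_forest(X, y)` followed by `predict_forest(forest, x)`, where `X` is a list of points in the unit cube and `y` the matching responses. Averaging over trees built on different random subspaces is what turns many coarse partitions into a single ensemble predictor.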