Standard errors for bagged and random forest estimators

  • Authors:
  • Joseph Sexton;Petter Laake

  • Affiliations:
  • Institute of Basic Medical Sciences, Department of Biostatistics, PO Box 1122 Blindern, 0317 Oslo, Norway;Institute of Basic Medical Sciences, Department of Biostatistics, PO Box 1122 Blindern, 0317 Oslo, Norway

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2009

Quantified Score

Hi-index 0.03

Visualization

Abstract

Bagging and random forests are widely used ensemble methods. Each forms an ensemble of models by randomly perturbing the fitting of a base learner. The standard errors estimation of the resultant regression function is considered. Three estimators are discussed. One, based on the jackknife, is applicable to bagged estimators and can be computed using the bagged ensemble. The two other estimators target the bootstrap standard error estimator, and require fitting multiple ensemble estimators, one for each bootstrap sample. It is shown that these bootstrap ensemble sizes can be small, which reduces the computation involved in forming the estimator. The estimators are studied using both simulated and real data.