Stochastic gradient boosting

  • Authors:
  • Jerome H. Friedman

  • Affiliations:
  • Department of Statistics and Stanford Linear Accelerator Center, Stanford University, Stanford, CA

  • Venue:
  • Computational Statistics & Data Analysis - Nonlinear methods and data mining
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

Gradient boosting constructs additive regression models by sequentially fitting a simple parameterized function (base learner) to current "pseudo'-residuals by least squares at each iteration. The pseudo-residuals are the gradient of the loss functional being minimized, with respect to the model values at each training data point evaluated at the current step. It is shown that both the approximation accuracy and execution speed of gradient boosting can be substantially improved by incorporating randomization into the procedure. Specifically, at each iteration a subsample of the training data is drawn at random (without replacement) from the full training data set. This randomly selected subsample is then used in place of the full sample to fit the base learner and compute the model update for the current iteration. This randomized approach also increases robustness against overcapacity of the base learner.