Randomizing Outputs to Increase Prediction Accuracy

  • Authors:
  • Leo Breiman

  • Affiliations:
  • Statistics Department, University of California, Berkeley, CA 94720, USA. leo@stat.berkeley.edu

  • Venue:
  • Machine Learning
  • Year:
  • 2000

Abstract

Bagging and boosting reduce error by changing both the inputs and outputs to form perturbed training sets, growing predictors on these perturbed training sets, and combining them. An interesting question is whether comparable performance can be obtained by perturbing the outputs alone. Two methods of randomizing outputs are studied: one called output smearing and the other output flipping. Both are shown to consistently do better than bagging.
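
The following is a minimal sketch of the two output-randomization schemes named in the abstract, under stated assumptions rather than the paper's exact recipe: output smearing is taken to add zero-mean Gaussian noise to regression targets, and output flipping to randomly reassign a fraction of class labels; the noise scale, flip rate, tree settings, and helper names are illustrative choices, not values from the paper.

```python
# Hypothetical sketch: perturb only the outputs, grow a tree on each
# perturbed copy, and combine the trees (average for regression,
# plurality vote for classification). Parameter values are illustrative.
import numpy as np
from sklearn.tree import DecisionTreeRegressor, DecisionTreeClassifier


def output_smearing_ensemble(X, y, n_trees=50, noise_scale=1.0, seed=None):
    """Regression: grow each tree on y plus Gaussian noise scaled to std(y)."""
    rng = np.random.default_rng(seed)
    sigma = noise_scale * np.std(y)
    trees = []
    for _ in range(n_trees):
        y_smeared = y + rng.normal(0.0, sigma, size=len(y))  # perturb outputs only
        trees.append(DecisionTreeRegressor().fit(X, y_smeared))
    # Combine by averaging the individual tree predictions.
    return lambda X_new: np.mean([t.predict(X_new) for t in trees], axis=0)


def output_flipping_ensemble(X, y, n_trees=50, flip_rate=0.1, seed=None):
    """Classification: grow each tree on labels with a random fraction reassigned.

    Assumes integer class labels 0..K-1; a flipped label may land on its
    original class, which is accepted here for simplicity.
    """
    rng = np.random.default_rng(seed)
    classes = np.unique(y)
    trees = []
    for _ in range(n_trees):
        y_flipped = y.copy()
        flip = rng.random(len(y)) < flip_rate              # which labels to change
        y_flipped[flip] = rng.choice(classes, size=flip.sum())
        trees.append(DecisionTreeClassifier().fit(X, y_flipped))

    def predict(X_new):
        # Combine by unweighted plurality vote over the trees.
        votes = np.array([t.predict(X_new) for t in trees])
        return np.array([np.bincount(col.astype(int)).argmax() for col in votes.T])

    return predict
```

As in bagging, each base predictor sees the same inputs but a different randomized version of the training set; the difference is that here only the outputs are perturbed, which is the question the abstract poses.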