Online non-stationary boosting

  • Authors:
  • Adam Pocock, Paraskevas Yiapanis, Jeremy Singer, Mikel Luján, Gavin Brown

  • Affiliations:
  • School of Computer Science, University of Manchester, UK (all authors)

  • Venue:
  • MCS'10: Proceedings of the 9th International Conference on Multiple Classifier Systems
  • Year:
  • 2010

Abstract

Oza's Online Boosting algorithm provides a version of AdaBoost that can be trained online for stationary problems. One perspective is that this brings the power of the boosting framework to datasets too large to fit into memory. However, Online Boosting assumes the examples are independent and identically distributed (i.i.d.) and therefore makes no provision for concept drift. We present an algorithm called Online Non-Stationary Boosting (ONSBoost) that, like Online Boosting, uses a static ensemble size and does not generate new members each time new examples are presented, but also adapts to a changing data distribution. We evaluate the new algorithm against Online Boosting on the STAGGER dataset and on three challenging datasets derived from a learning problem inside a parallelising virtual machine. We find that the new algorithm provides equivalent performance on the STAGGER dataset and an improvement of up to 3% on the parallelisation datasets.
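
For context, below is a minimal Python sketch of Oza and Russell's Online Boosting update, the stationary baseline that ONSBoost extends. The class and method names, the single-example partial_fit/predict base-learner interface, and the numerical safeguards are illustrative assumptions, not the paper's implementation; ONSBoost itself is not reproduced here because the abstract does not specify its adaptation mechanism.

    import math
    import numpy as np

    class OnlineBoosting:
        """Oza & Russell-style online boosting with a fixed ensemble size."""

        def __init__(self, make_base, n_members=10, seed=0):
            self.members = [make_base() for _ in range(n_members)]
            self.lam_sc = np.zeros(n_members)  # weight of examples each member got right
            self.lam_sw = np.zeros(n_members)  # weight of examples each member got wrong
            self.rng = np.random.default_rng(seed)

        def partial_fit(self, x, y):
            lam = 1.0
            for m, h in enumerate(self.members):
                # Present the example k ~ Poisson(lam) times to the base learner.
                for _ in range(self.rng.poisson(lam)):
                    h.partial_fit(x, y)
                if h.predict(x) == y:
                    self.lam_sc[m] += lam
                    n_seen = self.lam_sc[m] + self.lam_sw[m]
                    lam *= n_seen / (2.0 * self.lam_sc[m])  # shrink weight for later members
                else:
                    self.lam_sw[m] += lam
                    n_seen = self.lam_sc[m] + self.lam_sw[m]
                    lam *= n_seen / (2.0 * self.lam_sw[m])  # boost weight for later members

        def predict(self, x):
            votes = {}
            for m, h in enumerate(self.members):
                total = self.lam_sc[m] + self.lam_sw[m]
                if total == 0.0:
                    continue
                eps = min(max(self.lam_sw[m] / total, 1e-10), 1.0 - 1e-10)
                if eps >= 0.5:
                    continue  # skip members no better than chance
                weight = math.log((1.0 - eps) / eps)
                label = h.predict(x)
                votes[label] = votes.get(label, 0.0) + weight
            return max(votes, key=votes.get) if votes else None

Any incremental learner exposing partial_fit(x, y) and predict(x) on single examples could serve as make_base in this sketch.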