Online non-stationary boosting

  • Authors:
  • Adam Pocock, Paraskevas Yiapanis, Jeremy Singer, Mikel Luján, Gavin Brown

  • Affiliations:
  • School of Computer Science, University of Manchester, UK (all authors)

  • Venue:
  • MCS'10: Proceedings of the 9th International Conference on Multiple Classifier Systems
  • Year:
  • 2010

Abstract

Oza's Online Boosting algorithm provides a version of AdaBoost that can be trained online for stationary problems. One perspective is that this brings the power of the boosting framework to datasets too large to fit into memory. However, Online Boosting assumes the examples are independent and identically distributed (i.i.d.) and therefore makes no provision for concept drift. We present an algorithm called Online Non-Stationary Boosting (ONSBoost) that, like Online Boosting, uses a static ensemble size and does not generate new members each time new examples are presented, but also adapts to a changing data distribution. We evaluate the new algorithm against Online Boosting on the STAGGER dataset and on three challenging datasets derived from a learning problem inside a parallelising virtual machine. We find that the new algorithm provides equivalent performance on the STAGGER dataset and an improvement of up to 3% on the parallelisation datasets.
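
For context, below is a minimal Python sketch of Oza and Russell's Online Boosting update, the stationary baseline that ONSBoost extends. The class and method names, the single-example partial_fit/predict base-learner interface, and the numerical safeguards are illustrative assumptions, not the paper's implementation; ONSBoost itself is not reproduced here because the abstract does not specify its adaptation mechanism.

    import math
    import numpy as np

    class OnlineBoosting:
        """Oza & Russell-style online boosting with a fixed ensemble size."""

        def __init__(self, make_base, n_members=10, seed=0):
            self.members = [make_base() for _ in range(n_members)]
            self.lam_sc = np.zeros(n_members)  # weight of examples each member got right
            self.lam_sw = np.zeros(n_members)  # weight of examples each member got wrong
            self.rng = np.random.default_rng(seed)

        def partial_fit(self, x, y):
            lam = 1.0
            for m, h in enumerate(self.members):
                # Present the example k ~ Poisson(lam) times to the base learner.
                for _ in range(self.rng.poisson(lam)):
                    h.partial_fit(x, y)
                if h.predict(x) == y:
                    self.lam_sc[m] += lam
                    n_seen = self.lam_sc[m] + self.lam_sw[m]
                    lam *= n_seen / (2.0 * self.lam_sc[m])  # shrink weight for later members
                else:
                    self.lam_sw[m] += lam
                    n_seen = self.lam_sc[m] + self.lam_sw[m]
                    lam *= n_seen / (2.0 * self.lam_sw[m])  # boost weight for later members

        def predict(self, x):
            votes = {}
            for m, h in enumerate(self.members):
                total = self.lam_sc[m] + self.lam_sw[m]
                if total == 0.0:
                    continue
                eps = min(max(self.lam_sw[m] / total, 1e-10), 1.0 - 1e-10)
                if eps >= 0.5:
                    continue  # skip members no better than chance
                weight = math.log((1.0 - eps) / eps)
                label = h.predict(x)
                votes[label] = votes.get(label, 0.0) + weight
            return max(votes, key=votes.get) if votes else None

Any incremental learner exposing partial_fit(x, y) and predict(x) on single examples could serve as make_base in this sketch.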