RCD: A recurring concept drift framework

  • Authors:
  • Paulo Mauricio GonçAlves Jr;Roberto Souto Maior De Barros

  • Affiliations:
  • Centro de Informática, Universidade Federal de Pernambuco, Cidade Universitária, 50.740-560 Recife, Brazil and Instituto Federal de Pernambuco, Cidade Universitária, 50.740-540 Reci ...;Centro de Informática, Universidade Federal de Pernambuco, Cidade Universitária, 50.740-560 Recife, Brazil

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2013

Quantified Score

Hi-index 0.10

Visualization

Abstract

This paper presents recurring concept drifts (RCD), a framework that offers an alternative approach to handle data streams that suffer from recurring concept drifts (on-line learning). It creates a new classifier to each context found and stores a sample of data used to build it. When a new concept drift occurs, the algorithm compares the new context to previous ones using a non-parametric multivariate statistical test to verify if both contexts come from the same distribution. If so, the corresponding classifier is reused. The RCD framework is compared with several algorithms (among single and ensemble approaches), in both artificial and real data sets, chosen from frequently used algorithms and data sets in the concept drift research area. We claim the proposed framework had better average ranks in data sets with abrupt and gradual concept drifts compared to both the single classifiers and the ensemble approaches that use the same base learner.