Regression Trees from Data Streams with Drift Detection

  • Authors:
  • Elena Ikonomovska;João Gama;Raquel Sebastião;Dejan Gjorgjevik

  • Affiliations:
  • FEEIT, Ss. Cyril and Methodius University, Skopje, Macedonia 1000;LIAAD/INESC, University of Porto, Porto, Portugal 4050-190 and Faculty of Economics, University of Porto, Porto, Portugal 4200;LIAAD/INESC, University of Porto, Porto, Portugal 4050-190 and Faculty of Science, University of Porto, Porto, Portugal 4100;FEEIT, Ss. Cyril and Methodius University, Skopje, Macedonia 1000

  • Venue:
  • DS '09 Proceedings of the 12th International Conference on Discovery Science
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The problem of extracting meaningful patterns from time changing data streams is of increasing importance for the machine learning and data mining communities. We present an algorithm which is able to learn regression trees from fast and unbounded data streams in the presence of concept drifts. To our best knowledge there is no other algorithm for incremental learning regression trees equipped with change detection abilities. The FIRT-DD algorithm has mechanisms for drift detection and model adaptation, which enable to maintain accurate and updated regression models at any time. The drift detection mechanism is based on sequential statistical tests that track the evolution of the local error, at each node of the tree, and inform the learning process for the detected changes. As a response to a local drift, the algorithm is able to adapt the model only locally, avoiding the necessity of a global model adaptation. The adaptation strategy consists of building a new tree whenever a change is suspected in the region and replacing the old ones when the new trees become more accurate. This enables smooth and granular adaptation of the global model. The results from the empirical evaluation performed over several different types of drift show that the algorithm has good capability of consistent detection and proper adaptation to concept drifts.