CBDT: A Concept Based Approach to Data Stream Mining

  • Authors:
  • Stefan Hoeglinger;Russel Pears;Yun Sing Koh

  • Affiliations:
  • School of Computing and Mathematical Sciences, Auckland University of Technology, New Zealand;School of Computing and Mathematical Sciences, Auckland University of Technology, New Zealand;School of Computing and Mathematical Sciences, Auckland University of Technology, New Zealand

  • Venue:
  • PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data Stream mining presents unique challenges compared to traditional mining on a random sample drawn from a stationary statistical distribution. Data from real-world data streams are subject to concept drift due to changes that take place continuously in the underlying data generation mechanism. Concept drift complicates the process of mining data as models that are learnt need to be updated continuously to reflect recent changes in the data while retaining relevant information that has been learnt from the past. In this paper, we describe a Concept Based Decision Tree (CBDT) learner and compare it with the CVDFT algorithm, which uses a sliding time window. Our experimental results show that CBDT outperforms CVFDT in terms of both classification accuracy and memory consumption.