Concept Drifting Detection on Noisy Streaming Data in Random Ensemble Decision Trees

  • Authors:
  • Peipei Li;Xuegang Hu;Qianhui Liang;Yunjun Gao

  • Affiliations:
  • School of Computer Science and Information Technology, Hefei University of Technology, China 230009 and School of Information Systems, Singapore Management University, Singapore 178902;School of Computer Science and Information Technology, Hefei University of Technology, China 230009;School of Information Systems, Singapore Management University, Singapore 178902;School of Information Systems, Singapore Management University, Singapore 178902 and College of Computer Science, Zhejiang University, China 310027

  • Venue:
  • MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Although a vast majority of inductive learning algorithms has been developed for handling of the concept drifting data streams, especially the ones in virtue of ensemble classification models, few of them could adapt to the detection on the different types of concept drifts from noisy streaming data in a light demand on overheads of time and space. Motivated by this, a new classification algorithm for Concept drifting Detection based on an ensembling model of Random Decision Trees (called CDRDT) is proposed in this paper. Extensive studies with synthetic and real streaming data demonstrate that in comparison to several representative classification algorithms for concept drifting data streams, CDRDT not only could effectively and efficiently detect the potential concept changes in the noisy data streams, but also performs much better on the abilities of runtime and space with an improvement in predictive accuracy. Thus, our proposed algorithm provides a significant reference to the classification for concept drifting data streams with noise in a light weight way.