Clustering-training for Data Stream Mining

  • Authors:
  • Shuang Wu;Chunyu Yang;Jie Zhou

  • Affiliations:
  • Tsinghua University, Beijing, China;Tsinghua University, Beijing, China;Tsinghua University, Beijing, China

  • Venue:
  • ICDMW '06 Proceedings of the Sixth IEEE International Conference on Data Mining - Workshops
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Mining data streams has attracted much attention recently. Labeled samples needed by most current stream classification methods are more difficult and expensive to obtain than unlabeled ones. This paper proposed a semisupervised learning algorithm - clustering-training to utilize the unlabeled samples. It uses clustering to select confidently unlabeled samples, and uses them to re-train the classifier incrementally. Experiments on synthetic and real data set showed the effectiveness of the proposed algorithm.