Incremental and Adaptive Clustering Stream Data over Sliding Window

  • Authors:
  • Xuan Hong Dang;Vincent C. Lee;Wee Keong Ng;Kok Leong Ong

  • Affiliations:
  • Monash University, Australia;Monash University, Australia;Nanyang Technological University, Singapore;Deakin University, Australia

  • Venue:
  • DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Cluster analysis has played a key role in data stream understanding. The problem is difficult when the clustering task is considered in a sliding window model in which the requirement of outdated data elimination must be dealt with properly. We propose SWEM algorithm that is designed based on the Expectation Maximization technique to address these challenges. Equipped in SWEM is the capability to compute clusters incrementally using a small number of statistics summarized over the stream and the capability to adapt to the stream distribution's changes. The feasibility of SWEM has been verified via a number of experiments and we show that it is superior than Clustream algorithm, for both synthetic and real datasets.