Requirements for clustering data streams
ACM SIGKDD Explorations Newsletter
FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
A framework for diagnosing changes in evolving data streams
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
MONIC: modeling and monitoring cluster transitions
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Density-based clustering for real-time stream data
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A framework for clustering evolving data streams
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Detecting change in data streams
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Tracing evolving clusters by subspace and value similarity
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
Hi-index | 0.00 |
Clustering data streams is a challenging problem that has received significant attention in the recent decade. In this paper, we address the hitherto inadequately addressed challenge of managing the output of stream clustering. This task comprises the continuous cluster model validation, monitoring, trend and change detection, and summarization of the cluster mining output. For this purpose, we propose a complete framework to mine, track and validate the clusters in a data stream. The proposed framework keeps track of each discovered cluster model of the stream through time, while quantifying, modeling and summarizing the cluster evolution trends, and storing a summary of the cluster models and the evolution trends only at milestones corresponding to the occurrence of significant changes. Our experiments demonstrate the accuracy of the proposed framework in tracking cluster evolution over time, while generating a concise summary of the evolution trends over the lifetime of the stream.