Streaming-Data Algorithms for High-Quality Clustering

  • Authors:
  • Affiliations:
  • Venue:
  • ICDE '02 Proceedings of the 18th International Conference on Data Engineering
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

Streaming data analysis has recently attracted attention in numerous applications including telephone records, web documents and clickstreams. For such analysis, single-pass algorithms that consume a small amount of memory are critical. We describe such a streaming algorithm that effectively clusters large data streams. We also provide empirical evidence of the algorithm's performance on synthetic and real data streams.