Subspace Clustering of High Dimensional Data Streams

  • Authors:
  • Shuyun Wang;Yingjie Fan;Chenghong Zhang;HeXiang Xu;Xiulan Hao;Yunfa Hu

  • Affiliations:
  • -;-;-;-;-;-

  • Venue:
  • ICIS '08 Proceedings of the Seventh IEEE/ACIS International Conference on Computer and Information Science (icis 2008)
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, SOStream, which is a novel algorithm of clustering over high dimensional online data stream is presented, it is based on subspace. SOStream partitions the data space into grids, and maintains a superset of all dense units in an online way. A deterministic lower and upper bound of the selectivity of each maintained units are also given. With the maintained potential dense units, SOStream is capable of discovering the clusters in different subspaces over high dimensional data stream with arbitrary shape. The experimental results on real and synthetic datasets demonstrate the effectivity of the approach.