In this paper, we present SOStream, a novel subspace-based algorithm for clustering high-dimensional online data streams. SOStream partitions the data space into grids and maintains a superset of all dense units in an online fashion. Deterministic lower and upper bounds on the selectivity of each maintained unit are also given. Using the maintained potentially dense units, SOStream can discover clusters of arbitrary shape in different subspaces of a high-dimensional data stream. Experimental results on real and synthetic datasets demonstrate the effectiveness of the approach.
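The grid and dense-unit idea can be illustrated with a minimal sketch. This is not the SOStream algorithm itself: the abstract does not specify how the superset of dense units or the selectivity bounds are maintained, so the sketch below simply keeps exact counts for every unit in subspaces of up to two dimensions, which would not scale to truly high-dimensional streams. The names `cell_index`, `GridUnitCounter`, `max_subspace_dim`, and the assumption that every attribute lies in a common range `[low, high]` are all hypothetical and chosen for illustration only. Selectivity is taken here to mean the fraction of stream points that fall into a unit.

```python
from collections import Counter
from itertools import combinations


def cell_index(value, low, high, bins):
    """Map a value in [low, high] to an interval index in 0..bins-1."""
    if high <= low:
        raise ValueError("high must exceed low")
    i = int((value - low) / (high - low) * bins)
    # Clamp so that value == high (or out-of-range values) stays in the grid.
    return min(max(i, 0), bins - 1)


class GridUnitCounter:
    """Exact per-unit counts over all subspaces up to a small dimensionality.

    Illustrative assumption: a unit is a tuple of (dimension, interval)
    pairs, e.g. ((0, 2), (3, 1)) is interval 2 on dimension 0 combined
    with interval 1 on dimension 3.
    """

    def __init__(self, dims, bins, low=0.0, high=1.0, max_subspace_dim=2):
        self.dims = dims
        self.bins = bins
        self.low = low
        self.high = high
        self.max_subspace_dim = max_subspace_dim
        self.counts = Counter()
        self.n = 0  # number of points seen so far

    def update(self, point):
        """Process one arriving point and update every unit it falls into."""
        cells = [cell_index(v, self.low, self.high, self.bins) for v in point]
        self.n += 1
        for k in range(1, self.max_subspace_dim + 1):
            for subspace in combinations(range(self.dims), k):
                unit = tuple((d, cells[d]) for d in subspace)
                self.counts[unit] += 1

    def dense_units(self, threshold):
        """Return {unit: selectivity} for units with selectivity >= threshold."""
        if self.n == 0:
            return {}
        return {
            unit: count / self.n
            for unit, count in self.counts.items()
            if count / self.n >= threshold
        }


if __name__ == "__main__":
    counter = GridUnitCounter(dims=2, bins=4)
    for p in [(0.1, 0.1), (0.15, 0.12), (0.9, 0.9), (0.12, 0.8)]:
        counter.update(p)
    for unit, sel in sorted(counter.dense_units(0.5).items()):
        print(unit, sel)
```

Adjacent dense units in the same subspace could then be merged into clusters, which is how grid-based methods typically obtain arbitrarily shaped clusters; that step is omitted here.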