A clustering algorithm based on matrix over high dimensional data stream
WISM'10 Proceedings of the 2010 international conference on Web information systems and mining
A practice probability frequent pattern mining method over transactional uncertain data streams
UIC'11 Proceedings of the 8th international conference on Ubiquitous intelligence and computing
HUE-Stream: evolution-based clustering technique for heterogeneous data streams with uncertainty
ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
Mining top-k frequent patterns over data streams sliding window
Journal of Intelligent Information Systems
Hi-index | 0.00 |
In this paper, we will study the problem of projected clustering of uncertain data streams. The use of uncertainty is especially important in the high dimensional scenario, because the sparsity property of high dimensional data is aggravated by the uncertainty. The uncertainty information is important for not only the determination of the assignment of data points to clusters, but also that of the valid projections across which the data is naturally clustered. The problem is especially challenging in the case where the data is not available on disk and arrives in the form of a fast stream. In such cases, the one-pass constraint in data stream computation poses special challenges to the algorithmic sophistication required for incorporating uncertainty information into the high dimensional computations. We will show that the projected clustering problem can be effectively solved in the context of uncertain data streams.