A Sliding-Window Approach for Finding Top-k Frequent Itemsets from Uncertain Streams

  • Authors:
  • Xiaojian Zhang;Huili Peng

  • Affiliations:
  • Department of Computer Science, Henan University Finance & Economics, Zhengzhou, China 450002;Department of Education, Henan Radio & Television University, Zhengzhou, China 450008

  • Venue:
  • APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The analysis and management of uncertain data has attracted a lot of attention recently in many important applications such as pattern recognition and sensor network. Frequent itemset mining is often useful in analyzing uncertain data in those applications. However, previous works just focus on the static uncertain data instead of uncertain streams. In this paper, we study the problem of mining top-k FIs in uncertain streams. We propose an efficient algorithm, called UTK-FI, based on sliding-window and Chernoff bound techniques for finding k most frequent itemsets of different sizes. Experimental results show that our algorithm performs much better than many established methods in uncertain streams environment.