An Efficient Approximate Approach to Mining Frequent Itemsets over High Speed Transactional Data Streams

  • Authors:
  • Kuen-Fang Jea;Chao-Wei Li;Tsui-Ping Chang

  • Affiliations:
  • -;-;-

  • Venue:
  • ISDA '08 Proceedings of the 2008 Eighth International Conference on Intelligent Systems Design and Applications - Volume 03
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

A data stream is a massive and unbounded sequence of data elements that are continuously generated at a fast speed. Compared with traditional data mining, knowledge discovery in data streams is more challenging since several requirements need to be satisfied. In this paper we propose a mining algorithm for finding frequent itemsets over a transactional data stream. Unlike most of existing algorithms, our method works based on the theory of Approximate Inclusion–Exclusion to approximate the itemsets' counts. Some techniques are designed and integrated into the algorithm for performance improvement. And the performance of the proposed algorithm is tested and analyzed through several experiments.