Mining Maximal Frequent Itemsets in Data Streams Based on FP-Tree

  • Authors:
  • Fujiang Ao;Yuejin Yan;Jian Huang;Kedi Huang

  • Affiliations:
  • School of Mechanical Engineering and Automation, National University, of Defense Technology, Changsha, 410073, China;School of Computer Science, National University of Defense, Technology, Changsha, 410073, China;School of Mechanical Engineering and Automation, National University, of Defense Technology, Changsha, 410073, China;School of Mechanical Engineering and Automation, National University, of Defense Technology, Changsha, 410073, China

  • Venue:
  • MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Mining maximal frequent itemsets in data streams is more difficult than mining them in static databases for the huge, high-speed and continuous characteristics of data streams. In this paper, we propose a novel one-pass algorithm called FpMFI-DS, which mines all maximal frequent itemsets in Landmark windows or Sliding windows in data streams based on FP-Tree. A new structure of FP-Tree is designed for storing all transactions in Landmark windows or Sliding windows in data streams. To improve the efficiency of the algorithm, a new pruning technique, extension support equivalency pruning (ESEquivPS), is imported to it. The experiments show that our algorithm is efficient and scalable. It is suitable for mining MFIs both in static database and in data streams.