Mining frequent closed itemsets from a landmark window over online data streams

  • Authors:
  • Xuejun Liu;Jihong Guan;Ping Hu

  • Affiliations:
  • Department of Computer Science & Engineering, Fudan University, Shanghai 200433, China and College of Information Science & Engineering, Nanjing University of Technology, Nanjing 210009, China;Department of Computer Science & Technology, Tongji University, Shanghai 200092, China;College of Information Science & Engineering, Nanjing University of Technology, Nanjing 210009, China

  • Venue:
  • Computers & Mathematics with Applications
  • Year:
  • 2009

Quantified Score

Hi-index 0.09

Visualization

Abstract

The frequent closed itemsets determine exactly the complete set of frequent itemsets and are usually much smaller than the later. However, mining frequent closed itemsets from a landmark window over data streams is a challenging problem. To solve the problem, this paper presents a novel algorithm (called FP-CDS) that can capture all frequent closed itemsets and a new storage structure (called FP-CDS tree) that can be dynamically adjusted to reflect the evolution of itemsets' frequencies over time. A landmark window is divided into several basic windows and these basic windows are used as updating units. Potential frequent closed itemsets in each basic window are mined and stored in FP-CDS tree based on some proposed strategies. Extensive experiments are conducted to validate the proposed method.