An efficiently algorithm based on itemsets-lattice and bitmap index for finding frequent itemsets

  • Authors:
  • Fuzan Chen;Minqiang Li

  • Affiliations:
  • School of Management, Tianjin University, (Tianjin), China;School of Management, Tianjin University, (Tianjin), China

  • Venue:
  • FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Frequent itemsets play an essential role in many data mining tasks that try to find interesting patterns from databases. A new algorithm based on the lattice theory and bitmap index for mining frequent itemsets is proposed in this paper. Firstly, the algorithm converts the origin transaction database to an itemsets-lattice (which is a directed graph) in the preprocessing, where each itemset vertex has a label to represent its support. So we can change the complicated task of mining frequent itessets in the database to a simpler one of searching vertexes in the lattice, which can speeds up greatly the mining process. Secondly, Support counting in the association rules mining requires a great I/O and computing cost. A bitmap index technique to speed up the counting process is employed in this paper. Saving the intact bitmap usually has a big space requirement. Each bit vector is partitioned into some blocks, and hence every bit block is encoded as a shorter symbol. Therefore the original bitmap is impacted efficiently. At the end experimental and analytical results are presented.