Domain and data partitioning for parallel mining of frequent closed itemsets

  • Authors:
  • Peiyi Tang;Li Ning;Ningning Wu

  • Affiliations:
  • University of Arkansas at LR, Little Rock, AR;University of Arkansas at LR, Little Rock, AR;University of Arkansas at LR, Little Rock, AR

  • Venue:
  • Proceedings of the 43rd annual Southeast regional conference - Volume 1
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose an algorithm to partition both the search space and the database for the parallel mining of frequent closed itemsets in large databases. The partitioning of the search space is based on splitting the power set lattice of the total item set to two sub-lattices. Conditional databases axe used to partition the large database. The combination of the search space and database partitioning allows parallel processors to mine the frequent closed itemsets independently and thus minimizes the interprocessor communication and synchronization. The partitioning also ensures the load balance among the parallel processors.