An Adaptive Algorithm for Mining Association Rules on Shared-Memory Parallel Machines

  • Authors: David W. Cheung; Kan Hu; Shaowei Xia

  • Affiliations: Department of Computer Science and Information Systems, The University of Hong Kong, Hong Kong (dcheung@csis.hku.hk); Department of Automation, Tsinghua University, Beijing; Department of Automation, Tsinghua University, Beijing (swxia@mail.tsinghua.edu.cn)

  • Venue: Distributed and Parallel Databases
  • Year: 2001

Abstract

Mining association rules from large databases is very costly. We propose parallel algorithms for this task on shared-memory multiprocessors (SMPs). The parallel algorithms proposed so far for other paradigms all follow the conventional level-wise approach: they need as many iterations as the length of the longest large itemset. To make matters worse, they impose a synchronization in every iteration, which would cause serious I/O contention on a shared-memory parallel system. An adaptive asynchronous parallel mining algorithm, APM, is proposed for SMPs. In APM, all processors generate candidates dynamically and count itemset supports independently, without synchronization. Two optimization techniques are proposed to reduce database scanning and the number of candidates. APM has been implemented on a Sun Enterprise 4000 shared-memory multiprocessor with 12 nodes. The experiments show that the optimizations are highly effective and that APM holds a substantial performance lead over other proposed level-wise algorithms.
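
To illustrate the level-wise approach that the abstract contrasts APM with, here is a minimal sequential sketch in Python. It is not APM and not the authors' code: it is a generic Apriori-style loop, and the function name `levelwise_large_itemsets`, the `min_support` parameter (an absolute count), and the sample transactions are assumptions made for illustration. It makes one full pass over the data per level, so the number of passes is at least the length of the longest large itemset.

```python
from itertools import combinations


def levelwise_large_itemsets(transactions, min_support):
    """Return (large itemsets with their support counts, number of passes).

    Hypothetical helper: a generic level-wise (Apriori-style) miner,
    not the APM algorithm described in the abstract.
    """
    transactions = [frozenset(t) for t in transactions]
    # Level 1 candidates: every distinct single item.
    candidates = {frozenset([item]) for t in transactions for item in t}
    large = {}
    passes = 0
    k = 1
    while candidates:
        # One full scan of the database per level.
        passes += 1
        counts = {c: 0 for c in candidates}
        for t in transactions:
            for c in candidates:
                if c <= t:
                    counts[c] += 1
        level_large = {c: n for c, n in counts.items() if n >= min_support}
        large.update(level_large)
        # Join step: combine large k-itemsets into (k+1)-item candidates.
        k += 1
        joined = {a | b for a in level_large for b in level_large
                  if len(a | b) == k}
        # Prune step: keep a candidate only if all its k-1 subsets are large.
        candidates = {c for c in joined
                      if all(frozenset(s) in level_large
                             for s in combinations(c, k - 1))}
    return large, passes


# Assumed sample data: five transactions over the items a, b, c.
sample = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
large, passes = levelwise_large_itemsets(sample, min_support=2)
print(passes, max(len(s) for s in large))
```

In a parallel level-wise variant, every processor would have to finish counting at a given level before the next level's candidates could be generated; that per-level barrier is the synchronization the abstract says APM avoids.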