Tree partition based parallel frequent pattern mining on shared memory systems

  • Authors:
  • Dehao Chen; Chunrong Lai; Wei Hu; WenGuang Chen; Yimin Zhang; Weimin Zheng

  • Affiliations:
  • Tsinghua University, Dept. of Computer Science, Beijing, China; Intel Corporation, Intel China Research Center, Beijing, China; Intel Corporation, Intel China Research Center, Beijing, China; Tsinghua University, Dept. of Computer Science, Beijing, China; Intel Corporation, Intel China Research Center, Beijing, China; Tsinghua University, Dept. of Computer Science, Beijing, China

  • Venue:
  • IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
  • Year:
  • 2006

Abstract

In this paper, we present a tree-partition algorithm for parallel mining of frequent patterns. Our work is based on the FP-Growth algorithm, which consists of a tree-building stage and a mining stage. The main idea is to build only one FP-Tree in memory, partition it into several independent parts, and distribute them to different threads. A heuristic algorithm is devised to balance the workload. Our algorithm not only alleviates the impact of locks during the tree-building stage but also avoids the overhead that severely degrades the mining stage. We present experiments on different kinds of datasets and compare the results with other parallel approaches. The results suggest that our approach has a significant efficiency advantage, especially on certain kinds of datasets. As the number of processors increases, our parallel algorithm shows good scalability.
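
The abstract does not spell out how the tree is partitioned or which heuristic balances the workload, so the following Python sketch is only one plausible illustration of the general idea, not the authors' method. It assumes that the independent parts are the subtrees hanging off the FP-Tree root, that the node count of a subtree is a reasonable stand-in for its workload, and that a greedy largest-first assignment is an acceptable balancing heuristic. All names (`Node`, `build_fp_tree`, `partition_subtrees`) are hypothetical.

```python
from collections import Counter


class Node:
    """One FP-Tree node: an item, its count, and children keyed by item."""

    def __init__(self, item, parent=None):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}


def build_fp_tree(transactions, min_support):
    """Build a single FP-Tree (sequentially here; the paper parallelizes this stage)."""
    freq = Counter(item for t in transactions for item in set(t))
    frequent = {item for item, c in freq.items() if c >= min_support}
    root = Node(None)
    for t in transactions:
        # Keep frequent items only, ordered by descending support (ties by name).
        items = sorted((i for i in set(t) if i in frequent),
                       key=lambda i: (-freq[i], i))
        node = root
        for item in items:
            child = node.children.get(item)
            if child is None:
                child = Node(item, node)
                node.children[item] = child
            child.count += 1
            node = child
    return root


def subtree_size(node):
    """Number of nodes in the subtree rooted at `node` (assumed workload proxy)."""
    return 1 + sum(subtree_size(c) for c in node.children.values())


def partition_subtrees(root, n_threads):
    """Assumed heuristic: give each root-level subtree, largest first,
    to the thread with the smallest load so far."""
    sized = sorted(((subtree_size(c), item) for item, c in root.children.items()),
                   reverse=True)
    loads = [0] * n_threads
    parts = [[] for _ in range(n_threads)]
    for size, item in sized:
        k = loads.index(min(loads))  # least-loaded thread
        parts[k].append(item)
        loads[k] += size
    return parts, loads
```

Under these assumptions, calling `partition_subtrees(build_fp_tree(transactions, min_support), n_threads)` returns, for each thread, the list of root-level items whose subtrees it owns, together with the estimated load per thread. Because every root-to-leaf path lies inside exactly one root-level subtree, threads working on different parts never touch the same node below the root.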