Maximizing the Efficiency of Parallel Apriori Algorithm

  • Authors:
  • Ketan Shah;Sunita Mahajan

  • Affiliations:
  • -;-

  • Venue:
  • ARTCOM '09 Proceedings of the 2009 International Conference on Advances in Recent Technologies in Communication and Computing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we attempt to maximize the efficiency of the parallel Apriori Algorithm. The paper analyzes the performance of the algorithm over different datasets and over n processors on a commodity cluster of machines. In the Apriori Algorithm all processes need to synchronize after every pass. If any process is assigned more load than other processes in the system, the slowest process will dictate the speed of the program. It is therefore important to ensure that load is equally balanced among all processes. Our algorithm determines the no. of running processes and divides the load equally so as to maximize the system performance and its efficiency. The experiments conducted show that the parallel algorithm scales well to the number of processes and also improves on the efficiency by effective load balancing.