Fastest association rule mining algorithm predictor (FARM-AP)

  • Authors:
  • Metanat HooshSadat;Hamman W. Samuel;Sonal Patel;Osmar R. Zaïane

  • Affiliations:
  • University of Alberta, Edmonton, Alberta, Canada;University of Alberta, Edmonton, Alberta, Canada;University of Alberta, Edmonton, Alberta, Canada;University of Alberta, Edmonton, Alberta, Canada

  • Venue:
  • Proceedings of The Fourth International C* Conference on Computer Science and Software Engineering
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Association rule mining is a particularly well studied field in data mining given its importance as a building block in many data analytics tasks. Many studies have focused on efficiency because the data to be mined is typically very large. However, while there are many approaches in literature, each approach claims to be the fastest for some given dataset. In other words, there is no clear winner. On the other hand, there is panoply of algorithms and implementations specifically designed for parallel computing. These solutions are typically implementations of sequential algorithms in a multi-processor configuration focusing on load balancing and data partitioning, each processor running the same implementation on it is own partition. The question we ask in this paper is whether there is a means to select the appropriate frequent itemset mining algorithm given a dataset and if each processor in a parallel implementation could select its own algorithm provided a given partition of the data.