Accelerating frequent itemset mining on graphics processing units

  • Authors:
  • Fan Zhang; Yan Zhang; Jason D. Bakos

  • Affiliations:
  • University of South Carolina, Columbia, USA; University of South Carolina, Columbia, USA; University of South Carolina, Columbia, USA

  • Venue:
  • The Journal of Supercomputing
  • Year:
  • 2013

Abstract

In this paper we describe a new parallel Frequent Itemset Mining algorithm called "Frontier Expansion." This implementation is optimized to achieve high performance on a heterogeneous platform consisting of a shared memory multiprocessor and multiple Graphics Processing Unit (GPU) coprocessors. Frontier Expansion is an improved data-parallel algorithm derived from the Equivalence Class Clustering (Eclat) method, in which a partial breadth-first search is used to expose as much parallelism as possible within the limits of the available memory capacity. In our approach, the vertical transaction lists are stored in a "bitset" representation and processed with wide bitwise operations across multiple threads on a GPU. We evaluate our approach using four NVIDIA Tesla GPUs and observe a 6-30× speedup relative to state-of-the-art sequential Eclat and FPGrowth implementations executed on a multicore CPU.
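To make the "bitset" idea concrete, the following is a minimal sequential sketch in Python, not the Frontier Expansion implementation itself. It assumes each item's vertical transaction list is packed into one integer whose bit t is set when transaction t contains the item, so the support of an itemset is the population count of the bitwise AND of its items' bitsets. The function names (`build_vertical_bitsets`, `support`, `frequent_pairs`) and the toy transactions are hypothetical and chosen only for illustration; the sketch stops at 2-itemsets and does not model the partial breadth-first search, the memory limit, or GPU execution.

```python
from itertools import combinations


def build_vertical_bitsets(transactions):
    """Map each item to an int whose bit t is set when transaction t contains it."""
    bitsets = {}
    for tid, items in enumerate(transactions):
        for item in items:
            bitsets[item] = bitsets.get(item, 0) | (1 << tid)
    return bitsets


def support(bitset):
    """Number of transactions represented by a bitset (its population count)."""
    return bin(bitset).count("1")


def frequent_pairs(transactions, min_support):
    """Return {(a, b): support} for every frequent 2-itemset."""
    bitsets = build_vertical_bitsets(transactions)
    # Only frequent single items can be part of a frequent pair.
    frequent_items = sorted(
        item for item, bits in bitsets.items() if support(bits) >= min_support
    )
    result = {}
    for a, b in combinations(frequent_items, 2):
        # Intersecting two transaction lists is a single bitwise AND.
        joint = bitsets[a] & bitsets[b]
        count = support(joint)
        if count >= min_support:
            result[(a, b)] = count
    return result


# Toy data (hypothetical): four transactions over the items a, b, c, d.
transactions = [
    {"a", "b", "c"},
    {"a", "c"},
    {"a", "d"},
    {"b", "c"},
]
pairs = frequent_pairs(transactions, min_support=2)
```

On a GPU, the same AND-and-count step would presumably be split across threads that each handle one machine word of a much longer bitset; here a Python integer stands in for the whole bitset.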