A new method for GPU based irregular reductions and its application to k-means clustering

  • Authors:
  • Balaji Dhanasekaran;Norman Rubin

  • Affiliations:
  • Advanced Micro Devices, Inc.;Advanced Micro Devices, Inc.

  • Venue:
  • Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

A frequently used method of clustering is a technique called k-means clustering. The k-means algorithm consists of two steps: A map step, which is simple to execute on a GPU, and a reduce step, which is more problematic. Previous researchers have used a hybrid approach in which the map step is computed on the GPU and the reduce step is performed on the CPU. In this work, we present a new algorithm for irregular reductions and apply it to k-means such that the GPU executes both the map and reduce steps. We provide experimental comparisons using OpenCL. Our results show that our scheme is 3.2 times faster than the hybrid scheme for k = 10, an average 1.5 times faster when the number of clusters, k = 100 and on average equal for k = 400, on an ATI Radeon® HD 5870 (best speedup was 3.5 times) compared to the hybrid approach. In addition, we compare the GPU code with the standard OpenMP benchmark, MineBench. In that implementation, both the map and reduce steps are computed on the CPU. For large data sizes, the new GPU scheme shows great promise, with performance up to 35 times faster than MineBench on a four core Intel i7 CPU.