Parallel Pruning for K-Means Clustering on Shared Memory Architectures

  • Authors:
  • Attila Gürsoy;Ilker Cengiz

  • Venue:
  • Euro-Par '01 Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing
  • Year:
  • 2001

Abstract

We developed and evaluated two parallelization schemes for a tree-based k-means clustering method on shared-memory machines. The first scheme partitions the pattern space across processors. We found that spatial decomposition of the patterns outperforms random decomposition, even though random decomposition has almost no load imbalance. The second scheme traverses the search tree in parallel. This approach solves the load imbalance problem and performs slightly better than spatial decomposition, but thread synchronization reduces its efficiency. In both cases, parallel tree-based k-means clustering is significantly faster than direct parallel k-means.
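
To make the difference between the two pattern decompositions concrete, the following is a minimal Python sketch, not the authors' implementation. It covers only the direct (non-tree) assignment step and omits the tree-based pruning entirely. The function names, the choice of cutting the space into slabs along the first coordinate, and the use of a thread pool are all assumptions made for illustration. In CPython the threads will not yield a real speedup because of the global interpreter lock, so the sketch shows only how work is divided.

```python
from concurrent.futures import ThreadPoolExecutor
import random


def nearest(point, centroids):
    """Return the index of the centroid closest to point (squared Euclidean)."""
    return min(
        range(len(centroids)),
        key=lambda j: sum((a - b) ** 2 for a, b in zip(point, centroids[j])),
    )


def random_decomposition(n, workers, seed=0):
    """Hypothetical helper: shuffle pattern indices and deal them out evenly."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[w::workers] for w in range(workers)]


def spatial_decomposition(patterns, workers):
    """Hypothetical helper: sort by first coordinate and cut into contiguous
    slabs, so each worker gets patterns that are close together in space."""
    idx = sorted(range(len(patterns)), key=lambda i: patterns[i][0])
    size = max(1, -(-len(idx) // workers))  # ceiling division, at least 1
    return [idx[s:s + size] for s in range(0, len(idx), size)]


def assign_parallel(patterns, centroids, parts):
    """Assign every pattern to its nearest centroid, one thread per partition.
    Each thread writes to disjoint slots of labels, so no locking is needed."""
    labels = [None] * len(patterns)

    def work(part):
        for i in part:
            labels[i] = nearest(patterns[i], centroids)

    with ThreadPoolExecutor(max_workers=max(1, len(parts))) as pool:
        list(pool.map(work, parts))  # consume the iterator to wait for all threads
    return labels


if __name__ == "__main__":
    patterns = [(0.0, 0.0), (1.0, 0.0), (10.0, 0.0), (11.0, 0.0)]
    centroids = [(0.0, 0.0), (10.0, 0.0)]
    print(assign_parallel(patterns, centroids, spatial_decomposition(patterns, 2)))
    print(assign_parallel(patterns, centroids, random_decomposition(len(patterns), 2)))
```

Both decompositions produce the same labels; they differ only in which patterns each worker handles. With a spatial split, the patterns owned by one worker tend to share the same few candidate centroids, which is the property a tree-based pruning method can exploit, whereas a random split balances the load but scatters each worker's patterns across the whole space.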