Linear-time approximation schemes for clustering problems in any dimensions

  • Authors:
  • Amit Kumar;Yogish Sabharwal;Sandeep Sen

  • Affiliations:
  • Indian Institute of Technology, New Delhi, India;IBM Research - India, New Delhi, India;Indian Institute of Technology, New Delhi, India

  • Venue:
  • Journal of the ACM (JACM)
  • Year:
  • 2010

Quantified Score

Hi-index 0.01

Visualization

Abstract

We present a general approach for designing approximation algorithms for a fundamental class of geometric clustering problems in arbitrary dimensions. More specifically, our approach leads to simple randomized algorithms for the k-means, k-median and discrete k-means problems that yield (1+ϵ) approximations with probability ≥ 1/2 and running times of O(2(k/ϵ)O(1) dn). These are the first algorithms for these problems whose running times are linear in the size of the input (nd for n points in d dimensions) assuming k and ϵ are fixed. Our method is general enough to be applicable to clustering problems satisfying certain simple properties and is likely to have further applications.