Approximate clustering via core-sets

  • Authors:
  • Mihai Bādoiu;Sariel Har-Peled;Piotr Indyk

  • Affiliations:
  • MIT Laboratory for Comp. Sci., Cambridge, MA;University of Illinois, Urbana, IL;MIT Laboratory for Comp. Sci., Cambridge, MA

  • Venue:
  • STOC '02 Proceedings of the thiry-fourth annual ACM symposium on Theory of computing
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we show that for several clustering problems one can extract a small set of points, so that using those core-sets enable us to perform approximate clustering efficiently. The surprising property of those core-sets is that their size is independent of the dimension.Using those, we present a (1+ &egr;)-approximation algorithms for the k-center clustering and k-median clustering problems in Euclidean space. The running time of the new algorithms has linear or near linear dependency on the number of points and the dimension, and exponential dependency on 1/&egr; and k. As such, our results are a substantial improvement over what was previously known.We also present some other clustering results including (1+ &egr;)-approximate 1-cylinder clustering, and k-center clustering with outliers.