Sublinear time approximate clustering

  • Authors:
  • Nina Mishra;Dan Oblinger;Leonard Pitt

  • Affiliations:
  • Hewlett-Packard Labs, Palo Alto, CA;IBM TJ Watson Labs;University of Illinois at Urbana-Champaign, Urbana, IL

  • Venue:
  • SODA '01 Proceedings of the twelfth annual ACM-SIAM symposium on Discrete algorithms
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Clustering is of central importance in a number of disciplines including Machine Learning, Statistics, and Data Mining. This paper has two foci: (1) It describes how existing algorithms for clustering can benefit from simple sampling techniques arising from work in statistics [Pol84]. (2) It motivates and introduces a new model of clustering that is in the spirit of the “PAC (probably approximately correct)” learning model, and gives examples of efficient PAC-clustering algorithms.