Coresets for discrete integration and clustering

  • Authors:
  • Sariel Har-Peled

  • Affiliations:
  • Department of Computer Science, University of Illinois, Urbana, IL

  • Venue:
  • FSTTCS'06 Proceedings of the 26th international conference on Foundations of Software Technology and Theoretical Computer Science
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Given a set P of n points on the real line and a (potentially infinite) family of functions, we investigate the problem of finding a small (weighted) subset ${\mathcal{S}} \subseteq P$, such that for any $f \in {\mathcal{F}}$, we have that f(P) is a (1±ε)-approximation to $f({\mathcal{S}})$. Here, f(Q)=∑q∈Qw(q) f(q) denotes the weighted discrete integral of f over the point set Q, where w(q) is the weight assigned to the point q. We study this problem, and provide tight bounds on the size ${\mathcal{S}}$ for several families of functions. As an application, we present some coreset constructions for clustering.