A spectral algorithm for learning mixture models

  • Authors:
  • Santosh Vempala;Grant Wang

  • Affiliations:
  • Department of Mathematics, MIT, Cambridge, MA;Laboratory for Computer Science, MIT, Cambridge, MA

  • Venue:
  • Journal of Computer and System Sciences - Special issue on FOCS 2002
  • Year:
  • 2004

Quantified Score

Hi-index 0.02

Visualization

Abstract

We show that a simple spectral algorithm for learning a mixture of k spherical Gaussians in Rn works remarkably well--it succeeds in identifying the Gaussians assuming essentially the minimum possible separation between their centers that keeps them unique (solving an open problem of Arora and Kannan (Proceedings of the 33rd ACM STOC, 2001). The sample complexity and running time are polynomial in both n and k. The algorithm can be applied to the more general problem of learning a mixture of "weakly isotropic" distributions (e.g. a mixture of uniform distributions on cubes).