Large-Scale Clustering through Functional Embedding

  • Authors:
  • Frédéric Ratle;Jason Weston;Matthew L. Miller

  • Affiliations:
  • IGAR, University of Lausanne, Amphipôle, Switzerland 1015;NEC Labs America, , Princeton NJ, USA;NEC Labs America, , Princeton NJ, USA

  • Venue:
  • ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a new framework for large-scale data clustering. The main idea is to modify functional dimensionality reduction techniques to directly optimize over discrete labels using stochastic gradient descent. Compared to methods like spectral clustering our approach solves a single optimization problem, rather than an ad-hoc two-stage optimization approach, does not require a matrix inversion, can easily encode prior knowledge in the set of implementable functions, and does not have an "out-of-sample" problem. Experimental results on both artificial and real-world datasets show the usefulness of our approach.