A dependence maximization view of clustering

  • Authors:
  • Le Song;Alex Smola;Arthur Gretton;Karsten M. Borgwardt

  • Affiliations:
  • University of Sydney;Statistical Machine Learning Program, Canberra, Australia;MPI for Biological Cybernetics, Tübingen, Germany;LMU, Oettingenstr., München, Germany

  • Venue:
  • Proceedings of the 24th international conference on Machine learning
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a family of clustering algorithms based on the maximization of dependence between the input variables and their cluster labels, as expressed by the Hilbert-Schmidt Independence Criterion (HSIC). Under this framework, we unify the geometric, spectral, and statistical dependence views of clustering, and subsume many existing algorithms as special cases (e.g. k-means and spectral clustering). Distinctive to our framework is that kernels can also be applied on the labels, which can endow them with particular structures. We also obtain a perturbation bound on the change in k-means clustering.