Semi-Supervised Learning on Riemannian Manifolds

  • Authors:
  • Mikhail Belkin;Partha Niyogi

  • Affiliations:
  • Department of Computer Science, University of Chicago, 1100 E. 58th Street, Chicago, IL 60637, USA. misha@math.uchicago.edu;Department of Computer Science, University of Chicago, 1100 E. 58th Street, Chicago, IL 60637, USA. niyogi@cs.uchicago.edu

  • Venue:
  • Machine Learning
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider the general problem of utilizing both labeled and unlabeled data to improve classification accuracy. Under the assumption that the data lie on a submanifold in a high dimensional space, we develop an algorithmic framework to classify a partially labeled data set in a principled manner. The central idea of our approach is that classification functions are naturally defined only on the submanifold in question rather than the total ambient space. Using the Laplace-Beltrami operator one produces a basis (the Laplacian Eigenmaps) for a Hilbert space of square integrable functions on the submanifold. To recover such a basis, only unlabeled examples are required. Once such a basis is obtained, training can be performed using the labeled data set.Our algorithm models the manifold using the adjacency graph for the data and approximates the Laplace-Beltrami operator by the graph Laplacian. We provide details of the algorithm, its theoretical justification, and several practical applications for image, speech, and text classification.