Large-scale SVD and manifold learning

Authors:
Ameet Talwalkar;Sanjiv Kumar;Mehryar Mohri;Henry Rowley
Affiliations:
University of California, Berkeley, Division of Computer Science, Berkeley, CA;Google Research, New York, NY;Courant Institute and Google Research, New York, NY;Google Research, Mountain View, CA
Venue:
The Journal of Machine Learning Research
Year:
2013

Citing 17
Cited 0

A training algorithm for optimal margin classifiers

COLT '92 Proceedings of the fifth annual workshop on Computational learning theory
Fast Monte-Carlo Algorithms for finding low-rank approximations

FOCS '98 Proceedings of the 39th Annual Symposium on Foundations of Computer Science
The CMU Pose, Illumination, and Expression (PIE) Database

FGR '02 Proceedings of the Fifth IEEE International Conference on Automatic Face and Gesture Recognition
Efficient svm training using low-rank kernel representations

The Journal of Machine Learning Research
Spectral Grouping Using the Nyström Method

IEEE Transactions on Pattern Analysis and Machine Intelligence
A kernel view of the dimensionality reduction of manifolds

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Face Recognition Using Laplacianfaces

IEEE Transactions on Pattern Analysis and Machine Intelligence
Matrix approximation and projective clustering via volume sampling

SODA '06 Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithm
Fast Monte Carlo Algorithms for Matrices II: Computing a Low-Rank Approximation to a Matrix

SIAM Journal on Computing
On the Nyström Method for Approximating a Gram Matrix for Improved Kernel-Based Learning

The Journal of Machine Learning Research
On sampling-based approximate spectral decomposition

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Structure preserving embedding

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
An introduction to nonlinear dimensionality reduction by maximum variance unfolding

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
OpenFst: a general and efficient weighted finite-state transducer library

CIAA'07 Proceedings of the 12th international conference on Implementation and application of automata
Matrix approximation for large-scale learning

Matrix approximation for large-scale learning
Randomized Algorithms for Matrices and Data

Foundations and Trends® in Machine Learning
Sampling methods for the Nyström method

The Journal of Machine Learning Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper examines the efficacy of sampling-based low-rank approximation techniques when applied to large dense kernel matrices. We analyze two common approximate singular value decomposition techniques, namely the Nyström and Column sampling methods. We present a theoretical comparison between these two methods, provide novel insights regarding their suitability for various tasks and present experimental results that support our theory. Our results illustrate the relative strengths of each method. We next examine the performance of these two techniques on the large-scale task of extracting low-dimensional manifold structure given millions of high-dimensional face images. We address the computational challenges of non-linear dimensionality reduction via Isomap and Laplacian Eigenmaps, using a graph containing about 18 million nodes and 65 million edges. We present extensive experiments on learning low-dimensional embeddings for two large face data sets: CMU-PIE (35 thousand faces) and a web data set (18 million faces). Our comparisons show that the Nyström approximation is superior to the Column sampling method for this task. Furthermore, approximate Isomap tends to perform better than Laplacian Eigenmaps on both clustering and classification with the labeled CMU-PIE data set.