Adaptive sampling and fast low-rank matrix approximation

  • Authors:
  • Amit Deshpande; Santosh Vempala

  • Affiliations:
  • Mathematics Department and CSAIL, MIT (both authors)

  • Venue:
  • APPROX'06/RANDOM'06: Proceedings of the 9th International Conference on Approximation Algorithms for Combinatorial Optimization Problems and the 10th International Conference on Randomization and Computation
  • Year:
  • 2006

Abstract

We prove that any real matrix A contains a subset of at most 4k/ε + 2k log(k+1) rows whose span “contains” a matrix of rank at most k with error only (1+ε) times the error of the best rank-k approximation of A. We complement this with an almost matching lower bound by constructing matrices where the span of any k/2ε rows does not “contain” a relative (1+ε)-approximation of rank k. Our existence result leads to an algorithm that finds such a rank-k approximation in time $ O \left( M \left( \frac{k}{\epsilon} + k^{2} \log k \right) + (m+n) \left( \frac{k^{2}}{\epsilon^{2}} + \frac{k^{3} \log k}{\epsilon} + k^{4} \log^{2} k \right) \right), $ i.e., essentially O(Mk/ε), where M is the number of nonzero entries of A. The algorithm maintains sparsity, and in the streaming model [12,14,15], it can be implemented using only 2(k+1)(log(k+1)+1) passes over the input matrix and $O \left( \min \{ m, n \} (\frac{k}{\epsilon} + k^{2} \log k) \right)$ additional space. Previous algorithms for low-rank approximation use only one or two passes but obtain only an additive approximation.
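The adaptive sampling idea behind the abstract — repeatedly sample rows with probability proportional to the squared row norms of the current residual, then take the best rank-k approximation within the span of the sampled rows — can be sketched as follows. This is a minimal NumPy illustration of the general technique, not the paper's exact algorithm; the function names, the number of rounds, and the rows-per-round count are illustrative assumptions:

```python
import numpy as np

def adaptive_row_sample(A, rounds, per_round, seed=0):
    """Adaptive sampling sketch: each round draws rows with probability
    proportional to the squared row norms of the current residual, then
    subtracts the projection of A onto the span of all sampled rows."""
    rng = np.random.default_rng(seed)
    E = A.copy()          # residual; initially A itself
    S = []                # indices of sampled rows
    for _ in range(rounds):
        p = (E * E).sum(axis=1)       # squared residual row norms
        total = p.sum()
        if total <= 1e-12:            # residual numerically zero: done
            break
        S.extend(rng.choice(A.shape[0], size=per_round, p=p / total))
        Q, _ = np.linalg.qr(A[S].T)   # orthonormal basis of sampled rows' span
        E = A - (A @ Q) @ Q.T         # new residual after projection
    return sorted(set(S))

def rank_k_from_rows(A, S, k):
    """Best rank-k approximation of A whose rows lie in span of A[S]."""
    Q, _ = np.linalg.qr(A[S].T)
    P = (A @ Q) @ Q.T                 # project A onto the sampled row span
    U, s, Vt = np.linalg.svd(P, full_matrices=False)
    return (U[:, :k] * s[:k]) @ Vt[:k]
```

When A is (close to) rank k, a few rounds of sampling already capture (most of) its row space, so projecting onto the sampled rows and truncating to rank k recovers a near-optimal approximation.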