Beating randomized response on incoherent matrices

Authors:
Moritz Hardt;Aaron Roth
Affiliations:
IBM Almaden Research, San Jose, CA, USA;University of Pennsylvania, Philadelphia, PA, USA
Venue:
STOC '12 Proceedings of the forty-fourth annual ACM symposium on Theory of computing
Year:
2012

Citing 17
Cited 2

Latent semantic indexing: a probabilistic analysis

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Fast computation of low rank matrix approximations

STOC '01 Proceedings of the thirty-third annual ACM symposium on Theory of computing
Lectures on Discrete Geometry

Lectures on Discrete Geometry
Revealing information while preserving privacy

Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Fast monte-carlo algorithms for finding low-rank approximations

Journal of the ACM (JACM)
Practical privacy: the SuLQ framework

Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Smooth sensitivity and sampling in private data analysis

Proceedings of the thirty-ninth annual ACM symposium on Theory of computing
Robust De-anonymization of Large Sparse Datasets

SP '08 Proceedings of the 2008 IEEE Symposium on Security and Privacy
Differentially private recommender systems: building privacy into the net

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Exact Matrix Completion via Convex Optimization

Foundations of Computational Mathematics
The power of convex relaxation: near-optimal matrix completion

IEEE Transactions on Information Theory
Matrix completion from a few entries

IEEE Transactions on Information Theory
Differentially private combinatorial optimization

SODA '10 Proceedings of the twenty-first annual ACM-SIAM symposium on Discrete Algorithms
Boosting and Differential Privacy

FOCS '10 Proceedings of the 2010 IEEE 51st Annual Symposium on Foundations of Computer Science
Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions

SIAM Review
Toward privacy in public databases

TCC'05 Proceedings of the Second international conference on Theory of Cryptography
Calibrating noise to sensitivity in private data analysis

TCC'06 Proceedings of the Third conference on Theory of Cryptography

Beyond worst-case analysis in private singular vector computation

Proceedings of the forty-fifth annual ACM symposium on Theory of computing
A near-optimal algorithm for differentially-private principal components

The Journal of Machine Learning Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

Computing accurate low rank approximations of large matrices is a fundamental data mining task. In many applications however the matrix contains sensitive information about individuals. In such case we would like to release a low rank approximation that satisfies a strong privacy guarantee such as differential privacy. Unfortunately, to date the best known algorithm for this task that satisfies differential privacy is based on naive input perturbation or randomized response: Each entry of the matrix is perturbed independently by a sufficiently large random noise variable, a low rank approximation is then computed on the resulting matrix. We give (the first) significant improvements in accuracy over randomized response under the natural and necessary assumption that the matrix has low coherence. Our algorithm is also very efficient and finds a constant rank approximation of an m x n matrix in time O(mn). Note that even generating the noise matrix required for randomized response already requires time O(mn).