Approximate Sparse Recovery: Optimizing Time and Measurements

Authors:
Anna C. Gilbert;Yi Li;Ely Porat;Martin J. Strauss
Affiliations:
annacg@umich.edu;leeyi@umich.edu;porately@cs.biu.ac.il;martinjs@umich.edu
Venue:
SIAM Journal on Computing
Year:
2012

Citing 12
Cited 1

The space complexity of approximating the frequency moments

Journal of Computer and System Sciences
Finding Frequent Items in Data Streams

ICALP '02 Proceedings of the 29th International Colloquium on Automata, Languages and Programming
What's hot and what's not: tracking most frequent items dynamically

Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
One sketch for all: fast algorithms for compressed sensing

Proceedings of the thirty-ninth annual ACM symposium on Theory of computing
Near-Optimal Sparse Recovery in the L1 Norm

FOCS '08 Proceedings of the 2008 49th Annual IEEE Symposium on Foundations of Computer Science
Information-theoretic limits on sparsity recovery in the high-dimensional and noisy setting

IEEE Transactions on Information Theory
K-mismatch with don't cares

ESA'07 Proceedings of the 15th annual European conference on Algorithms
CoSaMP: iterative signal recovery from incomplete and inaccurate samples

Communications of the ACM
Lower bounds for sparse recovery

SODA '10 Proceedings of the twenty-first annual ACM-SIAM symposium on Discrete Algorithms
(1 + eps)-Approximate Sparse Recovery

FOCS '11 Proceedings of the 2011 IEEE 52nd Annual Symposium on Foundations of Computer Science
Combinatorial algorithms for compressed sensing

SIROCCO'06 Proceedings of the 13th international conference on Structural Information and Communication Complexity
Compressed sensing

IEEE Transactions on Information Theory

ℓ2/ℓ2-Foreach sparse recovery with low risk

ICALP'13 Proceedings of the 40th international conference on Automata, Languages, and Programming - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

A Euclidean approximate sparse recovery system consists of parameters $k,N$, an $m$-by-$N$ measurement matrix, $\bm{\Phi}$, and a decoding algorithm, $\mathcal{D}$. Given a vector, ${\mathbf x}$, the system approximates ${\mathbf x}$ by $\widehat {\mathbf x}=\mathcal{D}(\bm{\Phi} {\mathbf x})$, which must satisfy $|\widehat {\mathbf x} - {\mathbf x}|_2\le C |{\mathbf x} - {\mathbf x}_k|_2$, where ${\mathbf x}_k$ denotes the optimal $k$-term approximation to ${\mathbf x}$. (The output $\widehat{\mathbf x}$ may have more than $k$ terms.) For each vector ${\mathbf x}$, the system must succeed with probability at least 3/4. Among the goals in designing such systems are minimizing the number $m$ of measurements and the runtime of the decoding algorithm, $\mathcal{D}$. In this paper, we give a system with $m=O(k \log(N/k))$ measurements—matching a lower bound, up to a constant factor—and decoding time $k\log^{O(1)} N$, matching a lower bound up to a polylog$(N)$ factor. We also consider the encode time (i.e., the time to multiply $\bm{\Phi}$ by $x$), the time to update measurements (i.e., the time to multiply $\bm{\Phi}$ by a 1-sparse $x$), and the robustness and stability of the algorithm (resilience to noise before and after the measurements). Our encode and update times are optimal up to $\log(k)$ factors. The columns of $\bm{\Phi}$ have at most $O(\log^2(k)\log(N/k))$ nonzeros, each of which can be found in constant time. Our full result, a fully polynomial randomized approximation scheme, is as follows. If ${\mathbf x}={\mathbf x}_k+\nu_1$, where $\nu_1$ and $\nu_2$ (below) are arbitrary vectors (regarded as noise), then setting $\widehat {\mathbf x} = \mathcal{D}(\Phi {\mathbf x} + \nu_2)$, and for properly normalized $\bm{\Phi}$, we get $\left|{\mathbf x} - \widehat {\mathbf x}\right|_2^2 \le (1+\epsilon)\left|\nu_1\right|_2^2 + \epsilon\left|\nu_2\right|_2^2$ using $O((k/\epsilon)\log(N/k))$ measurements and $(k/\epsilon)\log^{O(1)}(N)$ time for decoding.