A comparison of the lasso and marginal regression

Authors:
Christopher R. Genovese;Jiashun Jin;Larry Wasserman;Zhigang Yao
Affiliations:
Department of Statistics, Carnegie Mellon University, Pittsburgh, PA;Department of Statistics, Carnegie Mellon University, Pittsburgh, PA;Department of Statistics, Carnegie Mellon University, Pittsburgh, PA;Department of Statistics, Carnegie Mellon University, Pittsburgh, PA
Venue:
The Journal of Machine Learning Research
Year:
2012

Citing 7
Cited 0

Atomic Decomposition by Basis Pursuit

SIAM Journal on Scientific Computing
All of Nonparametric Statistics (Springer Texts in Statistics)

All of Nonparametric Statistics (Springer Texts in Statistics)
On Model Selection Consistency of Lasso

The Journal of Machine Learning Research
Shifting inequality and recovery of sparse signals

IEEE Transactions on Signal Processing
Uncertainty principles and ideal atomic decomposition

IEEE Transactions on Information Theory
Greed is good: algorithmic results for sparse approximation

IEEE Transactions on Information Theory
Recovery of exact sparse representations in the presence of bounded noise

IEEE Transactions on Information Theory

Quantified Score

Hi-index	0.00

Visualization

Abstract

The lasso is an important method for sparse, high-dimensional regression problems, with efficient algorithms available, a long history of practical success, and a large body of theoretical results supporting and explaining its performance. But even with the best available algorithms, finding the lasso solutions remains a computationally challenging task in cases where the number of covariates vastly exceeds the number of data points. Marginal regression, where each dependent variable is regressed separately on each covariate, offers a promising alternative in this case because the estimates can be computed roughly two orders faster than the lasso solutions. The question that remains is how the statistical performance of the method compares to that of the lasso in these cases. In this paper, we study the relative statistical performance of the lasso and marginal regression for sparse, high-dimensional regression problems. We consider the problem of learning which coefficients are non-zero. Our main results are as follows: (i) we compare the conditions under which the lasso and marginal regression guarantee exact recovery in the fixed design, noise free case; (ii) we establish conditions under which marginal regression provides exact recovery with high probability in the fixed design, noise free, random coefficients case; and (iii) we derive rates of convergence for both procedures, where performance is measured by the number of coefficients with incorrect sign, and characterize the regions in the parameter space recovery is and is not possible under this metric. In light of the computational advantages of marginal regression in very high dimensional problems, our theoretical and simulations results suggest that the procedure merits further study.