| dc.contributor.author | Liao, Yuan | |
| dc.contributor.author | Chernozhukov, Victor V | |
| dc.contributor.author | Hansen, Christian B. | |
| dc.date.accessioned | 2018-02-21T16:08:02Z | |
| dc.date.available | 2018-02-21T16:08:02Z | |
| dc.date.issued | 2017-02 | |
| dc.date.submitted | 2015-12 | |
| dc.identifier.issn | 0090-5364 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/113848 | |
| dc.description.abstract | Common high-dimensional methods for prediction rely on having either a sparse signal model, a model in which most parameters are zero and there are a small number of nonzero parameters that are large in magnitude, or a dense signal model, a model with no large parameters and very many small nonzero parameters. We consider a generalization of these two basic models, termed here a "sparse + dense" model, in which the signal is given by the sum of a sparse signal and a dense signal. Such a structure poses problems for traditional sparse estimators, such as the lasso, and for traditional dense estimation methods, such as ridge estimation. We propose a new penalization-based method, called lava, which is computationally efficient. With suitable choices of penalty parameters, the proposed method strictly dominates both lasso and ridge. We derive analytic expressions for the finite-sample risk function of the lava estimator in the Gaussian sequence model. We also provide a deviation bound for the prediction risk in the Gaussian regression model with fixed design. In both cases, we provide Stein's unbiased estimator for lava's prediction risk. A simulation example compares the performance of lava to lasso, ridge and elastic net in a regression example using data-dependent penalty parameters and illustrates lava's improved performance relative to these benchmarks. | en_US |
| dc.publisher | Institute of Mathematical Statistics | en_US |
| dc.relation.isversionof | http://dx.doi.org/10.1214/16-AOS1434 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | arXiv | en_US |
| dc.title | A lava attack on the recovery of sums of dense and sparse signals | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Chernozhukov, Victor et al. “A Lava Attack on the Recovery of Sums of Dense and Sparse Signals.” The Annals of Statistics 45, 1 (February 2017): 39–76 | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Economics | en_US |
| dc.contributor.mitauthor | Chernozhukov, Victor V | |
| dc.contributor.mitauthor | Hansen, Christian B. | |
| dc.relation.journal | The Annals of Statistics | en_US |
| dc.eprint.version | Original manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2018-02-20T17:47:45Z | |
| dspace.orderedauthors | Chernozhukov, Victor; Hansen, Christian; Liao, Yuan | en_US |
| dspace.embargo.terms | N | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0002-3250-6714 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |