Sparse Models and Methods for Optimal Instruments with an Application to Eminent Domain

Belloni, Alexandre; Chen, Daniel; Chernozhukov, Victor; Hansen, Christian

Author(s)

Belloni, Alexandre; Chen, Daniel; Chernozhukov, Victor; Hansen, Christian

DownloadVictor11-19.pdf (599.7Kb)

Terms of use

An error occurred on the license name. An error occurred getting the license - uri.

Metadata

Show full item record

Abstract

We develop results for the use of LASSO and Post-LASSO methods to form first-stage predictions and estimate optimal instruments in linear instrumental variables (IV) models with many instruments, p, that apply even when p is much larger than the sample size, n. We rigorously develop asymptotic distribution and inference theory for the resulting IV estimators and provide conditions under which these estimators are asymptotically oracle-efficient. In simulation experiments, the LASSO-based IV estimator with a data-driven penalty performs well compared to recently advocated many-instrument-robust procedures. In an empirical example dealing with the effect of judicial eminent domain decisions on economic outcomes, the LASSO based IV estimator substantially reduces estimated standard errors allowing one to draw much more precise conclusions about the economic effects of these decisions. Optimal instruments are conditional expectations; and in developing the IV results, we also establish a series of new results for LASSO and Post-LASSO estimators of non-parametric conditional expectation functions which are of independent theoretical and practical interest. Specifically, we develop the asymptotic theory for these estimators that allows for non-Gaussian, heteroscedastic disturbances, which is important for econometric applications. By innovatively using moderate deviation theory for self-normalized sums, we provide convergence rates for these estimators that are as sharp as in the homoscedastic Gaussian case under the weak condition that log p = o(n1=3). Moreover, as a practical innovation, we provide a fully data-driven method for choosing the user-specified penalty that must be provided in obtaining LASSO and Post-LASSO estimates and establish its asymptotic validity under non-Gaussian, heteroscedastic disturbances.

Description

Date: First version: June 2009, this version October 28, 2010. Preliminary results of this paper were FIRST presented at Chernozhukov's invited Cowles Foundation lecture at the Northern American meetings of the Econometric society in June of 2009. We thank seminar participants at Brown, Columbia, Harvard-MIT, the Dutch Econometric Study Group, Fuqua School of Business, and NYU for helpful comments. We also thank Denis Chetverikov, JB Doyle, and Joonhwan Lee for thorough reading of this paper and helpful feedback.

Date issued

2011-07-12

URI

http://hdl.handle.net/1721.1/65157

Publisher

Cambridge, MA: Department of Economics, Massachusetts Institute of Technology

Series/Report no.

Working paper (Massachusetts Institute of Technology, Department of Economics);11-19

Keywords

Instrumental Variables, Optimal Instruments, LASSO, Post-LASSO, Sparsity, Eminent Domain, Data-Driven Penalty, Heteroscedasticity, non-Gaussian errors, moderate deviations for self-normalized sums

Collections

MIT Dept. of Economics Working Papers Series

The following license files are associated with this item:

Creative Commons