Post-Selection Inference for Generalized Linear Models With Many Controls
Author(s)
Belloni, Alexandre; Wei, Ying; Chernozhukov, Victor V
Download1304.3969.pdf (660.8Kb)
OPEN_ACCESS_POLICY
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
This article considers generalized linear models in the presence of many controls. We lay out a general methodology to estimate an effect of interest based on the construction of an instrument that immunizes against model selection mistakes and apply it to the case of logistic binary choice model. More specifically we propose new methods for estimating and constructing confidence regions for a regression parameter of primary interest α[subscript 0], a parameter in front of the regressor of interest, such as the treatment variable or a policy variable. These methods allow to estimate α[subscript 0] at the root-n rate when the total number p of other regressors, called controls, potentially exceeds the sample size n using sparsity assumptions. The sparsity assumption means that there is a subset of s < n controls, which suffices to accurately approximate the nuisance part of the regression function. Importantly, the estimators and these resulting confidence regions are valid uniformly over s-sparse models satisfying s[superscript 2]log [superscript 2]p = o(n) and other technical conditions. These procedures do not rely on traditional consistent model selection arguments for their validity. In fact, they are robust with respect to moderate model selection mistakes in variable selection. Under suitable conditions, the estimators are semi-parametrically efficient in the sense of attaining the semi-parametric efficiency bounds for the class of models in this article.
Date issued
2016-03Department
Massachusetts Institute of Technology. Department of EconomicsJournal
Journal of Business & Economic Statistics
Publisher
Informa UK Limited
Citation
Belloni, Alexandre, Victor Chernozhukov, and Ying Wei. “Post-Selection Inference for Generalized Linear Models With Many Controls.” Journal of Business & Economic Statistics 34, no. 4 (September 15, 2016): 606–619.
Version: Author's final manuscript
ISSN
0735-0015
1537-2707