Latent variable graphical model selection via convex optimization

Chandrasekaran, Venkat; Parrilo, Pablo A.; Willsky, Alan S.

dc.contributor.author	Willsky, Alan S.
dc.date.accessioned	2013-03-13T17:50:55Z
dc.date.available	2013-03-13T17:50:55Z
dc.date.issued	2012
dc.date.submitted	2011-11
dc.identifier.issn	0090-5364
dc.identifier.uri	http://hdl.handle.net/1721.1/77885
dc.description.abstract	Suppose we observe samples of a subset of a collection of random variables. No additional information is provided about the number of latent variables, nor of the relationship between the latent and observed variables. Is it possible to discover the number of latent components, and to learn a statistical model over the entire collection of variables? We address this question in the setting in which the latent and observed variables are jointly Gaussian, with the conditional statistics of the observed variables conditioned on the latent variables being specified by a graphical model. As a first step we give natural conditions under which such latent-variable Gaussian graphical models are identifiable given marginal statistics of only the observed variables. Essentially these conditions require that the conditional graphical model among the observed variables is sparse, while the effect of the latent variables is “spread out” over most of the observed variables. Next we propose a tractable convex program based on regularized maximum-likelihood for model selection in this latent-variable setting; the regularizer uses both the ℓ[subscript 1] norm and the nuclear norm. Our modeling framework can be viewed as a combination of dimensionality reduction (to identify latent variables) and graphical modeling (to capture remaining statistical structure not attributable to the latent variables), and it consistently estimates both the number of latent components and the conditional graphical model structure among the observed variables. These results are applicable in the high-dimensional setting in which the number of latent/observed variables grows with the number of samples of the observed variables. The geometric properties of the algebraic varieties of sparse matrices and of low-rank matrices play an important role in our analysis.	en_US
dc.description.sponsorship	United States. Air Force Office of Scientific Research (AFOSR FA9550-08-1-0180)	en_US
dc.description.sponsorship	United States. Air Force Office of Scientific Research (Grant FA9550-06-1-0303)	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (FRG 0757207)	en_US
dc.language.iso	en_US
dc.publisher	Institute of Mathematical Statistics	en_US
dc.relation.isversionof	http://dx.doi.org/10.1214/12-aos1020	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Institute of Mathematical Statistics	en_US
dc.title	Latent variable graphical model selection via convex optimization	en_US
dc.type	Article	en_US
dc.identifier.citation	Chandrasekaran, Venkat, Pablo A. Parrilo, and Alan S. Willsky. “Latent Variable Graphical Model Selection via Convex Optimization.” The Annals of Statistics 40.4 (2012): 1935–1967. ©2012 Institute of Mathematical Statistics	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.mitauthor	Willsky, Alan S.
dc.relation.journal	Annals of Statistics	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dspace.orderedauthors	Chandrasekaran, Venkat; Parrilo, Pablo A.; Willsky, Alan S.	en_US
dc.identifier.orcid	https://orcid.org/0000-0003-0149-5888
mit.license	PUBLISHER_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Chandrasekaran-2012-REJOINDER.pdf
Size:: 313.3Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record