Show simple item record

dc.contributor.author: Choi, Myung Jin
dc.contributor.author: Willsky, Alan S.
dc.contributor.author: Tan, Vincent Y. F.
dc.contributor.author: Anandkumar, Animashree
dc.date.accessioned: 2012-06-28T12:30:26Z
dc.date.available: 2012-06-28T12:30:26Z
dc.date.issued: 2011-05
dc.date.submitted: 2011-02
dc.identifier.issn: 1532-4435
dc.identifier.issn: 1533-7928
dc.identifier.uri: http://hdl.handle.net/1721.1/71241
dc.description.abstract: We study the problem of learning a latent tree graphical model where samples are available only from a subset of variables. We propose two consistent and computationally efficient algorithms for learning minimal latent trees, that is, trees without any redundant hidden nodes. Unlike many existing methods, the observed nodes (or variables) are not constrained to be leaf nodes. Our algorithms can be applied to both discrete and Gaussian random variables and our learned models are such that all the observed and latent variables have the same domain (state space). Our first algorithm, recursive grouping, builds the latent tree recursively by identifying sibling groups using so-called information distances. One of the main contributions of this work is our second algorithm, which we refer to as CLGrouping. CLGrouping starts with a pre-processing procedure in which a tree over the observed variables is constructed. This global step groups the observed nodes that are likely to be close to each other in the true latent tree, thereby guiding subsequent recursive grouping (or equivalent procedures such as neighbor-joining) on much smaller subsets of variables. This results in more accurate and efficient learning of latent trees. We also present regularized versions of our algorithms that learn latent tree approximations of arbitrary distributions. We compare the proposed algorithms to other methods by performing extensive numerical experiments on various latent tree graphical models such as hidden Markov models and star graphs. In addition, we demonstrate the applicability of our methods on real-world data sets by modeling the dependency structure of monthly stock returns in the S&P index and of the words in the 20 newsgroups data set.
dc.language.iso: en_US
dc.publisher: CrossRef test prefix
dc.relation.isversionof: http://jmlr.csail.mit.edu/papers/v12/choi11b.html
dc.rights: Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
dc.source: MIT Press
dc.title: Learning Latent Tree Graphical Models
dc.type: Article
dc.identifier.citation: Choi, Myung Jin et al. "Learning Latent Tree Graphical Models." Journal of Machine Learning Research 12 (2011). © JMLR 2011
dc.contributor.department: Massachusetts Institute of Technology. Laboratory for Information and Decision Systems
dc.contributor.approver: Willsky, Alan S.
dc.contributor.mitauthor: Choi, Myung Jin
dc.contributor.mitauthor: Willsky, Alan S.
dc.relation.journal: Journal of Machine Learning Research
dc.eprint.version: Final published version
dc.type.uri: http://purl.org/eprint/type/JournalArticle
eprint.status: http://purl.org/eprint/status/PeerReviewed
dspace.orderedauthors: Choi, Myung Jin; Tan, Vincent Y. F.; Anandkumar, Animashree; Willsky, Alan S.
dc.identifier.orcid: https://orcid.org/0000-0003-0149-5888
mit.license: PUBLISHER_POLICY
mit.metadata.status: Complete
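The abstract describes recursive grouping as identifying sibling groups from pairwise "information distances." A minimal sketch of that idea, assuming jointly Gaussian variables (where the information distance is the negative log absolute correlation); the function names and tolerance are illustrative, not the authors' published code:

```python
import numpy as np

def information_distances(corr):
    """Information distances for Gaussian variables: d_ij = -log |rho_ij|,
    where rho_ij is the correlation between variables i and j."""
    return -np.log(np.abs(corr))

def are_siblings(d, i, j, witnesses, tol=1e-6):
    """Simplified sibling test used by recursive grouping: observed leaves
    i and j share a parent in the latent tree iff Phi(i, j; k) = d_ik - d_jk
    takes the same value for every witness node k outside {i, j}."""
    vals = [d[i, k] - d[j, k] for k in witnesses]
    return max(vals) - min(vals) < tol
```

On a hidden star (one latent parent, observed leaves with pairwise correlations rho_i * rho_j) every observed pair passes the test, while on a chain of two hidden nodes the pairs attached to different hidden parents fail it, which is what lets the algorithm recover the latent structure group by group.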

