dc.contributor.author | Salakhutdinov, R. | |
dc.contributor.author | Tenenbaum, Joshua B. | |
dc.contributor.author | Torralba, Antonio | |
dc.date.accessioned | 2014-10-15T18:35:14Z | |
dc.date.available | 2014-10-15T18:35:14Z | |
dc.date.issued | 2013-08 | |
dc.date.submitted | 2012-08 | |
dc.identifier.issn | 0162-8828 | |
dc.identifier.issn | 2160-9292 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/90947 | |
dc.description.abstract | We introduce HD (or “Hierarchical-Deep”) models, a new compositional learning architecture that integrates deep learning models with structured hierarchical Bayesian (HB) models. Specifically, we show how we can learn a hierarchical Dirichlet process (HDP) prior over the activities of the top-level features in a deep Boltzmann machine (DBM). This compound HDP-DBM model learns to learn novel concepts from very few training example by learning low-level generic features, high-level features that capture correlations among low-level features, and a category hierarchy for sharing priors over the high-level features that are typical of different kinds of concepts. We present efficient learning and inference algorithms for the HDP-DBM model and show that it is able to learn new concepts from very few examples on CIFAR-100 object recognition, handwritten character recognition, and human motion capture datasets. | en_US |
dc.description.sponsorship | QUALCOMM Inc. | en_US |
dc.description.sponsorship | Natural Sciences and Engineering Research Council of Canada | en_US |
dc.description.sponsorship | United States. Office of Naval Research (ONR MURI Grant 1015GNA126) | en_US |
dc.description.sponsorship | United States. Office of Naval Research (ONR N00014-07-1-0937) | en_US |
dc.description.sponsorship | United States. Army Research Office (ARO grant W911NF-08-1-0242) | en_US |
dc.language.iso | en_US | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1109/TPAMI.2012.269 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | MIT web domain | en_US |
dc.title | Learning with Hierarchical-Deep Models | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Salakhutdinov, R., J. B. Tenenbaum, and A. Torralba. “Learning with Hierarchical-Deep Models.” IEEE Trans. Pattern Anal. Mach. Intell. 35, no. 8 (August 2013): 1958–1971. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.contributor.mitauthor | Tenenbaum, Joshua B. | en_US |
dc.contributor.mitauthor | Torralba, Antonio | en_US |
dc.relation.journal | IEEE Transactions on Pattern Analysis and Machine Intelligence | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
dspace.orderedauthors | Salakhutdinov, R.; Tenenbaum, J. B.; Torralba, A. | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-1925-2035 | |
dc.identifier.orcid | https://orcid.org/0000-0003-4915-0256 | |
mit.license | OPEN_ACCESS_POLICY | en_US |
mit.metadata.status | Complete | |