dc.contributor.author | Kulkarni, Tejas Dattatraya | |
dc.contributor.author | Tenenbaum, Joshua B. | |
dc.contributor.author | Mansinghka, Vikash K. | |
dc.contributor.author | Kohli, Pushmeet | |
dc.date.accessioned | 2015-04-15T15:37:50Z | |
dc.date.available | 2015-04-15T15:37:50Z | |
dc.date.issued | 2015 | |
dc.identifier.isbn | 978-1-4244-3992-8 | |
dc.identifier.issn | 1063-6919 | |
dc.identifier.issn | 2332-564X | |
dc.identifier.uri | http://hdl.handle.net/1721.1/96620 | |
dc.description.abstract | Recent progress on probabilistic modeling and statistical learning, coupled with the availability of large training datasets, has led to remarkable progress in computer vision. Generative probabilistic models, or “analysis-by-synthesis” approaches, can capture rich scene structure but have been less widely applied than their discriminative counterparts, as they often require considerable problem-specific engineering in modeling and inference, and inference is typically seen as requiring slow, hypothesize-and-test Monte Carlo methods. Here we present Picture, a probabilistic programming language for scene understanding that allows researchers to express complex generative vision models, while automatically solving them using fast general-purpose inference machinery. Picture provides a stochastic scene language that can express generative models for arbitrary 2D/3D scenes, as well as a hierarchy of representation layers for comparing scene hypotheses with observed images by matching not simply pixels, but also more abstract features (e.g., contours, deep neural network activations). Inference can flexibly integrate advanced Monte Carlo strategies with fast bottom-up data-driven methods. Thus both representations and inference strategies can build directly on progress in discriminatively trained systems to make generative vision more robust and efficient. We use Picture to write programs for 3D face analysis, 3D human pose estimation, and 3D object reconstruction – each competitive with specially engineered baselines. | en_US |
dc.description.sponsorship | Norman B. Leventhal Fellowship | en_US |
dc.description.sponsorship | United States. Office of Naval Research (Award N000141310333) | en_US |
dc.description.sponsorship | United States. Army Research Office. Multidisciplinary University Research Initiative (W911NF-13-1-2012) | en_US |
dc.description.sponsorship | National Science Foundation (U.S.). Science and Technology Centers (Center for Brains, Minds and Machines. Award CCF-1231216) | en_US |
dc.language.iso | en_US | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.relation.isversionof | http://www.cv-foundation.org/openaccess/CVPR2015.py | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | Kulkarni, Tejas Dattatraya | en_US |
dc.title | Picture: A Probabilistic Programming Language for Scene Perception | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Kulkarni, Tejas D., Pushmeet Kohli, Joshua B. Tenenbaum, Vikash Mansinghka. "Picture: A Probabilistic Programming Language for Scene Perception." Forthcoming in the proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Hynes Convention Center, Boston, MA, June 7-12, 2015. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences | en_US |
dc.contributor.approver | Kulkarni, Tejas Dattatraya | en_US |
dc.contributor.mitauthor | Kulkarni, Tejas Dattatraya | en_US |
dc.contributor.mitauthor | Tenenbaum, Joshua B. | en_US |
dc.contributor.mitauthor | Mansinghka, Vikash K. | en_US |
dc.relation.journal | Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dspace.orderedauthors | Kulkarni, Tejas Dattatraya; Kohli, Pushmeet ; Tenenbaum, Joshua B.; Mansinghka, Vikash Kumar | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-7077-2765 | |
dc.identifier.orcid | https://orcid.org/0000-0002-1925-2035 | |
dspace.mitauthor.error | true | |
mit.license | OPEN_ACCESS_POLICY | en_US |
mit.metadata.status | Complete | |