Show simple item record

dc.contributor.authorKulkarni, Tejas Dattatraya
dc.contributor.authorTenenbaum, Joshua B.
dc.contributor.authorMansinghka, Vikash K.
dc.contributor.authorKohli, Pushmeet
dc.date.accessioned2015-04-15T15:37:50Z
dc.date.available2015-04-15T15:37:50Z
dc.date.issued2015
dc.identifier.isbn978-1-4244-3992-8
dc.identifier.issn1063-6919
dc.identifier.issn2332-564X
dc.identifier.urihttp://hdl.handle.net/1721.1/96620
dc.description.abstractRecent progress on probabilistic modeling and statistical learning, coupled with the availability of large training datasets, has led to remarkable progress in computer vision. Generative probabilistic models, or “analysis-by-synthesis” approaches, can capture rich scene structure but have been less widely applied than their discriminative counterparts, as they often require considerable problem-specific engineering in modeling and inference, and inference is typically seen as requiring slow, hypothesize-and-test Monte Carlo methods. Here we present Picture, a probabilistic programming language for scene understanding that allows researchers to express complex generative vision models, while automatically solving them using fast general-purpose inference machinery. Picture provides a stochastic scene language that can express generative models for arbitrary 2D/3D scenes, as well as a hierarchy of representation layers for comparing scene hypotheses with observed images by matching not simply pixels, but also more abstract features (e.g., contours, deep neural network activations). Inference can flexibly integrate advanced Monte Carlo strategies with fast bottom-up data-driven methods. Thus both representations and inference strategies can build directly on progress in discriminatively trained systems to make generative vision more robust and efficient. We use Picture to write programs for 3D face analysis, 3D human pose estimation, and 3D object reconstruction – each competitive with specially engineered baselines.en_US
dc.description.sponsorshipNorman B. Leventhal Fellowshipen_US
dc.description.sponsorshipUnited States. Office of Naval Research (Award N000141310333)en_US
dc.description.sponsorshipUnited States. Army Research Office. Multidisciplinary University Research Initiative (W911NF-13-1-2012)en_US
dc.description.sponsorshipNational Science Foundation (U.S.). Science and Technology Centers (Center for Brains, Minds and Machines. Award CCF-1231216)en_US
dc.language.isoen_US
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)en_US
dc.relation.isversionofhttp://www.cv-foundation.org/openaccess/CVPR2015.pyen_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceKulkarni, Tejas Dattatrayaen_US
dc.titlePicture: A Probabilistic Programming Language for Scene Perceptionen_US
dc.typeArticleen_US
dc.identifier.citationKulkarni, Tejas D., Pushmeet Kohli, Joshua B. Tenenbaum, Vikash Mansinghka. "Picture: A Probabilistic Programming Language for Scene Perception." Forthcoming in the proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Hynes Convention Center, Boston, MA, June 7-12, 2015.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Brain and Cognitive Sciencesen_US
dc.contributor.approverKulkarni, Tejas Dattatrayaen_US
dc.contributor.mitauthorKulkarni, Tejas Dattatrayaen_US
dc.contributor.mitauthorTenenbaum, Joshua B.en_US
dc.contributor.mitauthorMansinghka, Vikash K.en_US
dc.relation.journalProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsKulkarni, Tejas Dattatraya; Kohli, Pushmeet ; Tenenbaum, Joshua B.; Mansinghka, Vikash Kumaren_US
dc.identifier.orcidhttps://orcid.org/0000-0002-7077-2765
dc.identifier.orcidhttps://orcid.org/0000-0002-1925-2035
dspace.mitauthor.errortrue
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record