Show simple item record

dc.contributor.authorDu, Yilun
dc.contributor.authorLiu, Zhijian
dc.contributor.authorBasevi, Hector
dc.contributor.authorLeonardis, Aleš
dc.contributor.authorFreenman, William T.
dc.contributor.authorTenenbaum, Joshua B.
dc.contributor.authorWu, Jiajun
dc.date.accessioned2021-11-05T14:03:59Z
dc.date.available2021-11-05T14:03:59Z
dc.date.issued2018
dc.identifier.urihttps://hdl.handle.net/1721.1/137464
dc.description.abstract© 2018 Curran Associates Inc.All rights reserved. Human scene understanding uses a variety of visual and non-visual cues to perform inference on object types, poses, and relations. Physics is a rich and universal cue that we exploit to enhance scene understanding. In this paper, we integrate the physical cue of stability into the learning process by looping in a physics engine into bottom-up recognition models, and apply it to the problem of 3D scene parsing. We first show that applying physics supervision to an existing scene understanding model increases performance, produces more stable predictions, and allows training to an equivalent performance level with fewer annotated training examples. We then present a novel architecture for 3D scene parsing named Prim R-CNN, learning to predict bounding boxes as well as their 3D size, translation, and rotation. With physics supervision, Prim R-CNN outperforms existing scene understanding approaches on this problem. Finally, we show that finetuning with physics supervision on unlabeled real images improves real domain transfer of models training on synthetic data.en_US
dc.language.isoen
dc.relation.isversionofhttps://papers.nips.cc/paper/7444-learning-to-exploit-stability-for-3d-scene-parsingen_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceNeural Information Processing Systems (NIPS)en_US
dc.titleLearning to Exploit Stability for 3D Scene Parsingen_US
dc.typeArticleen_US
dc.identifier.citationDu, Yilun, Liu, Zhijian, Basevi, Hector, Leonardis, Aleš, Freenman, William T. et al. 2018. "Learning to Exploit Stability for 3D Scene Parsing."
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2019-05-28T12:38:26Z
dspace.date.submission2019-05-28T12:38:27Z
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record