Show simple item record

dc.contributor.author: Wu, Jiajun
dc.contributor.author: Lu, Erika
dc.contributor.author: Kohli, Pushmeet
dc.contributor.author: Freeman, William T
dc.contributor.author: Tenenbaum, Joshua B
dc.date.accessioned: 2021-02-09T21:08:02Z
dc.date.available: 2021-02-09T21:08:02Z
dc.date.issued: 2017-12
dc.identifier.uri: https://hdl.handle.net/1721.1/129728
dc.description.abstract: We introduce a paradigm for understanding physical scenes without human annotations. At the core of our system is a physical world representation that is first recovered by a perception module and then utilized by physics and graphics engines. During training, the perception module and the generative models learn by visual de-animation - interpreting and reconstructing the visual information stream. During testing, the system first recovers the physical world state, and then uses the generative models for reasoning and future prediction. Even more so than forward simulation, inverting a physics or graphics engine is a computationally hard problem; we overcome this challenge by using a convolutional inversion network. Our system quickly recognizes the physical world state from appearance and motion cues, and has the flexibility to incorporate both differentiable and non-differentiable physics and graphics engines. We evaluate our system on both synthetic and real datasets involving multiple physical scenes, and demonstrate that our system performs well on both physical state estimation and reasoning problems. We further show that the knowledge learned on the synthetic dataset generalizes to constrained real images.
dc.description.sponsorship: NSF (Grants 1212849, 1447476, 1231216)
dc.description.sponsorship: ONR MURI (Grant N00014-16-1-2007)
dc.language.iso: en
dc.publisher: Neural Information Processing Systems Foundation, Inc
dc.relation.isversionof: https://papers.nips.cc/paper/6620-learning-to-see-physics-via-visual-de-animation
dc.rights: Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
dc.source: Neural Information Processing Systems (NIPS)
dc.title: Learning to see physics via visual de-animation
dc.type: Article
dc.identifier.citation: Wu, Jiajun et al. "Learning to see physics via visual de-animation." Advances in Neural Information Processing Systems 30 (NIPS 2017), December 2017, Long Beach, California, Neural Information Processing Systems Foundation, Inc, December 2017. © 2017 Neural Information Processing Systems Foundation, Inc.
dc.contributor.department: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.relation.journal: Advances in Neural Information Processing Systems 30 (NIPS 2017)
dc.eprint.version: Final published version
dc.type.uri: http://purl.org/eprint/type/ConferencePaper
eprint.status: http://purl.org/eprint/status/NonPeerReviewed
dc.date.updated: 2019-05-28T12:55:56Z
dspace.date.submission: 2019-05-28T12:55:57Z
mit.metadata.status: Complete
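The abstract describes a perceive-simulate-render loop: a perception module recovers a physical state from frames, a physics engine rolls it forward, and a graphics engine re-renders it so the stream can be reconstructed without labels. A minimal toy sketch of that loop, under loud assumptions: the function names (`perceive`, `simulate`, `render`), the 1-D "scene", and the hand-written finite-difference estimator are all illustrative inventions, not the paper's convolutional inversion network or its actual engines.

```python
# Toy visual de-animation loop (illustrative only; not the paper's code).
# A "frame" here is just an object's observed x-position.

def perceive(frame_t0, frame_t1, dt=1.0):
    """Recover a physical state (position, velocity) from two frames."""
    velocity = (frame_t1 - frame_t0) / dt
    return {"x": frame_t1, "v": velocity}

def simulate(state, dt=1.0):
    """Physics engine: advance the state one step (constant velocity)."""
    return {"x": state["x"] + state["v"] * dt, "v": state["v"]}

def render(state):
    """Graphics engine: project the physical state back to 'pixels'."""
    return state["x"]

# Training signal: reconstruct the next observed frame from the recovered
# state, so only the video stream itself supervises the system.
frames = [0.0, 1.0, 2.0, 3.0]          # object moving at constant speed
state = perceive(frames[0], frames[1])  # recover the physical world state
prediction = render(simulate(state))    # de-animate: simulate + re-render
reconstruction_error = abs(prediction - frames[2])
print(reconstruction_error)             # prints 0.0 for this noiseless toy
```

At test time the same recovered state feeds forward prediction (more `simulate` steps); the paper's contribution is learning the `perceive` step, since inverting physics and graphics engines is the computationally hard part.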

