MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Open Access Articles
  • MIT Open Access Articles
  • View Item
  • DSpace@MIT Home
  • MIT Open Access Articles
  • MIT Open Access Articles
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Learning to see physics via visual de-animation

Author(s)
Wu, Jiajun; Lu, Erika; Kohli, Pushmeet; Freeman, William T; Tenenbaum, Joshua B
Thumbnail
DownloadPublished version (1.111Mb)
Terms of use
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Metadata
Show full item record
Abstract
We introduce a paradigm for understanding physical scenes without human annotations. At the core of our system is a physical world representation that is first recovered by a perception module and then utilized by physics and graphics engines. During training, the perception module and the generative models learn by visual de-animation - interpreting and reconstructing the visual information stream. During testing, the system first recovers the physical world state, and then uses the generative models for reasoning and future prediction. Even more so than forward simulation, inverting a physics or graphics engine is a computationally hard problem; we overcome this challenge by using a convolutional inversion network. Our system quickly recognizes the physical world state from appearance and motion cues, and has the flexibility to incorporate both differentiable and non-differentiable physics and graphics engines. We evaluate our system on both synthetic and real datasets involving multiple physical scenes, and demonstrate that our system performs well on both physical state estimation and reasoning problems. We further show that the knowledge learned on the synthetic dataset generalizes to constrained real images.
Date issued
2017-12
URI
https://hdl.handle.net/1721.1/129728
Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Journal
Advances in Neural Information Processing Systems 30 (NIPS 2017)
Publisher
Neural Information Processing Systems Foundation, Inc
Citation
Wu, Jiajun et al. "Learning to see physics via visual de-animation." Advances in Neural Information Processing Systems 30 (NIPS 2017), December 2017, Long Beach, California, Neural Information Processing Systems Foundation, Inc, December 2017. © 2017 Neural Information Processing Systems Foundation, Inc.
Version: Final published version

Collections
  • MIT Open Access Articles

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.