dc.contributor.author | Xu, Zhenjia | |
dc.contributor.author | Liu, Zhijian | |
dc.contributor.author | Sun, Chen | |
dc.contributor.author | Murphy, Kevin | |
dc.contributor.author | Freeman, William T | |
dc.contributor.author | Tenenbaum, Joshua B | |
dc.contributor.author | Wu, Jiajun | |
dc.date.accessioned | 2020-08-18T20:14:02Z | |
dc.date.available | 2020-08-18T20:14:02Z | |
dc.date.issued | 2019-05 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/126655 | |
dc.description.abstract | Humans easily recognize object parts and their hierarchical structure by watching how they move; they can then predict how each part moves in the future. In this paper, we propose a novel formulation that simultaneously learns a hierarchical, disentangled object representation and a dynamics model for object parts from unlabeled videos. Our Parts, Structure, and Dynamics (PSD) model learns to, first, recognize the object parts via a layered image representation; second, predict hierarchy via a structural descriptor that composes low-level concepts into a hierarchical structure; and third, model the system dynamics by predicting the future. Experiments on multiple real and synthetic datasets demonstrate that our PSD model works well on all three tasks: segmenting object parts, building their hierarchical structure, and capturing their motion distributions. | en_US |
dc.language.iso | en | |
dc.relation.isversionof | https://openreview.net/forum?id=rJe10iC5K7 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | arXiv | en_US |
dc.title | Unsupervised discovery of parts, structure, and dynamics | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Xu, Zhenjia et al. "Unsupervised discovery of parts, structure, and dynamics." ICLR 2019: 7th International Conference on Learning Representations, May 6-9, 2019, New Orleans, Louisiana: https://openreview.net/forum?id=rJe10iC5K7 ©2019 Author(s) | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
dc.relation.journal | ICLR: International Conference on Learning Representations | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2019-10-08T16:09:41Z | |
dspace.date.submission | 2019-10-08T16:09:45Z | |
mit.journal.volume | 7 | en_US |
mit.metadata.status | Complete | |