Unsupervised discovery of parts, structure, and dynamics
Author(s)
Xu, Zhenjia; Liu, Zhijian; Sun, Chen; Murphy, Kevin; Freeman, William T; Tenenbaum, Joshua B; Wu, Jiajun; ... Show more Show less
DownloadAccepted version (4.913Mb)
Terms of use
Metadata
Show full item recordAbstract
Humans easily recognize object parts and their hierarchical structure by watching how they move; they can then predict how each part moves in the future. In this paper, we propose a novel formulation that simultaneously learns a hierarchical, disentangled object representation and a dynamics model for object parts from unlabeled videos. Our Parts, Structure, and Dynamics (PSD) model learns to, first, recognize the object parts via a layered image representation; second, predict hierarchy via a structural descriptor that composes low-level concepts into a hierarchical structure; and third, model the system dynamics by predicting the future. Experiments on multiple real and synthetic datasets demonstrate that our PSD model works well on all three tasks: segmenting object parts, building their hierarchical structure, and capturing their motion distributions.
Date issued
2019-05Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence LaboratoryJournal
ICLR: International Conference on Learning Representations
Citation
Xu, Zhenjia et al. "Unsupervised discovery of parts, structure, and dynamics." ICLR 2019: 7th International Conference on Learning Representations, May 6-9, 2019, New Orleans, Louisiana: https://openreview.net/forum?id=rJe10iC5K7 ©2019 Author(s)
Version: Author's final manuscript