Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks
Author(s)
Soltani, Amir Arsalan; Tenenbaum, Joshua B
Abstract
We study the problem of learning generative models of 3D shapes. Voxels or 3D parts have been widely used as the underlying representations to build complex 3D shapes; however, voxel-based representations suffer from high memory requirements, and parts-based models require a large collection of cached or richly parametrized parts. We take an alternative approach: learning a generative model over multi-view depth maps or their corresponding silhouettes, and using a deterministic rendering function to produce 3D shapes from these images. A multi-view representation of shapes enables generation of 3D models with fine details, as 2D depth maps and silhouettes can be modeled at a much higher resolution than 3D voxels. Moreover, our approach naturally brings the ability to recover the underlying 3D representation from depth maps of one or a few viewpoints. Experiments show that our framework can generate 3D shapes with variations and details. We also demonstrate that our model has out-of-sample generalization power for real-world tasks with occluded objects.
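To make the abstract's "deterministic rendering function" concrete, below is a minimal illustrative sketch (not the authors' code) of how multi-view depth maps can be deterministically converted back into a 3D occupancy grid by depth carving. It assumes six axis-aligned orthographic views and the helper names render_depth_maps and carve_from_depth_maps are hypothetical; the paper's actual viewpoint set, resolution, and fusion rule may differ.

```python
# Sketch: fuse orthographic depth maps into a voxel occupancy grid by carving
# away every voxel that some depth map places in front of the visible surface.
import numpy as np

def render_depth_maps(voxels):
    """Render orthographic depth maps of a binary voxel grid (R x R x R)
    along the six axis-aligned viewing directions."""
    R = voxels.shape[0]
    depth_maps = []
    for axis in range(3):
        for flip in (False, True):
            v = np.flip(voxels, axis=axis) if flip else voxels
            v = np.moveaxis(v, axis, 0)           # view direction -> axis 0
            hit = v.argmax(axis=0).astype(float)  # first occupied slice along the ray
            hit[~v.any(axis=0)] = R               # empty rays: background depth
            depth_maps.append(hit)
    return depth_maps                             # six (R x R) arrays

def carve_from_depth_maps(depth_maps, R):
    """Invert the rendering: remove all voxels that any view says lie in
    free space (closer to the camera than the recorded surface depth)."""
    occ = np.ones((R, R, R), dtype=bool)
    i = 0
    for axis in range(3):
        for flip in (False, True):
            d = depth_maps[i]; i += 1
            idx = np.arange(R).reshape([-1, 1, 1])  # voxel depth along this view
            free = idx < d[None, :, :]              # strictly in front of the surface
            free = np.moveaxis(free, 0, axis)
            if flip:
                free = np.flip(free, axis=axis)
            occ &= ~free
    return occ

if __name__ == "__main__":
    R = 32
    gt = np.zeros((R, R, R), dtype=bool)
    gt[8:24, 8:24, 8:24] = True                   # toy cube "shape"
    maps = render_depth_maps(gt)
    recon = carve_from_depth_maps(maps, R)
    print("recovered voxels:", recon.sum(), "ground truth:", gt.sum())
```

In this toy setting the carved grid recovers the cube exactly; for general shapes the fused result is the visual-hull-style approximation consistent with the given views, which is why modeling the depth maps at high 2D resolution translates into detailed 3D output.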
Date issued
2017-07
Department
Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences
Journal
2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Citation
Soltani, Amir Arsalan, et al. “Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks.” Paper presented at the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, 21-26 July 2017. IEEE. © 2017 The Author(s).
Version: Author's final manuscript
ISSN
1063-6919