Visual tasks beyond categorization for training convolutional neural networks
Author(s)
Lee, Hyo-Dong
DownloadFull printable version (1.982Mb)
Other Contributors
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.
Advisor
James J. DiCarlo.
Terms of use
Metadata
Show full item recordAbstract
Humans can perceive a variety of visual properties of objects besides their category. In this paper, we explore- whether convolutional neural networks (CNNs) can also learn object-related variables. The models are trained for object position, size and pose, respectively, from synthetic images and tested on unseen held-out objects. First, we show that some object properties come "for free" from learning others, and pose-optimized model can generalize to both categorical and non-categorical variables. Second, we demonstrate that pre-training the model with pose facilitates learning object categories from both synthetic and realistic images.
Description
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2016. Cataloged from PDF version of thesis. Includes bibliographical references (pages 21-23).
Date issued
2016Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer SciencePublisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.