Estimating scene typicality from human ratings and image features
Author(s)
Ehinger, Krista A.; Xiao, Jianxiong; Torralba, Antonio; Oliva, Aude
DownloadOliva-2011-Estimating scene typicality.pdf (969.3Kb)
OPEN_ACCESS_POLICY
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
Scenes, like objects, are visual entities that can be categorized into functional and semantic groups. One of the core concepts of human categorization is the idea that category membership is graded: some exemplars are more typical than others. Here, we obtain human typicality rankings for more than 120,000 images from 706 scene categories through an online rating task on Amazon Mechanical Turk. We use these rankings to identify the most typical examples of each scene category. Using computational models of scene classification based on global image features, we find that images which are rated as more typical examples of their category are more likely to be classified correctly. This indicates that the most typical scene examples contain the diagnostic visual features that are relevant for their categorization. Objectless, holistic representations of scenes might serve as a good basis for understanding how semantic categories are defined in term of perceptual representations.
Date issued
2011-07Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences; Massachusetts Institute of Technology. Department of Civil and Environmental Engineering; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer ScienceJournal
Proceedings of the 33rd Annual Cognitive Science Conference, COGSCI 2011, Boston, Massachusetts, Wednesday, July 20 - Saturday July 23, 2011
Publisher
Cognitive Science Society, Inc.
Citation
Ehinger, Krista A. et al. "Estimating scene typicality from human ratings and image features." in Proceedings of the 33rd Annual Cognitive Science Conference, COGSCI 2011.
Version: Author's final manuscript
ISBN
978-0-9768318-7-7