
Comparing state-of-the-art visual features on invariant object recognition tasks

Author(s)
Pinto, Nicolas; Barhomi, Youssef; Cox, David D.; DiCarlo, James
Terms of use
Creative Commons Attribution-Noncommercial-Share Alike 3.0 http://creativecommons.org/licenses/by-nc-sa/3.0/
Abstract
Tolerance (“invariance”) to identity-preserving image variation (e.g., variation in position, scale, pose, and illumination) is a fundamental problem that any visual object recognition system, biological or engineered, must solve. While standard natural image database benchmarks are useful for guiding progress in computer vision, they can fail to probe the ability of a recognition system to solve the invariance problem. Thus, to understand which computational approaches are making progress on the invariance problem, we compared a variety of state-of-the-art visual representations using synthetic recognition tasks designed to systematically probe invariance. We successfully re-implemented these representations and confirmed their published performance on a natural image benchmark. Here we report that most of these representations perform poorly on invariant recognition, but that one representation shows significant performance gains over two baseline representations. We also show how this approach can more deeply illuminate the strengths and weaknesses of different visual representations and thus guide progress on invariant object recognition.
Date issued
2011-01
URI
http://hdl.handle.net/1721.1/72169
Department
Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences; McGovern Institute for Brain Research at MIT
Journal
Proceedings of the 2011 IEEE Workshop on Applications of Computer Vision (WACV)
Publisher
Institute of Electrical and Electronics Engineers
Citation
Pinto, Nicolas et al. “Comparing State-of-the-art Visual Features on Invariant Object Recognition Tasks.” Proceedings of the 2011 IEEE Workshop on Applications of Computer Vision (WACV), 5-7 Jan. 2011, Kona, HI, USA, IEEE, 2011. 463–470. Web.
Version: Author's final manuscript
Other identifiers
INSPEC Accession Number: 11823532
ISBN
978-1-4244-9496-5
ISSN
1550-5790

Collections
  • MIT Open Access Articles
