Comparing state-of-the-art visual features on invariant object recognition tasks

Pinto, Nicolas; Barhomi, Youssef; Cox, David D.; DiCarlo, James

Author(s)

Pinto, Nicolas; Barhomi, Youssef; Cox, David D.; DiCarlo, James

DownloadPinto et al_IEEE2011_WACV.pdf (1.615Mb)

OPEN_ACCESS_POLICY

Terms of use

Creative Commons Attribution-Noncommercial-Share Alike 3.0 http://creativecommons.org/licenses/by-nc-sa/3.0/

Metadata

Show full item record

Abstract

Tolerance (“invariance”) to identity-preserving image variation (e.g. variation in position, scale, pose, illumination) is a fundamental problem that any visual object recognition system, biological or engineered, must solve. While standard natural image database benchmarks are useful for guiding progress in computer vision, they can fail to probe the ability of a recognition system to solve the invariance problem. Thus, to understand which computational approaches are making progress on solving the invariance problem, we compared and contrasted a variety of state-of-the-art visual representations using synthetic recognition tasks designed to systematically probe invariance. We successfully re-implemented a variety of state-of-the-art visual representations and confirmed their published performance on a natural image benchmark. We here report that most of these representations perform poorly on invariant recognition, but that one representation shows significant performance gains over two baseline representations. We also show how this approach can more deeply illuminate the strengths and weaknesses of different visual representations and thus guide progress on invariant object recognition.

Date issued

2011-01

URI

http://hdl.handle.net/1721.1/72169

Department

Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences; McGovern Institute for Brain Research at MIT

Journal

Proceedings of the 2011 IEEE Workshop on Applications of Computer Vision (WACV)

Publisher

Institute of Electrical and Electronics Engineers

Citation

Pinto, Nicolas et al. “Comparing State-of-the-art Visual Features on Invariant Object Recognition Tasks.” Proceedings of the 2011 IEEE Workshop on Applications of Computer Vision (WACV), 5-7 Jan. 2011, Kona, HI, USA, IEEE, 2011. 463–470. Web.

Version: Author's final manuscript

Other identifiers

INSPEC Accession Number: 11823532

ISBN

978-1-4244-9496-5

ISSN

1550-5790

Collections

MIT Open Access Articles

DSpace@MIT