Advanced Search
DSpace@MIT

Realistic Modeling of Simple and Complex Cell Tuning in the HMAXModel, and Implications for Invariant Object Recognition in Cortex

Research and Teaching Output of the MIT Community

Show simple item record

dc.contributor.author Serre, Thomas
dc.contributor.author Riesenhuber, Maximilian
dc.date.accessioned 2005-12-22T01:36:22Z
dc.date.available 2005-12-22T01:36:22Z
dc.date.issued 2004-07-27
dc.identifier.other MIT-CSAIL-TR-2004-052
dc.identifier.other AIM-2004-017
dc.identifier.other CBCL-239
dc.identifier.uri http://hdl.handle.net/1721.1/30491
dc.description.abstract Riesenhuber \& Poggio recently proposed a model of object recognitionin cortex which, beyond integrating general beliefs about the visualsystem in a quantitative framework, made testable predictions aboutvisual processing. In particular, they showed that invariant objectrepresentation could be obtained with a selective pooling mechanismover properly chosen afferents through a {\sc max} operation: Forinstance, at the complex cells level, pooling over a group of simplecells at the same preferred orientation and position in space but atslightly different spatial frequency would provide scale tolerance,while pooling over a group of simple cells at the same preferredorientation and spatial frequency but at slightly different positionin space would provide position tolerance. Indirect support for suchmechanisms in the visual system come from the ability of thearchitecture at the top level to replicate shape tuning as well asshift and size invariance properties of ``view-tuned cells'' (VTUs)found in inferotemporal cortex (IT), the highest area in the ventralvisual stream, thought to be crucial in mediating object recognitionin cortex. There is also now good physiological evidence that a {\scmax} operation is performed at various levels along the ventralstream. However, in the original paper by Riesenhuber \& Poggio,tuning and pooling parameters of model units in early and intermediateareas were only qualitatively inspired by physiological data. Inparticular, many studies have investigated the tuning properties ofsimple and complex cells in primary visual cortex, V1. We show thatunits in the early levels of HMAX can be tuned to produce realisticsimple and complex cell-like tuning, and that the earlier findings onthe invariance properties of model VTUs still hold in this morerealistic version of the model.
dc.format.extent 11 p.
dc.format.extent 24089158 bytes
dc.format.extent 2715073 bytes
dc.format.mimetype application/postscript
dc.format.mimetype application/pdf
dc.language.iso en_US
dc.relation.ispartofseries Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory
dc.subject AI
dc.subject object recognition
dc.subject simple cell
dc.subject complex cell
dc.subject hmax
dc.subject V1
dc.subject IT
dc.subject view-tuned unit
dc.subject in
dc.title Realistic Modeling of Simple and Complex Cell Tuning in the HMAXModel, and Implications for Invariant Object Recognition in Cortex


Files in this item

Name Size Format Description
MIT-CSAIL-TR-2004 ... 22.97Mb Postscript

Files in this item

Name Size Format Description
MIT-CSAIL-TR-2004 ... 2.589Mb PDF

This item appears in the following Collection(s)

Show simple item record

MIT-Mirage