Representation Learning in Sensory Cortex: a theory

Anselmi, Fabio; Poggio, Tomaso

dc.contributor.author	Anselmi, Fabio
dc.contributor.author	Poggio, Tomaso
dc.date.accessioned	2015-12-11T20:54:04Z
dc.date.available	2015-12-11T20:54:04Z
dc.date.issued	2014-11-14
dc.identifier.uri	http://hdl.handle.net/1721.1/100191
dc.description.abstract	We review and apply a computational theory of the feedforward path of the ventral stream in visual cortex based on the hypothesis that its main function is the encoding of invariant representations of images. A key justification of the theory is provided by a theorem linking invariant representations to small sample complexity for recognition – that is, invariant representations allows learning from very few labeled examples. The theory characterizes how an algorithm that can be implemented by a set of ”simple” and ”complex” cells – a ”HW module” – provides invariant and selective representations. The invariance can be learned in an unsupervised way from observed transformations. Theorems show that invariance implies several properties of the ventral stream organization, including the eccentricity dependent lattice of units in the retina and in V1, and the tuning of its neurons. The theory requires two stages of processing: the first, consisting of retinotopic visual areas such as V1, V2 and V4 with generic neuronal tuning, leads to representations that are invariant to translation and scaling; the second, consisting of modules in IT, with class- and object-specific tuning, provides a representation for recognition with approximate invariance to class specific transformations, such as pose (of a body, of a face) and expression. In the theory the ventral stream main function is the unsupervised learning of ”good” representations that reduce the sample complexity of the final supervised learning stage.	en_US
dc.description.sponsorship	This work was supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF - 1231216.	en_US
dc.language.iso	en_US	en_US
dc.publisher	Center for Brains, Minds and Machines (CBMM)	en_US
dc.relation.ispartofseries	CBMM Memo Series;026
dc.rights	Attribution-NonCommercial 3.0 United States	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc/3.0/us/	*
dc.subject	Computational Theory	en_US
dc.subject	Ventral Visual Stream	en_US
dc.subject	Invariance	en_US
dc.subject	Computer vision	en_US
dc.title	Representation Learning in Sensory Cortex: a theory	en_US
dc.type	Technical Report	en_US
dc.type	Working Paper	en_US
dc.type	Other	en_US

Files in this item

Name:: CBMM-Memo-026.pdf
Size:: 1.353Mb
Format:: PDF

View/Open

Name:: license_rdf
Size:: 1.346Kb
Format:: application/rdf+xml

View/Open

This item appears in the following Collection(s)

CBMM Memo Series

Show simple item record