Learning invariant representations and applications to face verification

Liao, Qianli; Leibo, Joel Z.; Poggio, Tomaso.

dc.contributor.author	Liao, Qianli
dc.contributor.author	Leibo, Joel Z.
dc.contributor.author	Poggio, Tomaso A.
dc.date.accessioned	2014-12-16T15:01:38Z
dc.date.available	2014-12-16T15:01:38Z
dc.date.issued	2013
dc.identifier.issn	1049-5258
dc.identifier.uri	http://hdl.handle.net/1721.1/92318
dc.description.abstract	One approach to computer object recognition and modeling the brain's ventral stream involves unsupervised learning of representations that are invariant to common transformations. However, applications of these ideas have usually been limited to 2D affine transformations, e.g., translation and scaling, since they are easiest to solve via convolution. In accord with a recent theory of transformation-invariance, we propose a model that, while capturing other common convolutional networks as special cases, can also be used with arbitrary identity-preserving transformations. The model's wiring can be learned from videos of transforming objects---or any other grouping of images into sets by their depicted object. Through a series of successively more complex empirical tests, we study the invariance/discriminability properties of this model with respect to different transformations. First, we empirically confirm theoretical predictions for the case of 2D affine transformations. Next, we apply the model to non-affine transformations: as expected, it performs well on face verification tasks requiring invariance to the relatively smooth transformations of 3D rotation-in-depth and changes in illumination direction. Surprisingly, it can also tolerate clutter transformations'' which map an image of a face on one background to an image of the same face on a different background. Motivated by these empirical findings, we tested the same model on face verification benchmark tasks from the computer vision literature: Labeled Faces in the Wild, PubFig and a new dataset we gathered---achieving strong performance in these highly unconstrained cases as well."	en_US
dc.language.iso	en_US
dc.publisher	Neural Information Processing Systems Foundation	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	MIT Web Domain	en_US
dc.title	Learning invariant representations and applications to face verification	en_US
dc.type	Article	en_US
dc.identifier.citation	Liao, Qianli, Joel Z. Leibo, and Tomaso Poggio. "Learning invariant representations and applications to face verification." Advances in Neural Information Processing Systems 26 (NIPS 2013).	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.department	McGovern Institute for Brain Research at MIT	en_US
dc.contributor.mitauthor	Liao, Qianli	en_US
dc.contributor.mitauthor	Leibo, Joel Z.	en_US
dc.contributor.mitauthor	Poggio, Tomaso A.	en_US
dc.relation.journal	Advances in Neural Information Processing Systems (NIPS)	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Liao, Qianli; Leibo, Joel Z.; Poggio, Tomaso.	en_US
dc.identifier.orcid	https://orcid.org/0000-0002-3153-916X
dc.identifier.orcid	https://orcid.org/0000-0002-3944-0455
dc.identifier.orcid	https://orcid.org/0000-0003-0076-621X
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Poggio_Learning invariant ...
Size:: 1.607Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record