Group Norm for Learning Structured SVMs with Unstructured Latent Variables

Chen, Daozheng; Batra, Dhruv; Freeman, William T.

dc.contributor.author	Chen, Daozheng
dc.contributor.author	Batra, Dhruv
dc.contributor.author	Freeman, William T.
dc.date.accessioned	2015-11-24T19:39:24Z
dc.date.available	2015-11-24T19:39:24Z
dc.date.issued	2013-12
dc.identifier.isbn	978-1-4799-2840-8
dc.identifier.issn	1550-5499
dc.identifier.uri	http://hdl.handle.net/1721.1/100043
dc.description.abstract	Latent variables models have been applied to a number of computer vision problems. However, the complexity of the latent space is typically left as a free design choice. A larger latent space results in a more expressive model, but such models are prone to over fitting and are slower to perform inference with. The goal of this paper is to regularize the complexity of the latent space and learn which hidden states are really relevant for prediction. Specifically, we propose using group-sparsity-inducing regularizers such as ℓ[subscript 1]-ℓ[subscript 2] to estimate the parameters of Structured SVMs with unstructured latent variables. Our experiments on digit recognition and object detection show that our approach is indeed able to control the complexity of latent space without any significant loss in accuracy of the learnt model.	en_US
dc.description.sponsorship	Quanta Computer (Firm)	en_US
dc.description.sponsorship	Google (Firm)	en_US
dc.language.iso	en_US
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1109/ICCV.2013.58	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	Other univ. web domain	en_US
dc.title	Group Norm for Learning Structured SVMs with Unstructured Latent Variables	en_US
dc.type	Article	en_US
dc.identifier.citation	Chen, Daozheng, Dhruv Batra, and William T. Freeman. “Group Norm for Learning Structured SVMs with Unstructured Latent Variables.” 2013 IEEE International Conference on Computer Vision (December 2013).	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.mitauthor	Freeman, William T.	en_US
dc.relation.journal	Proceedings of the 2013 IEEE International Conference on Computer Vision	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Chen, Daozheng; Batra, Dhruv; Freeman, William T.	en_US
dc.identifier.orcid	https://orcid.org/0000-0002-2231-7995
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Freeman_Group norm.pdf
Size:: 7.118Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record