dc.contributor.author | Chen, Daozheng | |
dc.contributor.author | Batra, Dhruv | |
dc.contributor.author | Freeman, William T. | |
dc.date.accessioned | 2015-11-24T19:39:24Z | |
dc.date.available | 2015-11-24T19:39:24Z | |
dc.date.issued | 2013-12 | |
dc.identifier.isbn | 978-1-4799-2840-8 | |
dc.identifier.issn | 1550-5499 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/100043 | |
dc.description.abstract | Latent variables models have been applied to a number of computer vision problems. However, the complexity of the latent space is typically left as a free design choice. A larger latent space results in a more expressive model, but such models are prone to over fitting and are slower to perform inference with. The goal of this paper is to regularize the complexity of the latent space and learn which hidden states are really relevant for prediction. Specifically, we propose using group-sparsity-inducing regularizers such as ℓ[subscript 1]-ℓ[subscript 2] to estimate the parameters of Structured SVMs with unstructured latent variables. Our experiments on digit recognition and object detection show that our approach is indeed able to control the complexity of latent space without any significant loss in accuracy of the learnt model. | en_US |
dc.description.sponsorship | Quanta Computer (Firm) | en_US |
dc.description.sponsorship | Google (Firm) | en_US |
dc.language.iso | en_US | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1109/ICCV.2013.58 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | Other univ. web domain | en_US |
dc.title | Group Norm for Learning Structured SVMs with Unstructured Latent Variables | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Chen, Daozheng, Dhruv Batra, and William T. Freeman. “Group Norm for Learning Structured SVMs with Unstructured Latent Variables.” 2013 IEEE International Conference on Computer Vision (December 2013). | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.contributor.mitauthor | Freeman, William T. | en_US |
dc.relation.journal | Proceedings of the 2013 IEEE International Conference on Computer Vision | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dspace.orderedauthors | Chen, Daozheng; Batra, Dhruv; Freeman, William T. | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-2231-7995 | |
mit.license | OPEN_ACCESS_POLICY | en_US |
mit.metadata.status | Complete | |