Show simple item record

dc.contributor.authorAraya-Polo, Mauricio
dc.contributor.authorFrogner, Charles Albert
dc.contributor.authorZhang, Chiyuan
dc.contributor.authorMobahi, Hossein
dc.contributor.authorPoggio, Tomaso A
dc.date.accessioned2017-11-28T20:02:00Z
dc.date.available2017-11-28T20:02:00Z
dc.date.issued2015-12
dc.identifier.urihttp://hdl.handle.net/1721.1/112312
dc.description.abstractLearning to predict multi-label outputs is challenging, but in many problems there is a natural metric on the outputs that can be used to improve predictions.In this paper we develop a loss function for multi-label learning, based on the Wasserstein distance. The Wasserstein distance provides a natural notion of dissimilarity for probability measures. Although optimizing with respect to the exact Wasserstein distance is costly, recent work has described a regularized approximation that is efficiently computed. We describe an efficient learning algorithm based on this regularization, as well as a novel extension of the Wasserstein distance from probability measures to unnormalized measures. We also describe a statistical learning bound for the loss. The Wasserstein loss can encourage smoothness of the predictions with respect to a chosen metric on the output space. We demonstrate this property on a real-data tag prediction problem, using the Yahoo Flickr Creative Commons dataset, outperforming a baseline that doesn't use the metric.en_US
dc.publisherMIT Pressen_US
dc.relation.isversionofhttps://dl.acm.org/citation.cfm?id=2969469en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcearXiven_US
dc.titleLearning with a Wasserstein lossen_US
dc.typeArticleen_US
dc.identifier.citationFrogner, Charlie et al. "Learning with a Wasserstein loss." Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS 2015), December 7-12 2015, Montreal, Canada, MIT Press, December 2015 © 2015 MIT Press Cambridge, MA, USAen_US
dc.contributor.departmentCenter for Brains, Minds, and Machinesen_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Brain and Cognitive Sciencesen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorFrogner, Charles Albert
dc.contributor.mitauthorZhang, Chiyuan
dc.contributor.mitauthorMobahi, Hossein
dc.contributor.mitauthorPoggio, Tomaso A
dc.relation.journalProceedings of the 28th International Conference on Neural Information Processing Systems (NIPS 2015)en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2017-11-17T20:17:41Z
dspace.orderedauthorsFrogner, Charlie; Zhang, Chiyuan; Mobahi, Hossein; Araya-Polo, Mauricio; Poggio, Tomasoen_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0003-2156-4916
dc.identifier.orcidhttps://orcid.org/0000-0001-8467-1888
dc.identifier.orcidhttps://orcid.org/0000-0001-8074-1092
dc.identifier.orcidhttps://orcid.org/0000-0002-3944-0455
mit.licenseOPEN_ACCESS_POLICYen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record