| dc.contributor.author | Araya-Polo, Mauricio | |
| dc.contributor.author | Frogner, Charles Albert | |
| dc.contributor.author | Zhang, Chiyuan | |
| dc.contributor.author | Mobahi, Hossein | |
| dc.contributor.author | Poggio, Tomaso A | |
| dc.date.accessioned | 2017-11-28T20:02:00Z | |
| dc.date.available | 2017-11-28T20:02:00Z | |
| dc.date.issued | 2015-12 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/112312 | |
| dc.description.abstract | Learning to predict multi-label outputs is challenging, but in many problems there is a natural metric on the outputs that can be used to improve predictions.In this paper we develop a loss function for multi-label learning, based on the Wasserstein distance. The Wasserstein distance provides a natural notion of dissimilarity for probability measures. Although optimizing with respect to the exact Wasserstein distance is costly, recent work has described a regularized approximation that is efficiently computed. We describe an efficient learning algorithm based on this regularization, as well as a novel extension of the Wasserstein distance from probability measures to unnormalized measures. We also describe a statistical learning bound for the loss. The Wasserstein loss can encourage smoothness of the predictions with respect to a chosen metric on the output space. We demonstrate this property on a real-data tag prediction problem, using the Yahoo Flickr Creative Commons dataset, outperforming a baseline that doesn't use the metric. | en_US |
| dc.publisher | MIT Press | en_US |
| dc.relation.isversionof | https://dl.acm.org/citation.cfm?id=2969469 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | arXiv | en_US |
| dc.title | Learning with a Wasserstein loss | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Frogner, Charlie et al. "Learning with a Wasserstein loss." Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS 2015), December 7-12 2015, Montreal, Canada, MIT Press, December 2015 © 2015 MIT Press Cambridge, MA, USA | en_US |
| dc.contributor.department | Center for Brains, Minds, and Machines | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.contributor.mitauthor | Frogner, Charles Albert | |
| dc.contributor.mitauthor | Zhang, Chiyuan | |
| dc.contributor.mitauthor | Mobahi, Hossein | |
| dc.contributor.mitauthor | Poggio, Tomaso A | |
| dc.relation.journal | Proceedings of the 28th International Conference on Neural Information Processing Systems (NIPS 2015) | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2017-11-17T20:17:41Z | |
| dspace.orderedauthors | Frogner, Charlie; Zhang, Chiyuan; Mobahi, Hossein; Araya-Polo, Mauricio; Poggio, Tomaso | en_US |
| dspace.embargo.terms | N | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0003-2156-4916 | |
| dc.identifier.orcid | https://orcid.org/0000-0001-8467-1888 | |
| dc.identifier.orcid | https://orcid.org/0000-0001-8074-1092 | |
| dc.identifier.orcid | https://orcid.org/0000-0002-3944-0455 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |