dc.contributor.author | Staib, Matthew | |
dc.contributor.author | Wilder, B | |
dc.contributor.author | Jegelka, Stefanie Sabrina | |
dc.date.accessioned | 2021-02-23T22:04:16Z | |
dc.date.available | 2021-02-23T22:04:16Z | |
dc.date.issued | 2019-04 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/129983 | |
dc.description.abstract | Submodular functions have applications throughout machine learning, but in many settings, we do not have direct access to the underlying function f. We focus on stochastic functions that are given as an expectation of functions over a distribution P. In practice, we often have only a limited set of samples fi from P. The standard approach indirectly optimizes f by maximizing the sum of fi. However, this ignores generalization to the true (unknown) distribution. In this paper, we achieve better performance on the actual underlying function f by directly optimizing a combination of bias and variance. Algorithmically, we accomplish this by showing how to carry out distributionally robust optimization (DRO) for submodular functions, providing efficient algorithms backed by theoretical guarantees which leverage several novel contributions to the general theory of DRO. We also show compelling empirical evidence that DRO improves generalization to the unknown stochastic submodular function. | en_US |
dc.language.iso | en | |
dc.publisher | MLResearchPress | en_US |
dc.relation.isversionof | http://proceedings.mlr.press/v89/staib19a.html | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | arXiv | en_US |
dc.title | Distributionally robust submodular maximization | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Staib, Matthew et al. "Distributionally robust submodular maximization." 22nd International Conference on Artificial Intelligence and Statistics, April 2019, Naha, Okinawa, Japan, MLResearchPress, April 2019. © 2019 by the author(s) | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
dc.relation.journal | 22nd International Conference on Artificial Intelligence and Statistics | en_US |
dc.eprint.version | Original manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2020-12-21T19:49:32Z | |
dspace.orderedauthors | Staib, M; Wilder, B; Jegelka, S | en_US |
dspace.date.submission | 2020-12-21T19:49:34Z | |
mit.license | OPEN_ACCESS_POLICY | |
mit.metadata.status | Complete | |