dc.contributor.author | Daskalakis, C | |
dc.contributor.author | Panageas, I | |
dc.date.accessioned | 2022-06-17T14:24:41Z | |
dc.date.available | 2022-06-17T14:24:41Z | |
dc.date.issued | 2019-01-01 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/143460 | |
dc.description.abstract | © Constantinos Daskalakis and Ioannis Panageas. Motivated by applications in Game Theory, Optimization, and Generative Adversarial Networks, recent work of Daskalakis et al [Daskalakis et al., ICLR, 2018] and follow-up work of Liang and Stokes [Liang and Stokes, 2018] have established that a variant of the widely used Gradient Descent/Ascent procedure, called “Optimistic Gradient Descent/Ascent (OGDA)”, exhibits last-iterate convergence to saddle points in unconstrained convex-concave min-max optimization problems. We show that the same holds true in the more general problem of constrained min-max optimization under a variant of the no-regret Multiplicative-Weights-Update method called “Optimistic Multiplicative-Weights Update (OMWU)”. This answers an open question of Syrgkanis et al [Syrgkanis et al., NIPS, 2015]. The proof of our result requires fundamentally different techniques from those that exist in no-regret learning literature and the aforementioned papers. We show that OMWU monotonically improves the Kullback-Leibler divergence of the current iterate to the (appropriately normalized) min-max solution until it enters a neighborhood of the solution. Inside that neighborhood we show that OMWU becomes a contracting map converging to the exact solution. We believe that our techniques will be useful in the analysis of the last iterate of other learning algorithms. | en_US |
dc.language.iso | en | |
dc.relation.isversionof | 10.4230/LIPIcs.ITCS.2019.27 | en_US |
dc.rights | Creative Commons Attribution 3.0 unported license | en_US |
dc.rights.uri | https://creativecommons.org/licenses/by/3.0/ | en_US |
dc.source | DROPS | en_US |
dc.title | Last-iterate convergence: Zero-sum games and constrained min-max optimization | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Daskalakis, C and Panageas, I. 2019. "Last-iterate convergence: Zero-sum games and constrained min-max optimization." Leibniz International Proceedings in Informatics, LIPIcs, 124. | |
dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | |
dc.relation.journal | Leibniz International Proceedings in Informatics, LIPIcs | en_US |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2022-06-17T14:20:15Z | |
dspace.orderedauthors | Daskalakis, C; Panageas, I | en_US |
dspace.date.submission | 2022-06-17T14:20:16Z | |
mit.journal.volume | 124 | en_US |
mit.license | PUBLISHER_CC | |
mit.metadata.status | Authority Work and Publication Information Needed | en_US |