Last-iterate convergence: Zero-sum games and constrained min-max optimization

Daskalakis, C; Panageas, I

dc.contributor.author	Daskalakis, C
dc.contributor.author	Panageas, I
dc.date.accessioned	2022-06-17T14:24:41Z
dc.date.available	2022-06-17T14:24:41Z
dc.date.issued	2019-01-01
dc.identifier.uri	https://hdl.handle.net/1721.1/143460
dc.description.abstract	© Constantinos Daskalakis and Ioannis Panageas. Motivated by applications in Game Theory, Optimization, and Generative Adversarial Networks, recent work of Daskalakis et al [Daskalakis et al., ICLR, 2018] and follow-up work of Liang and Stokes [Liang and Stokes, 2018] have established that a variant of the widely used Gradient Descent/Ascent procedure, called “Optimistic Gradient Descent/Ascent (OGDA)”, exhibits last-iterate convergence to saddle points in unconstrained convex-concave min-max optimization problems. We show that the same holds true in the more general problem of constrained min-max optimization under a variant of the no-regret Multiplicative-Weights-Update method called “Optimistic Multiplicative-Weights Update (OMWU)”. This answers an open question of Syrgkanis et al [Syrgkanis et al., NIPS, 2015]. The proof of our result requires fundamentally different techniques from those that exist in no-regret learning literature and the aforementioned papers. We show that OMWU monotonically improves the Kullback-Leibler divergence of the current iterate to the (appropriately normalized) min-max solution until it enters a neighborhood of the solution. Inside that neighborhood we show that OMWU becomes a contracting map converging to the exact solution. We believe that our techniques will be useful in the analysis of the last iterate of other learning algorithms.	en_US
dc.language.iso	en
dc.relation.isversionof	10.4230/LIPIcs.ITCS.2019.27	en_US
dc.rights	Creative Commons Attribution 3.0 unported license	en_US
dc.rights.uri	https://creativecommons.org/licenses/by/3.0/	en_US
dc.source	DROPS	en_US
dc.title	Last-iterate convergence: Zero-sum games and constrained min-max optimization	en_US
dc.type	Article	en_US
dc.identifier.citation	Daskalakis, C and Panageas, I. 2019. "Last-iterate convergence: Zero-sum games and constrained min-max optimization." Leibniz International Proceedings in Informatics, LIPIcs, 124.
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.relation.journal	Leibniz International Proceedings in Informatics, LIPIcs	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2022-06-17T14:20:15Z
dspace.orderedauthors	Daskalakis, C; Panageas, I	en_US
dspace.date.submission	2022-06-17T14:20:16Z
mit.journal.volume	124	en_US
mit.license	PUBLISHER_CC
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: LIPIcs-ITCS-2019-27.pdf
Size:: 631.4Kb
Format:: PDF
Description:: Published version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record