The limit points of (optimistic) gradient descent in min-max optimization

Daskalakis, C; Panageas, I

dc.contributor.author	Daskalakis, C
dc.contributor.author	Panageas, I
dc.date.accessioned	2022-06-14T19:17:17Z
dc.date.available	2022-06-14T19:17:17Z
dc.date.issued	2018-01-01
dc.identifier.uri	https://hdl.handle.net/1721.1/143126
dc.description.abstract	© 2018 Curran Associates Inc.All rights reserved. Motivated by applications in Optimization, Game Theory, and the training of Generative Adversarial Networks, the convergence properties of first order methods in min-max problems have received extensive study. It has been recognized that they may cycle, and there is no good understanding of their limit points when they do not. When they converge, do they converge to local min-max solutions? We characterize the limit points of two basic first order methods, namely Gradient Descent/Ascent (GDA) and Optimistic Gradient Descent Ascent (OGDA). We show that both dynamics avoid unstable critical points for almost all initializations. Moreover, for small step sizes and under mild assumptions, the set of OGDA-stable critical points is a superset of GDA-stable critical points, which is a superset of local min-max solutions (strict in some cases). The connecting thread is that the behavior of these dynamics can be studied from a dynamical systems perspective.	en_US
dc.language.iso	en
dc.relation.isversionof	https://papers.nips.cc/paper/2018/hash/139c3c1b7ca46a9d4fd6d163d98af635-Abstract.html	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Neural Information Processing Systems (NIPS)	en_US
dc.title	The limit points of (optimistic) gradient descent in min-max optimization	en_US
dc.type	Article	en_US
dc.identifier.citation	Daskalakis, C and Panageas, I. 2018. "The limit points of (optimistic) gradient descent in min-max optimization." Advances in Neural Information Processing Systems, 2018-December.
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.relation.journal	Advances in Neural Information Processing Systems	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2022-06-14T19:09:01Z
dspace.orderedauthors	Daskalakis, C; Panageas, I	en_US
dspace.date.submission	2022-06-14T19:09:02Z
mit.journal.volume	2018-December	en_US
mit.license	PUBLISHER_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: NeurIPS-2018-the-limit-points- ...
Size:: 806.6Kb
Format:: PDF
Description:: Published version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record