| dc.contributor.author | Daskalakis, C | |
| dc.contributor.author | Panageas, I | |
| dc.date.accessioned | 2022-06-14T19:17:17Z | |
| dc.date.available | 2022-06-14T19:17:17Z | |
| dc.date.issued | 2018-01-01 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/143126 | |
| dc.description.abstract | © 2018 Curran Associates Inc.All rights reserved. Motivated by applications in Optimization, Game Theory, and the training of Generative Adversarial Networks, the convergence properties of first order methods in min-max problems have received extensive study. It has been recognized that they may cycle, and there is no good understanding of their limit points when they do not. When they converge, do they converge to local min-max solutions? We characterize the limit points of two basic first order methods, namely Gradient Descent/Ascent (GDA) and Optimistic Gradient Descent Ascent (OGDA). We show that both dynamics avoid unstable critical points for almost all initializations. Moreover, for small step sizes and under mild assumptions, the set of OGDA-stable critical points is a superset of GDA-stable critical points, which is a superset of local min-max solutions (strict in some cases). The connecting thread is that the behavior of these dynamics can be studied from a dynamical systems perspective. | en_US |
| dc.language.iso | en | |
| dc.relation.isversionof | https://papers.nips.cc/paper/2018/hash/139c3c1b7ca46a9d4fd6d163d98af635-Abstract.html | en_US |
| dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
| dc.source | Neural Information Processing Systems (NIPS) | en_US |
| dc.title | The limit points of (optimistic) gradient descent in min-max optimization | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Daskalakis, C and Panageas, I. 2018. "The limit points of (optimistic) gradient descent in min-max optimization." Advances in Neural Information Processing Systems, 2018-December. | |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | |
| dc.relation.journal | Advances in Neural Information Processing Systems | en_US |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2022-06-14T19:09:01Z | |
| dspace.orderedauthors | Daskalakis, C; Panageas, I | en_US |
| dspace.date.submission | 2022-06-14T19:09:02Z | |
| mit.journal.volume | 2018-December | en_US |
| mit.license | PUBLISHER_POLICY | |
| mit.metadata.status | Authority Work and Publication Information Needed | en_US |