Show simple item record

dc.contributor.author: Wensing, Patrick M
dc.contributor.author: Slotine, Jean-Jacques
dc.date.accessioned: 2022-01-24T19:05:18Z
dc.date.available: 2022-01-24T19:05:18Z
dc.date.issued: 2020
dc.identifier.uri: https://hdl.handle.net/1721.1/139673
dc.description.abstract: © 2020 Wensing, Slotine. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. This paper considers the analysis of continuous-time gradient-based optimization algorithms through the lens of nonlinear contraction theory. It demonstrates that in the case of a time-invariant objective, most elementary results on gradient descent based on convexity can be replaced by much more general results based on contraction. In particular, gradient descent converges to a unique equilibrium if its dynamics are contracting in any metric, with convexity of the cost corresponding to the special case of contraction in the identity metric. More broadly, contraction analysis provides new insights for the case of geodesically convex optimization, wherein non-convex problems in Euclidean space can be transformed to convex ones posed over a Riemannian manifold. In this case, natural gradient descent converges to a unique equilibrium if it is contracting in any metric, with geodesic convexity of the cost corresponding to contraction in the natural metric. New results using semi-contraction provide additional insights into the topology of the set of optimizers in the case when multiple optima exist. Furthermore, they show how semi-contraction may be combined with specific additional information to reach broad conclusions about a dynamical system. The contraction perspective also easily extends to time-varying optimization settings and allows one to recursively build large optimization structures out of simpler elements. Extensions to natural primal-dual optimization and game-theoretic contexts further illustrate the potential reach of these new perspectives. [en_US]
dc.language.iso: en
dc.publisher: Public Library of Science (PLoS) [en_US]
dc.relation.isversionof: 10.1371/JOURNAL.PONE.0236661 [en_US]
dc.rights: Creative Commons Attribution 4.0 International license [en_US]
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/ [en_US]
dc.source: PLoS [en_US]
dc.title: Beyond convexity—Contraction and global convergence of gradient descent [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Wensing, Patrick M and Slotine, Jean-Jacques. 2020. "Beyond convexity—Contraction and global convergence of gradient descent." PLoS ONE, 15 (8).
dc.contributor.department: Massachusetts Institute of Technology. Department of Mechanical Engineering
dc.contributor.department: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences
dc.contributor.department: Massachusetts Institute of Technology. Nonlinear Systems Laboratory
dc.relation.journal: PLoS ONE [en_US]
dc.eprint.version: Final published version [en_US]
dc.type.uri: http://purl.org/eprint/type/JournalArticle [en_US]
eprint.status: http://purl.org/eprint/status/PeerReviewed [en_US]
dc.date.updated: 2022-01-24T19:00:07Z
dspace.orderedauthors: Wensing, PM; Slotine, J-J [en_US]
dspace.date.submission: 2022-01-24T19:00:09Z
mit.journal.volume: 15 [en_US]
mit.journal.issue: 8 [en_US]
mit.license: PUBLISHER_CC
mit.metadata.status: Authority Work and Publication Information Needed [en_US]
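
Note on the abstract above: a minimal LaTeX sketch of the correspondence it describes, written in standard contraction-analysis notation; the symbols f, M, and lambda below are illustrative and are not quoted from the paper itself.

    % Gradient flow and its differential (variational) dynamics:
    %   \dot{x} = -\nabla f(x), \qquad \dot{\delta x} = -\nabla^2 f(x)\,\delta x .
    % Contraction with rate \lambda > 0 in a uniformly positive definite
    % metric M(x) requires
    \[
      \dot{M}(x) - M(x)\,\nabla^2 f(x) - \nabla^2 f(x)\,M(x) \preceq -2\lambda\, M(x) .
    \]
    % For the constant identity metric M = I this reduces to
    \[
      \nabla^2 f(x) \succeq \lambda I ,
    \]
    % i.e., strong convexity of the cost: the "identity metric" special case
    % noted in the abstract.

In the geodesically convex setting the abstract refers to, the flow is instead natural gradient descent, \dot{x} = -M(x)^{-1}\nabla f(x), with geodesic convexity of the cost corresponding to contraction in the natural metric M(x).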

