Direct Runge-Kutta discretization achieves acceleration

Zhang, Jingzhao; Mokhtari, Aryan; Sra, Suvrit; Jadbabaie-Moghadam, Ali

dc.contributor.author	Zhang, Jingzhao
dc.contributor.author	Mokhtari, Aryan
dc.contributor.author	Sra, Suvrit
dc.contributor.author	Jadbabaie-Moghadam, Ali
dc.date.accessioned	2022-01-07T19:25:09Z
dc.date.available	2021-11-04T16:18:35Z
dc.date.available	2022-01-07T19:25:09Z
dc.date.issued	2018
dc.identifier.uri	https://hdl.handle.net/1721.1/137357.2
dc.description.abstract	© 2018 Curran Associates Inc..All rights reserved. We study gradient-based optimization methods obtained by directly discretizing a second-order ordinary differential equation (ODE) related to the continuous limit of Nesterov's accelerated gradient method. When the function is smooth enough, we show that acceleration can be achieved by a stable discretization of this ODE using standard Runge-Kutta integrators. Specifically, we prove that under Lipschitz-gradient, convexity and order-(s + 2) differentiability assumptions, the sequence of iterates generated by discretizing the proposed second-order ODE converges to the optimal solution at a rate of O(N−2 s+1 s ), where s is the order of the Runge-Kutta numerical integrator. Furthermore, we introduce a new local flatness condition on the objective, under which rates even faster than O(N−2) can be achieved with low-order integrators and only gradient information. Notably, this flatness condition is satisfied by several standard loss functions used in machine learning. We provide numerical experiments that verify the theoretical rates predicted by our results.	en_US
dc.language.iso	en
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Neural Information Processing Systems (NIPS)	en_US
dc.title	Direct Runge-Kutta discretization achieves acceleration	en_US
dc.type	Article	en_US
dc.identifier.citation	Zhang, Jingzhao, Mokhtari, Aryan, Sra, Suvrit and Jadbabaie, Ali. 2018. "Direct Runge-Kutta discretization achieves acceleration." Advances in Neural Information Processing Systems, 2018-December.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems	en_US
dc.contributor.department	Massachusetts Institute of Technology. Institute for Data, Systems, and Society	en_US
dc.relation.journal	Advances in Neural Information Processing Systems	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2021-03-30T13:49:29Z
dspace.orderedauthors	Zhang, J; Sra, S; Mokhtari, A; Jadbabaie, A	en_US
dspace.date.submission	2021-03-30T13:49:30Z
mit.journal.volume	2018-December	en_US
mit.license	PUBLISHER_POLICY
mit.metadata.status	Publication Information Needed	en_US

Files in this item

Name:: NeurIPS-2018-direct-runge-kutt ...
Size:: 593.1Kb
Format:: Unknown
Description:: Published version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record

Version	Item	Date	Summary
2	1721.1/137357.2*	2022-01-07T19:19:08Z	Verified or entered authority metadata.
1	1721.1/137357	2021-11-04T16:18:35Z

DSpace@MIT

Direct Runge-Kutta discretization achieves acceleration

Files in this item

This item appears in the following Collection(s)

Version History