Notice

This is not the latest version of this item. The latest version can be found at:https://dspace.mit.edu/handle/1721.1/137055.2

Show simple item record

dc.contributor.authorsimchi-levi, David
dc.contributor.authorWang, Xinshang
dc.date.accessioned2021-11-02T11:37:27Z
dc.date.available2021-11-02T11:37:27Z
dc.date.issued2018
dc.identifier.urihttps://hdl.handle.net/1721.1/137055
dc.description.abstract© 2018 Curran Associates Inc..All rights reserved. Classically, the time complexity of a first-order method is estimated by its number of gradient computations. In this paper, we study a more refined complexity by taking into account the “lingering” of gradients: once a gradient is computed at xk, the additional time to compute gradients at xk+1, xk+2, . . . may be reduced. We show how this improves the running time of gradient descent and SVRG. For instance, if the “additional time” scales linearly with respect to the traveled distance, then the “convergence rate” of gradient descent can be improved from 1/T to exp(−T1/3). On the empirical side, we solve a hypothetical revenue management problem on the Yahoo! Front Page Today Module application with 4.6m users to 10−6 error (or 10−12 dual error) using 6 passes of the dataset.en_US
dc.language.isoen
dc.relation.isversionofhttps://papers.nips.cc/paper/2018/file/b4288d9c0ec0a1841b3b3728321e7088-Paper.pdfen_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceNeural Information Processing Systems (NIPS)en_US
dc.titleThe lingering of gradients: How to reuse gradients over timeen_US
dc.typeArticleen_US
dc.identifier.citationsimchi-levi, David and Wang, Xinshang. 2018. "The lingering of gradients: How to reuse gradients over time." Advances in Neural Information Processing Systems, 2018-December.
dc.relation.journalAdvances in Neural Information Processing Systemsen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2020-06-02T16:19:52Z
dspace.date.submission2020-06-02T16:19:55Z
mit.journal.volume2018-Decemberen_US
mit.licensePUBLISHER_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version