Learning to Optimize Under Non-Stationarity

Cheung, Wang Chi; Simchi-Levi, David; Zhu, Ruihao

dc.contributor.author	Cheung, Wang Chi
dc.contributor.author	Simchi-Levi, David
dc.contributor.author	Zhu, Ruihao
dc.date.accessioned	2021-11-02T12:19:41Z
dc.date.available	2021-11-02T12:19:41Z
dc.date.issued	2018
dc.identifier.issn	1556-5068
dc.identifier.uri	https://hdl.handle.net/1721.1/137064
dc.description.abstract	© 2019 by the author(s). We introduce algorithms that achieve state-of-the-art dynamic regret bounds for non-stationary linear stochastic bandit setting. It captures natural applications such as dynamic pricing and ads allocation in a changing environment. We show how the difficulty posed by the non-stationarity can be overcome by a novel marriage between stochastic and adversarial bandits learning algorithms. Defining d, BT, and T as the problem dimension, the variation budget, and the total time horizon, respectively, our main contributions are the tuned Sliding Window UCB (SW-UCB) algorithm with optimal Oe(d2/3(BT + 1)1/3T2/3) dynamic regret, and the tuning free bandit-over-bandit (BOB) framework built on top of the SW-UCB algorithm with best Oe(d2/3(BT + 1)1/4T3/4) dynamic regret.	en_US
dc.language.iso	en
dc.publisher	Elsevier BV	en_US
dc.relation.isversionof	10.2139/ssrn.3261050	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	arXiv	en_US
dc.title	Learning to Optimize Under Non-Stationarity	en_US
dc.type	Article	en_US
dc.identifier.citation	Cheung, Wang Chi, Simchi-Levi, David and Zhu, Ruihao. 2018. "Learning to Optimize Under Non-Stationarity." AISTATS 2019 - 22nd International Conference on Artificial Intelligence and Statistics, 89.
dc.contributor.department	Massachusetts Institute of Technology. Institute for Data, Systems, and Society
dc.contributor.department	Statistics and Data Science Center (Massachusetts Institute of Technology)
dc.relation.journal	AISTATS 2019 - 22nd International Conference on Artificial Intelligence and Statistics	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2020-06-02T16:54:50Z
dspace.date.submission	2020-06-02T16:54:53Z
mit.journal.volume	89	en_US
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 1810.03024.pdf
Size:: 417.3Kb
Format:: PDF
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record