dc.contributor.author | Cheung, Wang Chi | |
dc.contributor.author | Simchi-Levi, David | |
dc.contributor.author | Zhu, Ruihao | |
dc.date.accessioned | 2021-11-02T12:19:41Z | |
dc.date.available | 2021-11-02T12:19:41Z | |
dc.date.issued | 2018 | |
dc.identifier.issn | 1556-5068 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/137064 | |
dc.description.abstract | © 2019 by the author(s). We introduce algorithms that achieve state-of-the-art dynamic regret bounds for non-stationary linear stochastic bandit setting. It captures natural applications such as dynamic pricing and ads allocation in a changing environment. We show how the difficulty posed by the non-stationarity can be overcome by a novel marriage between stochastic and adversarial bandits learning algorithms. Defining d, BT, and T as the problem dimension, the variation budget, and the total time horizon, respectively, our main contributions are the tuned Sliding Window UCB (SW-UCB) algorithm with optimal Oe(d2/3(BT + 1)1/3T2/3) dynamic regret, and the tuning free bandit-over-bandit (BOB) framework built on top of the SW-UCB algorithm with best Oe(d2/3(BT + 1)1/4T3/4) dynamic regret. | en_US |
dc.language.iso | en | |
dc.publisher | Elsevier BV | en_US |
dc.relation.isversionof | 10.2139/ssrn.3261050 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | arXiv | en_US |
dc.title | Learning to Optimize Under Non-Stationarity | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Cheung, Wang Chi, Simchi-Levi, David and Zhu, Ruihao. 2018. "Learning to Optimize Under Non-Stationarity." AISTATS 2019 - 22nd International Conference on Artificial Intelligence and Statistics, 89. | |
dc.contributor.department | Massachusetts Institute of Technology. Institute for Data, Systems, and Society | |
dc.contributor.department | Statistics and Data Science Center (Massachusetts Institute of Technology) | |
dc.relation.journal | AISTATS 2019 - 22nd International Conference on Artificial Intelligence and Statistics | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2020-06-02T16:54:50Z | |
dspace.date.submission | 2020-06-02T16:54:53Z | |
mit.journal.volume | 89 | en_US |
mit.license | OPEN_ACCESS_POLICY | |
mit.metadata.status | Authority Work and Publication Information Needed | en_US |