Batched Bandit Problems
Author(s)
Perchet, Vianney; Rigollet, Philippe; Chassang, Sylvain; Snowberg, Erik
DownloadRigollet_Batched bandit.pdf (326.4Kb)
OPEN_ACCESS_POLICY
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
Motivated by practical applications, chiefly clinical trials, we study the regret achievable for stochastic bandits under the constraint that the employed policy must split trials into a small number of batches. Our results show that a very small number of batches gives close to minimax optimal regret bounds. As a byproduct, we derive optimal policies with low switching cost for stochastic bandits.
Date issued
2015-09-24Department
Massachusetts Institute of Technology. Department of MathematicsJournal
forthcoming in Annals of Statistics
Publisher
Institute of Mathematical Statistics
Citation
Perchet, Vianney, Philippe Rigollet, Sylvain Chassang, and Erik Snowberg. "Batched Bandit Problems." Annals of Statistics (2015).
Version: Author's final manuscript
ISSN
0090-5364