BATCHED BANDIT PROBLEMS

Citation
Vianney Perchet et al., BATCHED BANDIT PROBLEMS, Annals of statistics , 44(2), 2016, pp. 660-681
Journal title
ISSN journal
00905364
Volume
44
Issue
2
Year of publication
2016
Pages
660 - 681
Database
ACNP
SICI code
Abstract
Motivated by practical applications, chiefly clinical trials, we study the regret achievable for stochastic bandits under the constraint that the employed policy must split trials into a small number of batches. We propose a simple policy, and show that a very small number of batches gives close to minimax optimal regret bounds. As a byproduct, we derive optimal policies with low switching cost for stochastic bandits.