On the Gittins Index for Multiarmed Bandits

Authors
Citation
Weber, Richard, On the Gittins Index for Multiarmed Bandits, Annals of applied probability , 2(4), 1992, pp. 1024-1033
ISSN journal
10505164
Volume
2
Issue
4
Year of publication
1992
Pages
1024 - 1033
Database
ACNP
SICI code
Abstract
This paper considers the multiarmed bandit problem and presents a new proof of the optimality of the Gittins index policy. The proof is intuitive and does not require an interchange argument. The insight it affords is used to give a streamlined summary of previous research and to prove a new result: The optimal value function is a submodular set function of the available projects.