R. Righter et Jg. Shanthikumar, INDEPENDENTLY EXPIRING MULTIARMED BANDITS, Probability in the engineering and informational sciences, 12(4), 1998, pp. 453-468
Citations number
14
Categorie Soggetti
Statistic & Probability","Operatione Research & Management Science","Engineering, Industrial","Statistic & Probability","Operatione Research & Management Science
We give conditions on the optimality of an index policy for multiarmed
bandits when arms expire independently. We also give a new simple pro
of of the optimality of the Gittins index policy for the classic multi
armed bandit problem.