Multi-armed bandits in discrete and continuous time

Citation
H. Kaspi and A. Mandelbaum, Multi-armed bandits in discrete and continuous time, ANN APPL PR, 8(4), 1998, pp. 1270-1290
Number of citations
21
Subject categories
Mathematics
Journal title
ANNALS OF APPLIED PROBABILITY
Journal ISSN
1050-5164
Volume
8
Issue
4
Year of publication
1998
Pages
1270 - 1290
Database
ISI
SICI code
1050-5164(199811)8:4<1270:MBIDAC>2.0.ZU;2-S
Abstract
We analyze Gittins' Markovian model, as generalized by Varaiya, Walrand and Buyukkoc, in discrete and continuous time. The approach resembles Weber's modification of Whittle's, within the framework of both multiparameter processes and excursion theory. It is shown that index-priority strategies are optimal, in concert with all the special cases that have been treated previously.