The performance of index-based policies for bandit problems with stochastic machine availability

Citation
Rt. Dunn et Kd. Glazebrook, The performance of index-based policies for bandit problems with stochastic machine availability, ADV APPL P, 33(2), 2001, pp. 365-390
Citations number
22
Categorie Soggetti
Mathematics
Journal title
ADVANCES IN APPLIED PROBABILITY
ISSN journal
00018678 → ACNP
Volume
33
Issue
2
Year of publication
2001
Pages
365 - 390
Database
ISI
SICI code
0001-8678(200106)33:2<365:TPOIPF>2.0.ZU;2-L
Abstract
We consider generalisations of two classical stochastic scheduling models, namely the discounted branching bandit and the discounted multi-armed bandi t, to the case where the collection of machines available for processing is itself a stochastic process. Under rather mild conditions on the machine a vailability process we obtain performance guarantees for a range of control s based on Gittins indices. Various forms of asymptotic optimality are esta blished for index-based limit policies as the discount rate approaches 0.