Spinning plates and squad systems: policies for bi-directional restless bandits

Citation
Glazebrook, K. D. et al., Spinning plates and squad systems: policies for bi-directional restless bandits, Advances in Applied Probability, 38(1), 2006, pp. 95-115
ISSN journal
0001-8678
Volume
38
Issue
1
Year of publication
2006
Pages
95 - 115
Database
ACNP
Abstract
This paper concerns two families of Markov decision problems that fall within the class of (bi-directional) restless bandits, an intractable class of decision processes introduced by Whittle. The spinning plates problem concerns the optimal management of a portfolio of reward-generating assets whose yields grow with investment but otherwise tend to decline. In the model of asset exploitation called the squad system, the yield from an asset tends to decline when it is used but recovers when the asset is at rest. In all cases, simply stated conditions are given that guarantee indexability of the problem, together with conditions that are necessary and sufficient for its strict indexability. The index heuristics for asset activation that emerge from the analysis are assessed numerically and found to perform very strongly.