Steering policies for controlled Markov chains under a recurrence condition

Citation
Dj. Ma et Am. Makowski, Steering policies for controlled Markov chains under a recurrence condition, IEEE AUTO C, 44(8), 1999, pp. 1583-1587
Citations number
12
Categorie Soggetti
AI Robotics and Automatic Control
Journal title
IEEE TRANSACTIONS ON AUTOMATIC CONTROL
ISSN journal
00189286 → ACNP
Volume
44
Issue
8
Year of publication
1999
Pages
1583 - 1587
Database
ISI
SICI code
0018-9286(199908)44:8<1583:SPFCMC>2.0.ZU;2-1
Abstract
The authors consider the class of steering policies for controlled Markov c hains under a recurrence condition. A steering policy is defined as one ada ptively alternating between two stationary policies in order to track a sam ple average cost to a desired value. Convergence of the sample average cost s is derived via direct sample path arguments, and the performance of the s teering policy is discussed. Steering policies are motivated by, and partic ularly useful in, the discussion of constrained Markov chains with a single constraint.