Adaptive control of constrained finite Markov chains

Citation
A.S. Poznyak and K. Najim, Adaptive control of constrained finite Markov chains, AUTOMATICA, 35(5), 1999, pp. 777-789
Number of citations
45
Subject Categories
AI Robotics and Automatic Control
Journal title
AUTOMATICA
Journal ISSN
0005-1098
Volume
35
Issue
5
Year of publication
1999
Pages
777 - 789
Database
ISI
SICI code
0005-1098(199905)35:5<777:ACOCFM>2.0.ZU;2-P
Abstract
An adaptive control algorithm is presented for constrained finite controlled Markov chains with unknown transition probabilities. A finite set of algebraic constraints is considered. The Lagrange multipliers approach is used to solve this constrained optimization problem. At each time n, the algorithm updates its estimate of the control policy using the Bush-Mosteller scheme, which is related to stochastic approximation procedures. We present the asymptotic properties (convergence and order of convergence rate) of the algorithm. They follow from the law of large numbers for dependent sequences, martingale theory and Lyapunov function analysis approaches. (C) 1999 Elsevier Science Ltd. All rights reserved.
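
The abstract names the Bush-Mosteller scheme but does not give its update rule. As a rough illustration only, the sketch below shows one common textbook form of a Bush-Mosteller reinforcement step on a probability vector over N actions. The function name `bush_mosteller_step`, the normalized loss in [0, 1], and the step size `gamma` are assumptions made for this example. It is not the paper's algorithm, which additionally handles the Markov chain state and the Lagrange multipliers.

```python
def bush_mosteller_step(p, action, loss, gamma):
    """One Bush-Mosteller-style update of an action-probability vector.

    Assumed (hypothetical) setup, not taken from the paper:
      p      -- current probabilities over N >= 2 actions (sums to 1)
      action -- index of the action that was just applied
      loss   -- normalized loss in [0, 1] observed for that action
      gamma  -- step size in [0, 1]
    """
    n = len(p)
    updated = []
    for i, p_i in enumerate(p):
        e_i = 1.0 if i == action else 0.0
        # Reward part pulls p toward the chosen action; the loss part pushes
        # probability mass from the chosen action to the other N - 1 actions.
        penalty_dir = (1.0 - n * e_i) / (n - 1)
        updated.append(p_i + gamma * (e_i - p_i + loss * penalty_dir))
    return updated


if __name__ == "__main__":
    p = [0.25, 0.25, 0.25, 0.25]
    # Zero loss: the chosen action (index 1) gains probability.
    print(bush_mosteller_step(p, action=1, loss=0.0, gamma=0.1))
    # Full loss: the chosen action loses probability to the others.
    print(bush_mosteller_step(p, action=1, loss=1.0, gamma=0.1))
```

Both correction terms sum to zero across the components, so the updated vector stays on the probability simplex whenever `gamma` and `loss` lie in [0, 1]. In a stochastic approximation setting the step size would typically decrease over time.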