Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations

Citation
Olv. Costa et Jcc. Aya, Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations, J OPTIM TH, 109(2), 2001, pp. 289-309
Citations number
17
Categorie Soggetti
Engineering Mathematics
Journal title
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS
ISSN journal
00223239 → ACNP
Volume
109
Issue
2
Year of publication
2001
Pages
289 - 309
Database
ISI
SICI code
0022-3239(200105)109:2<289:TDMFTM>2.0.ZU;2-R
Abstract
In this paper, we present an iterative technique for deriving the maximal s olution of a set of discrete-time coupled algebraic Riccati equations, base d on temporal difference methods, which are related to the optimal control of Markovian jump linear systems and have been studied extensively over the last few years. We trace a parallel with the theory of temporal difference algorithms for Markovian decision processes to develop a lambda -policy it eration like algorithm for the maximal solution of these equations. For the special cases in which lambda =0 and lambda = 1 we have the situation in w hich the algorithm reduces to the iterations of the Riccati difference equa tions (value iteration) and quasilinearization method (policy iteration), r espectively. The advantage of the proposed method is that an appropriate ch oice of lambda between 0 and 1 can speed up the convergence of the policy e valuation step of the policy iteration method by using value iteration.