ITA
ENG

Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations

Authors

Costa, OLV Aya, JCC

Citation

Olv. Costa et Jcc. Aya, Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations, J OPTIM TH, 109(2), 2001, pp. 289-309

Citations number

Categorie Soggetti

Engineering Mathematics

Journal title

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS

ISSN journal

00223239 → ACNP

Volume

109

Issue

Year of publication

2001

Pages

289 - 309

Database

ISI

SICI code

0022-3239(200105)109:2<289:TDMFTM>2.0.ZU;2-R

Abstract

In this paper, we present an iterative technique for deriving the maximal s olution of a set of discrete-time coupled algebraic Riccati equations, base d on temporal difference methods, which are related to the optimal control of Markovian jump linear systems and have been studied extensively over the last few years. We trace a parallel with the theory of temporal difference algorithms for Markovian decision processes to develop a lambda -policy it eration like algorithm for the maximal solution of these equations. For the special cases in which lambda =0 and lambda = 1 we have the situation in w hich the algorithm reduces to the iterations of the Riccati difference equa tions (value iteration) and quasilinearization method (policy iteration), r espectively. The advantage of the proposed method is that an appropriate ch oice of lambda between 0 and 1 can speed up the convergence of the policy e valuation step of the policy iteration method by using value iteration.