MEAN-FIELD THEORY FOR BATCHED TD(LAMBDA)

Authors
Citation
Fj. Pineda, MEAN-FIELD THEORY FOR BATCHED TD(LAMBDA), Neural computation, 9(7), 1997, pp. 1403-1419
Citations number
17
Categorie Soggetti
Computer Sciences","Computer Science Artificial Intelligence",Neurosciences
Journal title
ISSN journal
08997667
Volume
9
Issue
7
Year of publication
1997
Pages
1403 - 1419
Database
ISI
SICI code
0899-7667(1997)9:7<1403:MTFBT>2.0.ZU;2-H
Abstract
A representation-independent mean-field dynamics is presented for batc hed TD(lambda). The task is learning to predict the outcome of an indi rectly observed absorbing Markov process. In the case of linear repres entations, the discrete-time deterministic iteration is an affine map whose fixed point can be expressed in closed form without the assumpti on of linearly independent observation vectors. Batched linear TD(A) i s proved to converge with probability 1 for all lambda. Theory and sim ulation agree on a random walk example.