ITA
ENG

MEAN-FIELD THEORY FOR BATCHED TD(LAMBDA)

Authors

PINEDA FJ

Citation

Fj. Pineda, MEAN-FIELD THEORY FOR BATCHED TD(LAMBDA), Neural computation, 9(7), 1997, pp. 1403-1419

Citations number

Categorie Soggetti

Computer Sciences","Computer Science Artificial Intelligence",Neurosciences

Journal title

Neural computation → ACNP

ISSN journal

08997667

Volume

Issue

Year of publication

1997

Pages

1403 - 1419

Database

ISI

SICI code

0899-7667(1997)9:7<1403:MTFBT>2.0.ZU;2-H

Abstract

A representation-independent mean-field dynamics is presented for batc hed TD(lambda). The task is learning to predict the outcome of an indi rectly observed absorbing Markov process. In the case of linear repres entations, the discrete-time deterministic iteration is an affine map whose fixed point can be expressed in closed form without the assumpti on of linearly independent observation vectors. Batched linear TD(A) i s proved to converge with probability 1 for all lambda. Theory and sim ulation agree on a random walk example.