ITA
ENG

LEARNING THE GLOBAL MAXIMUM WITH PARAMETERIZED LEARNING AUTOMATA

Authors

THATHACHAR MAL PHANSALKAR VV

Citation

Mal. Thathachar et Vv. Phansalkar, LEARNING THE GLOBAL MAXIMUM WITH PARAMETERIZED LEARNING AUTOMATA, IEEE transactions on neural networks, 6(2), 1995, pp. 398-406

Citations number

Categorie Soggetti

Computer Application, Chemistry & Engineering","Engineering, Eletrical & Electronic","Computer Science Artificial Intelligence","Computer Science Hardware & Architecture","Computer Science Theory & Methods

Journal title

IEEE transactions on neural networks → ACNP

ISSN journal

10459227

Volume

Issue

Year of publication

1995

Pages

398 - 406

Database

ISI

SICI code

1045-9227(1995)6:2<398:LTGMWP>2.0.ZU;2-T

Abstract

A feedforward network composed of units of teams of parameterized lear ning automata is considered as a model of a reinforcement teaming syst em. The internal state vector of each learning automaton is updated us ing an algorithm consisting of a gradient following term and a random perturbation term. It is shown that the algorithm weakly converges to a solution of the Langevin equation implying that the algorithm global ly maximizes an appropriate function. The algorithm is decentralized, and the units do not have any information exchange during updating. Si mulation results on common payoff games and pattern recognition proble ms show that reasonable rates of convergence can be obtained.