LOCAL AND GLOBAL OPTIMIZATION ALGORITHMS FOR GENERALIZED LEARNING AUTOMATA

Citation
Vv. Phansalkar et Mal. Thathachar, LOCAL AND GLOBAL OPTIMIZATION ALGORITHMS FOR GENERALIZED LEARNING AUTOMATA, Neural computation, 7(5), 1995, pp. 950-973
Citations number
21
Categorie Soggetti
Computer Sciences","Computer Science Artificial Intelligence",Neurosciences
Journal title
ISSN journal
08997667
Volume
7
Issue
5
Year of publication
1995
Pages
950 - 973
Database
ISI
SICI code
0899-7667(1995)7:5<950:LAGOAF>2.0.ZU;2-U
Abstract
This paper analyzes the long-term behavior of the REINFORCE and relate d algorithms (Williams 1986, 1988, 1992) for generalized learning auto mata (Narendra and Thathachar 1989) for the associative reinforcement learning problem (Barto and Anandan 1985). The learning system conside red here is a feedforward connectionist network of generalized learnin g automata units. We show that REINFORCE is a gradient ascent algorith m but can exhibit unbounded behavior. A modified version of this algor ithm, based on constrained optimization techniques, is suggested to ov ercome this disadvantage. The modified algorithm is shown to exhibit l ocal optimization properties. A global version of the algorithm, based on constant temperature heat bath techniques, is also described and s hown to converge to the global maximum. All algorithms are analyzed us ing weak convergence techniques.