CONTINUOUS ACTION SET LEARNING AUTOMATA FOR STOCHASTIC OPTIMIZATION

Citation
G. Santharam et al., CONTINUOUS ACTION SET LEARNING AUTOMATA FOR STOCHASTIC OPTIMIZATION, Journal of the Franklin Institute, 331B(5), 1994, pp. 607-628
Citations number
NO
Categorie Soggetti
Mathematics,"Engineering, Mechanical
ISSN journal
00160032
Volume
331B
Issue
5
Year of publication
1994
Pages
607 - 628
Database
ISI
SICI code
0016-0032(1994)331B:5<607:CASLAF>2.0.ZU;2-D
Abstract
The problem of optimization with noisy measurements is common in many areas of engineering. The only available information is the noise corr upted value of the objective function at any chosen point in the param eter space. One well known method for solving this problem is the stoc hastic approximation procedure. In this paper we consider an adaptive random search procedure, based on the reinforcement learning paradigm. The learning model presented here generalizes the traditional model o f a learning automaton [Narendra and Thathachar, Learning Automataa: A n Introduction, Prentice Hall, Englewood Cliffs, 1989]. This procedure requires a lesser number of function evaluations at each step compare d to the stochastic approximation. The convergence properties of the a lgorithm are theoretically investigated. Simulation results are presen ted to show the efficacy of the learning method.