A SIMPLIFICATION OF THE BACKPROPAGATION-THROUGH-TIME ALGORITHM FOR OPTIMAL NEUROCONTROL

Citation
H. Bersini et V. Gorrini, A SIMPLIFICATION OF THE BACKPROPAGATION-THROUGH-TIME ALGORITHM FOR OPTIMAL NEUROCONTROL, IEEE transactions on neural networks, 8(2), 1997, pp. 437-441
Citations number
11
Categorie Soggetti
Computer Application, Chemistry & Engineering","Engineering, Eletrical & Electronic","Computer Science Artificial Intelligence","Computer Science Hardware & Architecture","Computer Science Theory & Methods
ISSN journal
10459227
Volume
8
Issue
2
Year of publication
1997
Pages
437 - 441
Database
ISI
SICI code
1045-9227(1997)8:2<437:ASOTBA>2.0.ZU;2-K
Abstract
Backpropagation-through-time (BPTT) is the temporal extension of backp ropagation which allows a multilayer neural network to approximate an optimal state-feedback control law provided some prior knowledge (Jaco bian matrixes) of the process is available. In this paper, a simplifie d version of the BPTT algorithm is proposed which more closely respect s the principle of optimality of dynamic programming. Besides being si mpler, the new algorithm is less time-consuming and allows in some cas es the discovery of better control laws. A formal justification of thi s simplification is attempted by mixing the Lagrangian calculus underl ying BPTT with Bellman-Hamilton-Jacobi equations. The improvements due to this simplification are illustrated by two optimal control problem s: the rendezvous and the bioreactor.