Statistical mechanics of EKF learning in neural networks

Citation
B. Schottky et D. Saad, Statistical mechanics of EKF learning in neural networks, J PHYS A, 32(9), 1999, pp. 1605-1621
Citations number
28
Categorie Soggetti
Physics
Journal title
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL
ISSN journal
03054470 → ACNP
Volume
32
Issue
9
Year of publication
1999
Pages
1605 - 1621
Database
ISI
SICI code
0305-4470(19990305)32:9<1605:SMOELI>2.0.ZU;2-O
Abstract
We formulate a learning algorithm for online learning in neural networks us ing the extended Kalman filter approach, providing a principled and practic able approximation to the full Bayesian treatment. The latter, which consti tutes optimal learning, does not require artificial setting of training par ameters and allows for the estimation of a wide range of quantities of inte rest. We analyse the performance of the algorithm using tools of statistica l physics in several scenarios: we look at drifting rules represented by li near and nonlinear perceptrons and investigate how different prior settings affect the generalization performance as well as learnability itself. We i nvestigate the learning behaviour of stationary two-layer network, where th e algorithm seems to avoid the, otherwise common, problem of long symmetric plateaus.