ITA
ENG

ACCELERATING LEARNING OF NEURAL NETWORKS WITH CONJUGATE GRADIENTS FORNUCLEAR-POWER-PLANT APPLICATIONS

Authors

REIFMAN J VITELA JE

Citation

J. Reifman et Je. Vitela, ACCELERATING LEARNING OF NEURAL NETWORKS WITH CONJUGATE GRADIENTS FORNUCLEAR-POWER-PLANT APPLICATIONS, Nuclear technology, 106(2), 1994, pp. 225-241

Citations number

Categorie Soggetti

Nuclear Sciences & Tecnology

Journal title

Nuclear technology → ACNP

ISSN journal

00295450

Volume

106

Issue

Year of publication

1994

Pages

225 - 241

Database

ISI

SICI code

0029-5450(1994)106:2<225:ALONNW>2.0.ZU;2-D

Abstract

The method of conjugate gradients is used to expedite the learning pro cess of feedforward multilayer artificial neural networks and to syste matically update both the learning parameter and the momentum paramete r at each training cycle. The mechanism for the occurrence of prematur e saturation of the network nodes observed with the backpropagation al gorithm is described, suggestions are made to eliminate this undesirab le phenomenon, and the reason by which this phenomenon is precluded in the method of conjugate gradients is presented. The proposed method i s compared with the standard backpropagation algorithm in the training of neural networks to classify transient events in nuclear power plan ts simulated by the Midland Nuclear Power Plant Unit 2 simulator. The comparison results indicate that the rate of convergence of the propos ed method is much greater than the standard backpropagation, that it r educes both the number of training cycles and the CPU time, and that i t is less sensitive to the choice of initial weights. The advantages o f the method are more noticeable and important for problems where the network architecture consists of a large number of nodes, the training database is large, and a tight convergence criterion is desired.