PRUNING FROM ADAPTIVE REGULARIZATION

Citation
Lk. Hansen et Ce. Rasmussen, PRUNING FROM ADAPTIVE REGULARIZATION, Neural computation, 6(6), 1994, pp. 1223-1232
Citations number
11
Categorie Soggetti
Computer Sciences","Computer Science Artificial Intelligence",Neurosciences
Journal title
ISSN journal
08997667
Volume
6
Issue
6
Year of publication
1994
Pages
1223 - 1232
Database
ISI
SICI code
0899-7667(1994)6:6<1223:PFAR>2.0.ZU;2-M
Abstract
Inspired by the recent upsurge of interest in Bayesian methods we cons ider adaptive regularization. A generalization based scheme for adapta tion of regularization parameters is introduced and compared to Bayesi an regularization. We show that pruning arises naturally within both a daptive regularization schemes. As model example we have chosen the si mplest possible: estimating the mean of a random variable with known v ariance. Marked similarities are found between the two methods in that they both involve a ''noise limit,'' below which they regularize with infinite weight decay, i.e., they prune. However, pruning is not alwa ys beneficial. We show explicitly that both methods in some cases may increase the generalization error. This corresponds to situations wher e the underlying assumptions of the regularizer are poorly matched to the environment.