RATIONAL APPROXIMATION TECHNIQUES FOR ANALYSIS OF NEURAL NETWORKS

Citation
Ky. Siu et al., RATIONAL APPROXIMATION TECHNIQUES FOR ANALYSIS OF NEURAL NETWORKS, IEEE transactions on information theory, 40(2), 1994, pp. 455-466
Citations number
41
Categorie Soggetti
Information Science & Library Science","Engineering, Eletrical & Electronic
ISSN journal
00189448
Volume
40
Issue
2
Year of publication
1994
Pages
455 - 466
Database
ISI
SICI code
0018-9448(1994)40:2<455:RATFAO>2.0.ZU;2-M
Abstract
Artificial neural networks are comprised of an interconnected collecti on of certain nonlinear devices; examples of commonly used devices inc lude linear threshold elements, sigmoidal elements, and radial-basis e lements. We employ results from harmonic analysis and the theory of ra tional approximation to obtain almost tight lower bounds on the size ( i.e, number of elements) of neural networks. The class of neural netwo rks to which our techniques can be applied is quite general; it includ es any feedforward network in which each element can be piecewise appr oximated by a low degree rational function. For example, we prove that any depth-d + 1 network of sigmoidal units or linear threshold elemen ts computing the parity function of n variables must have OMEGA(dn1/d- epsilon) size, for any fixed epsilon > 0. In addition, we prove that t his lower bound is almost tight by showing that the parity function ca n be computed with 0(dn1/d) sigmoidal units or linear threshold elemen ts in a depth-(d + 1) network. These almost tight bounds are the first known complexity results on the size of neural networks with depth mo re than two. Our lower bound techniques yield a unified approach to th e complexity analysis of various models of neural networks with feedfo rward structures. Moreover, our results indicate that the in the conte xt of computing highly oscillating symmetric Boolean functions, networ ks of continuous-output units such as sigmoidal elements do not offer significant reduction in size compared with networks of linear thresho ld elements of binary outputs.