There does not exist a statistical model that shows good performance on all tasks. Consequently, the model selection problem is unavoidable; investigators must decide which model is best at summarizing the data for each task of interest. This article presents an approach to the model selection problem in hierarchical mixtures-of-experts architectures. These architectures combine aspects of generalized linear models with those of finite mixture models in order to perform tasks via a recursive "divide-and-conquer" strategy. Markov chain Monte Carlo methodology is used to estimate the distribution of the architectures' parameters. One part of our approach to model selection attempts to estimate the worth of each component of an architecture so that relatively unused components can be pruned from the architecture's structure. A second part of this approach uses a Bayesian hypothesis testing procedure in order to differentiate inputs that carry useful information from nuisance inputs. Simulation results suggest that the approach presented here adheres to the dictum of Occam's razor; simple architectures that are adequate for summarizing the data are favored over more complex structures. © 1997 Elsevier Science Ltd. All Rights Reserved.
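To make the "divide-and-conquer" structure concrete, the following is a minimal illustrative sketch (not the authors' implementation) of the forward pass of a two-level hierarchical mixture-of-experts: softmax gating networks recursively split the input space, linear experts model each region, and the gates' probabilities recombine the expert predictions. All function names, parameter names, and shapes here are assumptions made for the example.

    import numpy as np

    rng = np.random.default_rng(0)

    def softmax(z):
        # Numerically stable softmax over the last axis.
        z = z - z.max(axis=-1, keepdims=True)
        e = np.exp(z)
        return e / e.sum(axis=-1, keepdims=True)

    def hme_predict(x, top_gate_W, sub_gate_W, expert_W):
        """Mean prediction of a two-level HME for one input vector x.

        top_gate_W : (n_branches, d)             top-level gating weights
        sub_gate_W : (n_branches, n_experts, d)  lower-level gating weights
        expert_W   : (n_branches, n_experts, d)  linear-expert weights
        """
        top_probs = softmax(top_gate_W @ x)              # divide: top-level gate
        y = 0.0
        for i, g_i in enumerate(top_probs):
            sub_probs = softmax(sub_gate_W[i] @ x)       # divide again: nested gate
            expert_means = expert_W[i] @ x               # conquer: expert predictions
            y += g_i * (sub_probs @ expert_means)        # recombine by gating weights
        return y

    # Example usage with random parameters.
    d, n_branches, n_experts = 3, 2, 2
    x = rng.normal(size=d)
    top_gate_W = rng.normal(size=(n_branches, d))
    sub_gate_W = rng.normal(size=(n_branches, n_experts, d))
    expert_W = rng.normal(size=(n_branches, n_experts, d))
    print(hme_predict(x, top_gate_W, sub_gate_W, expert_W))

In the article's Bayesian treatment the parameters above would not be point estimates; their posterior distribution would be approximated by Markov chain Monte Carlo sampling, with component pruning and input selection decided from that posterior.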