ITA
ENG

Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm

Authors

Booth, JG Hobert, JP

Citation

Jg. Booth et Jp. Hobert, Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm, J ROY STA B, 61, 1999, pp. 265-285

Citations number

Categorie Soggetti

Mathematics

Journal title

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY

ISSN journal

13697412 → ACNP

Volume

Year of publication

1999

Part

Pages

265 - 285

Database

ISI

SICI code

1369-7412(1999)61:<265:MGLMML>2.0.ZU;2-F

Abstract

Two new implementations of the EM algorithm are proposed for maximum likeli hood fitting of generalized linear mixed models. Both methods use random (i ndependent and identically distributed) sampling to construct Monte Carlo a pproximations at the E-step. One approach involves generating random sample s from the exact conditional distribution of the random effects (given the data) by rejection sampling, using the marginal distribution as a candidate . The second method uses a multivariate t importance sampling approximation . In many applications the two methods are complementary. Rejection samplin g is more efficient when sample sizes are small, whereas importance samplin g is better with larger sample sizes. Monte Carte approximation using rando m samples allows the Monte Carlo error at each iteration to be assessed by using standard central limit theory combined with Taylor series methods. Sp ecifically, we construct a sandwich variance estimate for the maximizer at each approximate E-step. This suggests a rule for automatically increasing the Monte Carlo sample size after iterations in which the true EM step is s wamped by Monte Carlo error. In contrast, techniques for assessing Monte Ca rlo error have not been developed for use with alternative implementations of Monte Carlo EM algorithms utilizing Markov chain Monte Carlo E-step appr oximations. Three different data sets, including the infamous salamander da ta of McCullagh and Nelder, are used to illustrate the techniques and to co mpare them with the alternatives. The results show that the methods propose d can be considerably more efficient than those based on Markov chain Monte Carlo algorithms. However, the methods proposed may break down when the in tractable integrals in the likelihood function are of high dimension.