Multiple imputation for multivariate missing-data problems: A data analyst's perspective

Citation
Jl. Schafer et Mk. Olsen, Multiple imputation for multivariate missing-data problems: A data analyst's perspective, MULTIV BE R, 33(4), 1998, pp. 545-571
Citations number
19
Categorie Soggetti
Psycology
Journal title
MULTIVARIATE BEHAVIORAL RESEARCH
ISSN journal
00273171 → ACNP
Volume
33
Issue
4
Year of publication
1998
Pages
545 - 571
Database
ISI
SICI code
0027-3171(1998)33:4<545:MIFMMP>2.0.ZU;2-R
Abstract
Analyses of multivariate data are frequently hampered by missing values. Un til recently, the only missing-data methods available to most data analysts have been relatively ad hoc practices such as listwise deletion. Recent dr amatic advances in theoretical and computational statistics, however, have produced a new generation of flexible procedures with a sound statistical b asis. These procedures involve multiple imputation (Rubin, 1987), a simulat ion technique that replaces each missing datum with a set of m > 1 plausibl e values. The Nz versions of the complete data are analyzed by standard com plete-data methods, and the results are combined using simple ru les to yie ld estimates, standard errors, and p-values that formally incorporate missi ng-data uncertainty. New computational algorithms and software described in a recent book (Schafer, 1997a) allow us to create proper multiple imputati ons in complex multivariate settings. This article reviews the key ideas of multiple imputation, discusses the software programs currently available, and demonstrates their use on data from the Adolescent Alcohol Prevention T rial (Hansen & Graham, 1991).