Ca. Bernaards et K. Sijtsma, Influence of imputation and EM methods on factor analysis when item nonresponse in questionnaire data is nonignorable, MULTIV BE R, 35(3), 2000, pp. 321-364
This study deals with the influence of each of twelve imputation methods an
d two methods using the EM algorithm on the results of maximum likelihood f
actor analysis as compared with results obtained from the complete data fac
tor analysis (no missing scores). Complete questionnaire rating scale data
were simulated and, next, missing item scopes were created under both ignor
able and nonignorable nonresponse mechanisms. Next, imputation methods were
used to fill the gaps and factor analysis was applied to both the original
complete data and to the data sets including imputed scores. Each imputati
on method was implemented once with residual error and once without residua
l error. Also, one EM method estimated the factor loadings directly and the
other estimated the complete data covariance matrix, which subsequently wa
s factor analyzed. A design was analyzed with design factors Latent Trait S
tructure (technically called Mixing Configuration), Correlation Between Lat
ent Traits, Nonresponse Mechanism, Percentage of Missingness, Sample Size,
and Imputation Method. We found that, in general, methods that impute a sco
re based on a respondent's mean score obtained from his/her observed item s
cores best recovered the factor loadings structure from the complete data.
Moreover, for unidimensional data person mean methods with a residual error
gave better results than the other imputation methods, either with or with
out a residual error component. For the EM methods a smaller design was ana
lyzed. The conclusion was that both EM methods better recovered the complet
e data factor loadings than the imputation methods.