Gm. Lee et al., The relative appropriateness of eight measurement models for analyzing scores from tests composed of testlets, EDUC PSYC M, 61(6), 2001, pp. 958-975
It has been shown that fundamental assumptions associated with conventional
one-factor measurement models are frequently violated in analyses of score
s from a test composed of testlets, Eight different measurement models were
conceptualized for this kind of situation, and the goodness of fit of each
model was examined. Conventional essentially tau-equivalent and congeneric
models present worse model fit to data and overestimate the reliability wh
en testlets are involved. The one-factor congeneric model with correlated e
rror specifications seems to be the best measurement model for a test compo
sed of testlets if dichotomously scored items are used as the unit of analy
sis. However, in estimating score reliability for tests composed of testlet
s, the one-factor essentially tau-equivalent model with correlated error sp
ecifications also provides good estimates. Measurement models using passage
(testlet) scores would be alternatives for analyzing scores from tests com
posed of testlets when passage (testlet) scores are used as the unit of ana
lysis.