Statistical description of interrater variability in ordinal ratings

Citation
Jc. Nelson et Ms. Pepe, Statistical description of interrater variability in ordinal ratings, STAT ME M R, 9(5), 2000, pp. 475-496
Citations number
42
Categorie Soggetti
Health Care Sciences & Services
Journal title
STATISTICAL METHODS IN MEDICAL RESEARCH
ISSN journal
09622802 → ACNP
Volume
9
Issue
5
Year of publication
2000
Pages
475 - 496
Database
ISI
SICI code
0962-2802(200010)9:5<475:SDOIVI>2.0.ZU;2-J
Abstract
Ordinal categorical assessments are common in medical practice and in resea rch. Variability in such measurements amongst raters making the assessments can be problematic. In this paper we consider how such variability can be described statistically. We review three current approaches, including kapp a-type statistics, loglinear models for agreement, and latent class agreeme nt models, and discuss their limitations. We present a new graphical approa ch to describing interrater variability that involves a simple frequency di stribution display of the category probabilities. The method enables descri ption of interrater variability when raters are a random sample from some p opulation as opposed to the traditional setting in which only a few selecte d raters provide assessments. Advantages of this approach relative to curre nt approaches include the following: (1) it provides a simple visual summar y of the rating data, (2) description is closely linked to familiar methods for describing variability in continuous measurements, (3) interpretation is straightforward, and (4) a large sample of raters can be accommodated wi th ease. We illustrate the method on simulated ordinal data representing ra diologists' ratings of mammography images and on rating data from a nationa l image reading study of mammography screening.