ITA
ENG

THE MOS 36-ITEM SHORT-FORM HEALTH SURVEY (SF-36) .3. TESTS OF DATA QUALITY, SCALING ASSUMPTIONS, AND RELIABILITY ACROSS DIVERSE PATIENT GROUPS

Authors

MCHORNEY CA WARE JE LU JFR SHERBOURNE CD

Citation

Ca. Mchorney et al., THE MOS 36-ITEM SHORT-FORM HEALTH SURVEY (SF-36) .3. TESTS OF DATA QUALITY, SCALING ASSUMPTIONS, AND RELIABILITY ACROSS DIVERSE PATIENT GROUPS, Medical care, 32(1), 1994, pp. 40-66

Citations number

Categorie Soggetti

Heath Policy & Services","Public, Environmental & Occupation Heath

Journal title

Medical care → ACNP

ISSN journal

00257079

Volume

Issue

Year of publication

1994

Pages

40 - 66

Database

ISI

SICI code

0025-7079(1994)32:1<40:TM3SHS>2.0.ZU;2-V

Abstract

The widespread use of standardized health surveys is predicated on the largely untested assumption that scales constructed from those survey s will satisfy minimum psychometric requirements across diverse popula tion groups. Data from the Medical Outcomes Study (MOS) were used to e valuate data completeness and quality, test scaling assumptions, and e stimate internal-consistency reliability for the eight scales construc ted from the MOS SF-36 Health Survey. Analyses were conducted among 3, 445 patients and were replicated across 24 subgroups differing in soci odemographic characteristics, diagnosis, and disease severity. For eac h scale, item-completion rates were high across all groups (88% to 95% ), but tended to be somewhat lower among the elderly, those with less than a high school education, and those in poverty. On average, survey s were complete enough to compute scale scores for more than 96% of th e sample. Across patient groups, all scales passed tests for item-inte rnal consistency (97% passed) and item-discriminant validity (99% pass ed). Reliability coefficients ranged from a low of 0.65 to a high of 0 .94 across scales (median = 0.85) and varied somewhat across patient s ubgroups. Floor effects were negligible except for the two role disabi lity scales. Noteworthy ceiling effects were observed for both role di sability stales and the social functioning scale. These findings suppo rt the use of the SF-36 survey across the diverse populations studied and identify population groups in which use of standardized health sta tus measures may or may not be problematic.