This study investigated the comparability of Angoff-based item ratings
on a general education test battery made by judges from within-conten
t specialties and across content domains. Judges were from English, ma
thematics, science, and social studies specialities in teacher educati
on programs in a midwestern state. Cutscores established from the judg
es' ratings of out-of-content items differed little from the cutscores
set using the ratings made by the content specialists. Further, out-o
f-content ratings by judges were not more influence by performance dat
a than were the ratings provided by judges rating items within their c
ontent speciality. The degree to which these results generalize to oth
er content specialties needs to be investigated.