The predictive abilities of two-group classification models (CMs) are often
expressed in terms of their Cooper statistics. These statistics are often
reported without any indication of their uncertainty, making it impossible
to judge whether the predicted classifications are significantly better tha
n the predictions made by a different CM, or whether the predictive perform
ance of the CM exceeds predefined performance criteria in a statistically s
ignificant way. Bootstrap resampling routines are reported that provide a m
eans of expressing the uncertainty associated with Cooper statistics. The u
sefulness of the bootstrapping routines is illustrated by constructing 95%
confidence intervals for the Cooper statistics of four alternative skin-cor
rosivity tests (the rat skin transcutaneous electrical resistance assay, EP
ISKIN (TM), Skin(2)(TM) and CORROSITEX (TM)), and four two-step sequences i
n which each in vitro test is used in combination with a physicochemical te
st for skin corrosion based on pH measurements.