Criteria for the validation of surrogate endpoints in randomized experiments

Citation
M. Buyse et G. Molenberghs, Criteria for the validation of surrogate endpoints in randomized experiments, BIOMETRICS, 54(3), 1998, pp. 1014-1029
Citations number
31
Categorie Soggetti
Biology,Multidisciplinary
Journal title
BIOMETRICS
ISSN journal
0006341X → ACNP
Volume
54
Issue
3
Year of publication
1998
Pages
1014 - 1029
Database
ISI
SICI code
0006-341X(199809)54:3<1014:CFTVOS>2.0.ZU;2-7
Abstract
The validation of surrogate endpoints has been studied by Prentice (1989, S tatistics in Medicine 8, 431-440) and Freedman, Graubard, and Schatzkin (19 92, Statistics in Medicine 11, 167-178), We extend their proposals in the c ases where the surrogate and the final endpoints are both binary or normall y distributed. Letting T and S be random variables that denote the true and surrogate endpoint, respectively, and Z be an indicator variable for treat ment, Prentice's criteria are fulfilled if Z has a significant effect on T and on S, if S has a significant effect on T, and if Z has no effect on T g iven S. Freedman relaxed the latter criterion by estimating PE, the proport ion of the effect of Z on T that is explained by S, and by requiring that t he lower confidence limit of PE be larger than some proportion, say 0.5 or 0.75. This condition can only be verified if the treatment has a massively significant effect on the true endpoint, a rare situation. We argue that tw o other quantities must be considered in the validation of a surrogate endp oint: RE, the effect of Z on T relative to that of Z on S, and gamma(Z), th e association between S and T after adjustment for Z. A surrogate is said t o be perfect at the individual level when there is perfect association betw een the surrogate and the final endpoint after adjustment for treatment. A surrogate is said to be perfect at the population level if RE is 1. A perfe ct surrogate fulfills both conditions, in which case S and T are identical up to a deterministic transformation. Fieller's theorem is used for the est imation of PE, RE, and their respective confidence intervals. Logistic regr ession models and the global odds ratio model studied by Dale (1986, Biomet rics 42, 909-917) are used for binary endpoints. Linear models are employed for continuous endpoints. In order to be of practical value, the validatio n of surrogate endpoints is shown to require large numbers of observations.