Evaluation of old and new tests of heterogeneity in epidemiologic meta-analysis

Citation
B. Takkouche et al., Evaluation of old and new tests of heterogeneity in epidemiologic meta-analysis, AM J EPIDEM, 150(2), 1999, pp. 206-215
Citations number
21
Categorie Soggetti
Envirnomentale Medicine & Public Health","Medical Research General Topics
Journal title
AMERICAN JOURNAL OF EPIDEMIOLOGY
ISSN journal
00029262 → ACNP
Volume
150
Issue
2
Year of publication
1999
Pages
206 - 215
Database
ISI
SICI code
0002-9262(19990715)150:2<206:EOOANT>2.0.ZU;2-W
Abstract
The identification of heterogeneity in effects between studies is a key iss ue in meta-analyses of observational studies, since it is critical for dete rmining whether it is appropriate to pool the individual results into one s ummary measure. The result of a hypothesis test is often used as the decisi on criterion. In this paper, the authors use a large simulation study patte rned from the key features of five published epidemiologic meta-analyses to investigate the type I error and statistical power of five previously prop osed asymptotic homogeneity tests, a parametric bootstrap version of each o f the tests, and tau(2)-bootstrap, a test proposed by the authors. The resu lts show that the asymptotic DerSimonian and Laird Q statistic and the boot strap versions of the other tests give the correct type I error under the n ull hypothesis but that all of the tests considered have low statistical po wer, especially when the number of studies included in the meta-analysis is small (<20). From the point of view of validity, power, and computational ease, the Q statistic is clearly the best choice. The authors found that th e performance of all of the tests considered did not depend appreciably upo n the value of the pooled odds ratio, both for size and for power. Because tests for heterogeneity will often be underpowered, random effects models c an be used routinely, and heterogeneity can be quantified by means of R-t, the proportion of the total variance of the pooled effect measure due to be tween-study variance, and CVB, the between-study coefficient of variation.