POWER DETERMINATION FOR GEOGRAPHICALLY CLUSTERED DATA USING GENERALIZED ESTIMATING EQUATIONS

Citation
Sa. Hendricks et al., POWER DETERMINATION FOR GEOGRAPHICALLY CLUSTERED DATA USING GENERALIZED ESTIMATING EQUATIONS, Statistics in medicine, 15(17-18), 1996, pp. 1951-1960
Citations number
28
Categorie Soggetti
Statistic & Probability","Medicine, Research & Experimental","Public, Environmental & Occupation Heath","Statistic & Probability","Medical Informatics
Journal title
ISSN journal
02776715
Volume
15
Issue
17-18
Year of publication
1996
Pages
1951 - 1960
Database
ISI
SICI code
0277-6715(1996)15:17-18<1951:PDFGCD>2.0.ZU;2-O
Abstract
Study designs in public health research often require the estimation o f intervention effects that have been applied to a cluster of subjects in a common geographic area, rather than randomly assigned to individ ual subjects, and where the outcome is dichotomous. Statistical method s that account for the intracluster correlation of measurements must b e used or the standard errors of regression coefficients will be under estimated. Generalized estimating equations (GEE) can be used to accou nt for this correlation, although there are no straightforward methods to determine sample-size requirements for adequate power. A simulatio n study was performed to calculate power in a GEE model for a proposed study of the effect of an intervention, designed to reduce lower-back injuries among nursing personnel employed in nursing homes. Nursing h omes will be randomly assigned to either an intervention or control gr oup and all employees within a nursing home will be treated alike. His torical injury data indicates that the baseline-injury risk for each h ome can be reasonably modelled using a beta distribution. It is assume d that the risk for any individual nurse within a nursing home follows a Bernoulli probability distribution expressed as a legit function of fixed covariates, which have values of odds ratios determined from pr evious studies which represent characteristics of the study population , and a random-intercept term which is specific for each home. Results indicate that failure to account for intracluster correlation can lea d to overestimates of power as well as inflation of type I error by as much as 20 per cent. Although the GEE method accounted for the intrac luster correlation when present, estimates of the intracluster correla tion were negatively biased when no intracluster correlation was prese nt. In addition, and possibly related to the negatively biased estimat es of intracluster correlation, we also found inflated type I error es timates from the GEE method.