Background The search for interaction effects is common in epidemiological
studies, but the power of such studies is a major concern. This is a practi
cal issue as many future studies will wish to investigate potential gene-ge
ne and gene-environment interactions and therefore need to be planned on th
e basis of appropriate sample size calculations.
Methods and the Results The underlying model considered in this paper is a
simple linear regression and relating a continuous outcome to a continuousl
y distributed exposure variable. Results The slope of the regression line i
s taken to be dependent on genotype, and the ratio of the slopes for each g
enotype is considered as the interaction parameter. Sample size, is affecte
d by the allele frequency and whether the genetic model is dominant or rece
ssive. It is also critically dependent upon the size of the association bet
ween exposure and outcome, and the strength of the interaction term. The li
nk between these determinants is graphically displayed to allow sample size
and power to be estimated. An example of the analysis of the association b
etween physical activity and glucose intolerance demonstrates how informati
on from previous studies can be used to determine the sample size required
to examine gene-environment interactions.
Conclusions The formulae allowing the computation of the sample size requir
ed to study the interaction between a continuous environmental exposure and
a genetic factor on a continuous outcome variable should have a practical
utility in assisting the design of studies of appropriate power.