Mr. Nelson et al., A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation, GENOME RES, 11(3), 2001, pp. 458-470
Recent advances in genome research have accelerated the process of locating
candidate genes and the variable sites within them and have simplified the
task of genotype measurement. The development of statistical and computati
onal strategies to utilize information on hundreds - soon thousands - of va
riable loci to investigate the relationships between genome variation and p
henotypic variation has not kept pace, particularly for quantitative traits
that do not follow simple Mendelian patterns of inheritance. We present he
re the combinatorial partitioning method (CPM) that examines multiple genes
, each containing multiple variable loci, to identify partitions of multilo
cus genotypes that predict interindividual variation in quantitative trait
levels. We illustrate this method with an application to plasma triglycerid
e levels collected on 188 males, ages 20-60 yr, ascertained without regard
to health status, from Rochester, Minnesota. Genotype information included
measurements at 18 diallelic loci in six coronary heart disease-candidate s
usceptibility gene regions: APOA1-C3-A4, APOB, APOE, LDLR, LPL, and PON1. T
o illustrate the CPM, we evaluated all possible partitions of two-locus gen
otypes into two to nine partitions (similar to 10(6) evaluations). We found
that many combinations of loci are involved in sets of genotypic partition
s that predict triglyceride variability and that the most predictive sets s
how nonadditivity. These results suggest that traditional methods of buildi
ng multilocus models that rely on statistically significant marginal, singl
e-locus effects, may fail to identify combinations of loci that best predic
t trait variability. The CPM offers a strategy for exploring the high-dimen
sional genotype state space so as to predict the quantitative trait variati
on in the population at large that does not require the conditioning of the
analysis on a prespecified genetic model.