Ce. Lopeznieto et Sk. Nigam, SELECTIVE AMPLIFICATION OF PROTEIN-CODING REGIONS OF LARGE SETS OF GENES USING STATISTICALLY DESIGNED PRIMER SETS, Nature biotechnology, 14(7), 1996, pp. 857-861
We describe a novel approach to design a set of primers selective for
large groups of genes. This method is based on the distribution freque
ncy of all nucleotide combinations (octa- to decanucleotides), and the
combined ability of primer pairs, based on these oligonucleotides, to
detect genes. By analyzing 1000 human mRNAs, we found that a surprisi
ngly small subset of octanucleotides is shared by a high proportion of
human protein-coding region sense strands. By computer simulation of
polymerase chain reactions, a set based on only 30 primers was able to
detect approximately 75% of known (and presumably unknown) human prot
ein-coding regions, To validate the method and provide experimental su
pport for the feasibility of the more ambitious goal of targeting huma
n protein-coding regions, we sought to apply the technique to a large
protein family: G-protein coupled receptors (GPCRs). Our results indic
ate that there is sufficient low level homology among human coding reg
ions to allow design of a limited set of primer pairs that can selecti
vely target coding regions in general, as well as genomic subsets (e.g
., GPCRs). The approach should be generally applicable to human coding
regions, and thus provide an efficient method for analyzing much of t
he transcriptionally active human genome.