L. Cardle et al., Computational and experimental characterization of physically clustered simple sequence repeats in plants, GENETICS, 156(2), 2000, pp. 847-854
The type and frequency of simple sequence repeats (SSRs) in plant genomes w
as investigated using the expanding quantity of DNA sequence data deposited
in public databases. In Arabidopsis, 306 genomic DNA sequences longer than
10 kb and 36,199 EST sequences were searched for all possible mono- to pen
tanucleotide repeats. The average frequency of SSRs was one every 6.04 kb i
n genomic DNA decreasing to one every 14 kb in ESTs. SSR frequency and type
differed between coding, intronic, and intergenic DNA. Similar frequencies
were found in other plant species. On the basis of these findings, an appr
oach is proposed and demonstrated for the targeted isolation of single or m
ultiple, physically clustered SSRs linked to ally gene that has been mapped
using low-copy DNA-based markers. The approach involves sample sequencing
a small number of subclones of selected randomly sheared large insert DNA c
lones (e.g., BACs). It is shown to be both feasible and practicable, given
the probability of fortuitously sequencing through an SSR. The approach is
demonstrated in barley where sample sequencing 34 subclones of a single BAC
selected by hybridization to the Big1 gene revealed three SSRs. These allo
wed Big1 to be located at the top of barley linkage group 6HS.