An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: The Adh region

Citation
M. Ashburner et al., An exploration of the sequence of a 2.9-Mb region of the genome of Drosophila melanogaster: The Adh region, GENETICS, 153(1), 1999, pp. 179-219
Citations number
261
Categorie Soggetti
Biology,"Molecular Biology & Genetics
Journal title
GENETICS
ISSN journal
00166731 → ACNP
Volume
153
Issue
1
Year of publication
1999
Pages
179 - 219
Database
ISI
SICI code
0016-6731(199909)153:1<179:AEOTSO>2.0.ZU;2-O
Abstract
A contiguous sequence of nearly 3 Mb from the genome of Drosophila melanoga ster has been sequenced from a series of overlapping P1 and BAC clones. Thi s region covers 69 chromosome polytene bands on chromosome arm 2L, includin g the genetically well-characterized "Adh region." A computational analysis of the sequence predicts 218 protein-coding genes, 11 tRNAs, and iii trans posable element sequences. At least 38 of the protein-coding genes are arra nged in clusters of from 2 to 6 closely related genes, suggesting extensive tandem duplication. The gene density is one protein-coding gene every 13 k b; the transposable element density is one element every 171 kb. Of 73 gene s in this region identified by genetic analysis, 49 have been located on th e sequence; P-element insertions have been mapped to 43 genes. Ninety-five (44%) of the known and predicted genes match a Drosophila EST, and 144 (66% ) have clear similarities to proteins in other organisms. Genes known to ha ve mutant phenotypes are more likely to be represented in cDNA libraries, a nd far more likely to have products similar to proteins of other organisms, than are genes with no known mutant phenotype. Over 650 chromosome aberrat ion breakpoints map to this chromosome region, and their nonrandom distribu tion on the genetic map reflects variation in gene spacing on the DNA. This is the first large-scale analysis of the genome of D. melanogaster at the sequence level. In addition to the direct results obtained, this analysis h as allowed us to develop and test methods that will be needed to interpret the complete sequence of the genome of this species.