Escherichia coli genome is composed of two distinct types of nucleotide sequences

Authors
Citation
D. Haring et J. Kypr, Escherichia coli genome is composed of two distinct types of nucleotide sequences, BIOC BIOP R, 272(2), 2000, pp. 571-575
Citations number
33
Categorie Soggetti
Biochemistry & Biophysics
Journal title
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS
ISSN journal
0006291X → ACNP
Volume
272
Issue
2
Year of publication
2000
Pages
571 - 575
Database
ISI
SICI code
0006-291X(20000607)272:2<571:ECGICO>2.0.ZU;2-P
Abstract
We calculated correlations of the nucleotide distributions along the E. col i genome. Subsequent cluster analysis of the correlation distributions show ed that the genome was composed of two qualitatively different types of nuc leotide sequences. The first type exhibited strong correlations of the geno mic distributions of A with T and G with C, and high anticorrelations of A with C and G with T. In contrast, the second type was characterized by weak or negligible correlations typical of randomized sequences, Both types of sequences were almost equally abundant in the E, coli genome and their leng th varied from several hundred nucleotides to about 70 kilobases, They were not disjunct with respect to their (G + C) content but the high correlatio ns and anticorrelations were rather characteristic for (A + T)-rich genomic segments, We offer possible explanations of the mosaic structure of the E, coli genome. (C) 2000 Academic Press.