Motivation: Completely sequenced genomes allow for detection and analysis o
f the relatively weak periodicities of 10-11 basepairs (bp). Two sources co
ntribute to such signals: correlations in the corresponding protein sequenc
es due to the amphipatic character of alpha-helices and the folding of DNA
(nucleosomal patterns DNA supercoiling). Since the topological state of gen
omic DNA is of importance for its replication, recombination and transcript
ion, there is an immediate interest to obtain information about the superco
iled state from sequence periodicities.
Results: We show that correlations within proteins affect mainly the oscill
ations at distances below 35 bp. The long-ranging correlations up to 100 bp
reflect primarily DNA folding. For the yeast genome these oscillations are
consistent in detail with the chromatin structure. For eubacteria and arch
aea the periods deviate significantly from the 10.55 bp value for free DNA.
These deviations suggest that while a period of 11 bp in bacteria reflects
negative supercoiling, the significantly different period of thermophilic
archaea close to 10 bp corresponds to positive supercoiling of thermophilic
archaeal genomes.