In 1985 an analysis of the Escherichia coli 16 S rRNA covariation-based str
ucture model revealed a strong bias for unpaired adenosines. The same analy
sis revealed that the majority of the G, C, and U bases were paired. These
biases are (now:) consistent with the high percentage of unpaired adenosine
nucleotides in several structure motifs.
An analysis of a larger set of bacterial comparative 16 S and 23 S rRNA str
ucture models has substantiated this initial finding and revealed new biase
s in the distribution of adenosine nucleotides in loop regions. The majorit
y of the adenosine nucleotides are unpaired, while the majority of the G, C
:, and U bases are paired in the covariation-based structure model. The unp
aired adenosine nucleotides predominate in the middle and at the 3' end of
loops, and are the second most frequent nucleotide type at the 5' end of lo
ops (G is the most common nucleotide). There are additional biases for unpa
ired adenosine nucleotides at the 3' end of loops and adjacent to a G at th
e 5' end of the helix. The most prevalent consecutive nucleotides are GG, G
A, AG, and AA. A total of 70% of the GG sequences are within helices, while
more than 70% of the AA sequences are unpaired. Nearly 50% of the GA seque
nces are unpaired, and approximately one-third of the AG sequences are with
in helices while another third are at the 3' loop.5' helix junction.
Unpaired positions with an adenosine nucleotide in more than 50% of the seq
uences at the 3' end of 16 S and 23 S rRNA loops were identified and arrang
ed into the A-motif categories XAZ AAZ XAG AAG and AAG:U, where G or Z is p
aired, G:U is a base-pair, and X is not an A and Z is not a G in more than
50% of the sequences. These sequence motifs were associated with several st
ructural motifs, such as adenosine platforms, E and E-like loops, A:A and A
:G pairings at the end of helices, G:A tandem blase-pairs, GNRA tetraloop h
airpins, and U-turns. (C) 2000 Academic Press