A story: Unpaired adenosine bases in ribosomal RNAs

Citation
Rr. Gutell et al., A story: Unpaired adenosine bases in ribosomal RNAs, J MOL BIOL, 304(3), 2000, pp. 335-354
Citations number
54
Categorie Soggetti
Molecular Biology & Genetics
Journal title
JOURNAL OF MOLECULAR BIOLOGY
ISSN journal
00222836 → ACNP
Volume
304
Issue
3
Year of publication
2000
Pages
335 - 354
Database
ISI
SICI code
0022-2836(200012)304:3<335:ASUABI>2.0.ZU;2-E
Abstract
In 1985 an analysis of the Escherichia coli 16 S rRNA covariation-based str ucture model revealed a strong bias for unpaired adenosines. The same analy sis revealed that the majority of the G, C, and U bases were paired. These biases are (now:) consistent with the high percentage of unpaired adenosine nucleotides in several structure motifs. An analysis of a larger set of bacterial comparative 16 S and 23 S rRNA str ucture models has substantiated this initial finding and revealed new biase s in the distribution of adenosine nucleotides in loop regions. The majorit y of the adenosine nucleotides are unpaired, while the majority of the G, C :, and U bases are paired in the covariation-based structure model. The unp aired adenosine nucleotides predominate in the middle and at the 3' end of loops, and are the second most frequent nucleotide type at the 5' end of lo ops (G is the most common nucleotide). There are additional biases for unpa ired adenosine nucleotides at the 3' end of loops and adjacent to a G at th e 5' end of the helix. The most prevalent consecutive nucleotides are GG, G A, AG, and AA. A total of 70% of the GG sequences are within helices, while more than 70% of the AA sequences are unpaired. Nearly 50% of the GA seque nces are unpaired, and approximately one-third of the AG sequences are with in helices while another third are at the 3' loop.5' helix junction. Unpaired positions with an adenosine nucleotide in more than 50% of the seq uences at the 3' end of 16 S and 23 S rRNA loops were identified and arrang ed into the A-motif categories XAZ AAZ XAG AAG and AAG:U, where G or Z is p aired, G:U is a base-pair, and X is not an A and Z is not a G in more than 50% of the sequences. These sequence motifs were associated with several st ructural motifs, such as adenosine platforms, E and E-like loops, A:A and A :G pairings at the end of helices, G:A tandem blase-pairs, GNRA tetraloop h airpins, and U-turns. (C) 2000 Academic Press