E. Pauws et al., Heterogeneity in polyadenylation cleavage sites in mammalian mRNA sequences: implications for SAGE analysis, NUCL ACID R, 29(8), 2001, pp. 1690-1694
The analysis of a human thyroid serial analysis of gene expression (SAGE) l
ibrary shows the presence of an abundant SAGE tag corresponding to the mRNA
of thyroglobulin (TG). Additional, less abundant tags are present that can
not be linked to any other known gene, but show considerable homology to t
he wild-type TG tag. To determine whether these tags represent TG mRNA mole
cules with alternative cleavage, 3'-RACE clones were sequenced. The results
show that the three putative TG SAGE tags can be attributed to TG transcri
pts and reflect the use of alternative polyadenylation cleavage sites downs
tream of a single polyadenylation signal in vivo. By screening more than 30
0 000 sequences corresponding to human, mouse and rat transcripts for this
phenomenon we show that a considerable percentage of mRNA transcripts (44%
human, 22% mouse and 22% rat) show cleavage site heterogeneity. When analyz
ing SAGE-generated expression data, this phenomenon should be considered, s
ince, according to our calculations, 2.8% of human transcripts show two or
more different SAGE tags corresponding to a single gene because of alternat
ive cleavage site selection. Both experimental and in silico data show that
the selection of the specific cleavage site for poly(A) addition using a g
iven polyadenylation signal is more variable than was previously thought.