Origin and properties of non-coding ORFs in the yeast genome

Citation
P. Mackiewicz et al., Origin and properties of non-coding ORFs in the yeast genome, NUCL ACID R, 27(17), 1999, pp. 3503-3509
Citations number
19
Categorie Soggetti
Biochemistry & Biophysics
Journal title
NUCLEIC ACIDS RESEARCH
ISSN journal
03051048 → ACNP
Volume
27
Issue
17
Year of publication
1999
Pages
3503 - 3509
Database
ISI
SICI code
0305-1048(19990901)27:17<3503:OAPONO>2.0.ZU;2-V
Abstract
In a recent paper we have estimated the total number of protein coding open reading frames (ORFs) in the Saccharomyces cerevisiae genome, based on the ir properties, at about 4800, This number is much smaller than the 5800-600 0 which is widely accepted. In this paper we analyse differences between th e set of ORFs with known phenotypes annotated in the Munich Information Cen tre for Protein Sequences (MIPS) database and ORFs for which the probabilit y of coding, counted by us, is very low. We have found that many of the lat ter ORFs have properties of antisense sequences of coding ORFs, which sugge sts that they could have been generated by duplication of coding sequences. Since coding sequences generate ORFs inside themselves, with especially hi gh frequency in the antisense sequences, we have looked for homology betwee n known proteins and hypothetical polypeptides generated by ORFs under cons ideration in all the six phases. For many ORFs we have found paralogues and orthologues in phases different than the phase which had been assumed in t he MIPS database as coding.