P. Linder et al., LISTA, A COMPREHENSIVE COMPILATION OF NUCLEOTIDE-SEQUENCES ENCODING PROTEINS FROM THE YEAST SACCHAROMYCES, Nucleic acids research, 21(13), 1993, pp. 3001-3002
The amount of nucleotide sequence data is increasing exponentially. We
have therefore compiled a comprehensive database (LISTA) for
the yeast Saccharomyces cerevisiae. Each sequence has been assigned
a single genetic name, and in the case of allelic duplicated sequences,
synonyms are given where necessary. For the nomenclature we have
introduced a standard principle for naming gene sequences based on priority
rules. We have also applied a simple method to distinguish duplicated
sequences of one and the same gene from non-allelic sequences of
duplicated genes. Using these principles, we have resolved a great deal of
confusion in the literature and databanks. Along with the genetic name,
the mnemonic from the EMBL databank, the codon bias, the reference to the
publication of the sequence, and the EMBL accession numbers are included
in each entry.
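
The fields listed for each entry can be pictured as a simple record. The
following is a minimal sketch in Python, not the actual LISTA file format:
the class name ListaEntry, its field names, the answers_to helper, and all
example values are hypothetical and chosen only for illustration. How the
codon bias is calculated is not stated above, so it is stored as a plain
number.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ListaEntry:
    """Hypothetical record for one entry; the field names are illustrative."""
    genetic_name: str      # the single genetic name given to the sequence
    embl_mnemonic: str     # mnemonic from the EMBL databank
    codon_bias: float      # codon bias value (method of calculation not specified)
    reference: str         # publication that reported the sequence
    accession_numbers: List[str] = field(default_factory=list)  # EMBL accession numbers
    synonyms: List[str] = field(default_factory=list)           # other names, if any

    def answers_to(self, name: str) -> bool:
        """True if `name` is the genetic name or a listed synonym (case-insensitive)."""
        wanted = name.strip().upper()
        known = {n.upper() for n in [self.genetic_name, *self.synonyms]}
        return wanted in known


# Placeholder values only; none of these are taken from the database.
example = ListaEntry(
    genetic_name="XXX1",
    embl_mnemonic="SCXXX1",
    codon_bias=0.0,
    reference="(placeholder citation)",
    accession_numbers=["X00000"],
    synonyms=["YYY1"],
)
print(example.answers_to("yyy1"))
```

Keeping the synonyms next to the single genetic name means that a lookup by
any published name can resolve to the same entry.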