Mendel-GFDb and Mendel-ESTS: databases of plant gene families and ESTs annotated with gene family numbers and gene family names

Citation
D. Lonsdale et al., Mendel-GFDb and Mendel-ESTS: databases of plant gene families and ESTs annotated with gene family numbers and gene family names, NUCL ACID R, 29(1), 2001, pp. 120-122
Citations number
16
Categorie Soggetti
Biochemistry & Biophysics
Journal title
NUCLEIC ACIDS RESEARCH
ISSN journal
03051048 → ACNP
Volume
29
Issue
1
Year of publication
2001
Pages
120 - 122
Database
ISI
SICI code
0305-1048(20010101)29:1<120:MAMDOP>2.0.ZU;2-V
Abstract
There is no control over the information provided with sequences when they are deposited in the sequence databases. Consequently mistakes can seed the incorrect annotation of other sequences. Grouping genes into families and applying controlled annotation overcomes the problems of incorrect annotati on associated with individual sequences. Two databases (http://www.mendel.a c.uk) were created to apply controtled annotation to plant genes and plant ESTs: Mendel-GFDb is a database of plant protein (gene) families based on g apped-BLAST analysis of all sequences in the SWISS-PROT family of databases . Sequences are aligned (ClustalW) and identical and similar residues shade d. The families are visually curated to ensure that one or more criteria, f or example overall relatedness andlor domain similarity relate ail sequence s within a family. Sequence families are assigned a 'Gene Family Number' an d a unified description is developed which best describes the family and it s members. If authority exists the gene family is assigned a 'Gene Family N ame'. This information is placed in MendelGFDb, Mendel-ESTS is primarily a database of plant ESTs, which have been compared to Mendel-GFDb, completely sequenced genomes and domain databases. This approach associated ESTs with individual sequences and the controlled annotation of gene families and pr otein domains; the information being placed in Mendel-ESTS, The controlled annotation applied to genes and ESTs provides a basis from which a plant tr anscription database can be developed.