GAIA - FRAMEWORK ANNOTATION OF GENOMIC SEQUENCE

Citation
Lc. Bailey et al., GAIA - FRAMEWORK ANNOTATION OF GENOMIC SEQUENCE, PCR methods and applications, 8(3), 1998, pp. 234-250
Citations number
29
Categorie Soggetti
Biothechnology & Applied Migrobiology",Biology,"Genetics & Heredity
ISSN journal
10549803
Volume
8
Issue
3
Year of publication
1998
Pages
234 - 250
Database
ISI
SICI code
1054-9803(1998)8:3<234:G-FAOG>2.0.ZU;2-M
Abstract
As increasing amounts of genomic sequence from many organisms become a vailable, and as DNA sequences become a primary reagent in biologic in vestigations, the role of annotation as a prospective guide for labora tory experiments will expand rapidly. Here we describe a process of hi gh-throughput, reliable annotation, called Framework annotation, which is designed to provide a foundation for initial biologic characteriza tion of previously unexamined sequence. To examine this concept in pra ctice, we have constructed Genome Annotation and Information Analysis (GAIA), a prototype software architecture that implements several elem ents important for framework annotation. The center of GAIA consists o f an annotation database and the associated data management subsystem that forms the software bus along which other components communicate. The schema for this database defines three principal concepts: (I) Ent ries, consisting of sequence and associated historical data; (2) Featu res, comprising information of biologic interest; and (3) Experiments, describing the evidence that supports Features. The database permits tracking of annotation results over time, as well as assessment of the reliability of particular results. New framework annotation is produc ed by CARTA, a set of autonomous sensors that perform automatic analys es and assert results into the annotation database. These results are available via a Web-based query interface that uses graphical lava app lets as well as text-based HTML pages to display data at different lev els of resolution and permit interactive exploration of annotation. We present results for initial application of framework annotation to a set of test sequences, demonstrating its effectiveness in providing a starting point for biologic investigation, and discuss ways in which t he current prototype can be improved. The prototype is available for p ublic use and comment at http://www.cbil.upenn.edu/gaia.