ITA
ENG

ALGORITHMS AND SOFTWARE FOR SUPPORT OF GENE IDENTIFICATION EXPERIMENTS

Authors

SZE SH ROYTBERG MA GELFAND MS MIRONOV AA ASTAKHOVA TV PEVZNER PA

Citation

Sh. Sze et al., ALGORITHMS AND SOFTWARE FOR SUPPORT OF GENE IDENTIFICATION EXPERIMENTS, BIOINFORMATICS, 14(1), 1998, pp. 14-19

Citations number

Categorie Soggetti

Computer Science Interdisciplinary Applications","Biology Miscellaneous","Computer Science Interdisciplinary Applications","Biochemical Research Methods

Journal title

BIOINFORMATICS → ACNP

ISSN journal

13674803

Volume

Issue

Year of publication

1998

Pages

14 - 19

Database

ISI

SICI code

1367-4803(1998)14:1<14:AASFSO>2.0.ZU;2-8

Abstract

Motivation: Gene annotation is the final goal of gene prediction algor ithms?. However these algorithms frequently make mistakes and therefor e the use of gene predictions for sequence annotation is hardly possib le. As a result, biologists ma forced to conduct time-consuming gene i dentification experiments by designing appropriate PCR primers to test cDNA libraries or applying RT-PCR, exon trapping/amplification, or ot her techniques. This process frequently amounts to 'guessing' PCR prim ers on top of unreliable gene predictions and frequently leads to wast ing of experimental efforts. Results: The present paper proposes a sim ple and reliable algorithm for experimental gene identification which by passes the unreliable gene prediction step. Studies of the performa nce of the algorithm on a sample of human genes indicate that an exper imental protocol based on the algorithms predictions achieves at? accu rate gene identification with relatively few PCR primers. Predictions of PCR primers may be used for exon amplification in preliminary! muta tion analysis during ai? attempt to identify a gene responsible for a disease. We propose a simple approach to find a short region from a ge nomic sequence that with high probability overlaps with some exon of t he gene. The algorithm is enhanced to find one or more segments that a l-e probably contained hilt the translated region of the gene and can be used as PCR primers to select appropriate clones in cDNA libraries by selective amplification. The algorithm is further extended to locat e a set of PCR primers that uniformly: cover ail translated regions an ti can be used Soi RT-PCR and further sequencing of (unknown) mRNA. Av ailability: The programs are implemented as Web servers (GenePrimer an d CASSANDRA) and can be reached at http://www-hto.usc.edu/software/pro crustes/ Contact: ssze@hto.usc.edu.