GeneID is a program to predict genes in anonymous genomic sequences designe
d with a hierarchical structure. In the First step, splice sites, and start
and stop codons are predicted and scored along the sequence using position
weight matrices (PWMs). In the second step, exons are built from the sites
. Exons are scored as the sum of the scores of the defining sites, plus the
log-likelihood ratio of a Markov model for coding DNA. In the last step, f
rom the set of predicted exons, the gene structure is assembled, maximizing
the sum of the scores of the assembled exons. In this paper we describe th
e obtention of PWMs for sites, and the Markov model of coding DNA in Drosop
hila melanogaster. We also compare other models of coding DNA with the Mark
ov model. Finally, we present and discuss the results obtained when GeneID
is used to predict genes in the Adh region. These results show that the acc
uracy of GeneID predictions compares currently with that of other existing
tools but that GeneID is likely to be more efficient in terms of speed and
memory usage. GeneID is available at http://wwwl.imim.es/similar to eblanco
/Geneld.