H. Bono et al., RECONSTRUCTION OF AMINO-ACID BIOSYNTHESIS PATHWAYS FROM THE COMPLETE GENOME SEQUENCE, PCR methods and applications, 8(3), 1998, pp. 203-210
The complete genome sequence of an organism contains information that
has not been fully utilized in the current prediction methods of gene
functions, which are based on piece-by-piece similarity searches of in
dividual genes. We present here a method that utilizes a higher level
information of molecular pathways to reconstruct a complete functional
unit from a set of genes. Specifically, a genome-by-genome comparison
is first made for identifying enzyme genes and assigning EC numbers,
which is followed by the reconstruction of selected portions of the me
tabolic pathways by use of the reference biochemical knowledge. The co
mpleteness of the reconstructed pathway is an indicator of the correct
ness of the initial gene function assignment. This feature has become
possible because of our efforts to computerize the current knowledge o
f metabolic pathways under the KEGG project. We found that the biosynt
hesis pathways of all 20 amino acids were completely reconstructed in
Escherichia coli, Haemophilus influenzae, and Bacillus subtilis, and p
robably in Synechocystis and Saccharomyces cerevisiae as well, althoug
h it was necessary to assume wider substrate specificity for aspartate
aminotransferases.