Sa. Teichmann et al., The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli, J MOL BIOL, 311(4), 2001, pp. 693-708
The 106 small molecule metabolic (SMM) pathways in Escherichia coli are for
med by the protein products of 581 genes. We can define 722 domains, nearly
all of which are homologous to proteins of known structure, that form all
or part of 510 of these proteins. This information allows us to answer gene
ral questions on the structural anatomy of the SMM pathway proteins and to
trace family relationships and recruitment events within and across pathway
s. Half the gene products contain a single domain and half are formed by co
mbinations of between two and six domains. The 722 domains belong to one of
213 families that have between one and 51 members. Family members usually
conserve their catalytic or cofactor binding properties; substrate recognit
ion is rarely conserved. Of the 213 families, members of only a quarter occ
ur in isolation, i.e. they form single-domain proteins. Most members of the
other families combine with domains from just one or two other families an
d a few more versatile families can combine with several different partners
.
Excluding isoenzymes, more than twice as many homologues are distributed ac
ross pathways as within pathways. However, serial recruitment, with two con
secutive enzymes both being recruited to another pathway, is rare and recru
itment of three consecutive enzymes is not observed. Only eight of the 106
pathways have a high number of homologues. Homology between consecutive pai
rs of enzymes with conservation of the main substrate-binding site but chan
ge in catalytic mechanism (which would support a simple model of retrograde
pathway evolution) occurs only six times in the whole set of enzymes. Most
of the domains that form SMM pathways have homologues in non-SMM pathways.
Taken together, these results imply a pervasive "mosaic" model for the for
mation of protein repertoires and pathways. (C) 2001 Academic Press.