Analysis of codon usage diversity of bacterial genes with a self-organizing map (SOM): characterization of horizontally transferred genes with emphasis on the E. coli O157 genome
S. Kanaya et al., Analysis of codon usage diversity of bacterial genes with a self-organizing map (SOM): characterization of horizontally transferred genes with emphasis on the E. coli O157 genome, GENE, 276(1-2), 2001, pp. 89-99
With increases in the amounts of available DNA sequence data, it has become
increasingly important to develop tools for comprehensive systematic analy
sis and comparison of species-specific characteristics of protein-coding se
quences for a wide variety of genomes. In the present study, we used a nove
l neural-network algorithm, a self-organizing map (SOM), to efficiently and
comprehensively analyze codon usage in approximately 60,000 genes from 29
bacterial species simultaneously. This SOM makes it possible to cluster and
visualize genes of individual species separately at a much higher resoluti
on than can be obtained with principal component analysis. The organization
of the SOM can be explained by the genome G + C% and tRNA compositions of
the individual species. We used SOM to examine codon usage heterogeneity in
the E. coli O157 genome, which contains 'O157-unique segments' (O-islands)
, and showed that SOM is a powerful tool for characterization of horizontal
ly transferred genes. (C) 2001 Elsevier Science B.V. All rights reserved.