In the context of the international project aimed at sequencing the whole genome of Bacillus subtilis, we have developed a non-redundant, fully annotated database of sequences from this organism. Starting from the B. subtilis sequences available in the EMBL, GenBank and DDBJ collections, we removed all duplications encountered and then added extra annotations to the sequences (e.g. accession numbers for the genes, locations on the genetic map, codon usage). We have also added cross-references to the EMBL, MEDLINE, SWISS-PROT and ENZYME data banks. The present system results from merging the NRSub and SubtiList databases, and the sequence contigs used in the two systems are identical. NRSub is distributed as a flat file in EMBL format (which is supported by most sequence analysis software packages) and as an ACNUC database, while SubtiList is distributed as a relational database under 4th Dimension. The data can be accessed through two dedicated World Wide Web servers located in France and Japan.