Low-redundancy sequencing of the entire Lactococcus lactis IL1403 genome

Citation
A. Bolotin et al., Low-redundancy sequencing of the entire Lactococcus lactis IL1403 genome, ANTON LEEUW, 76(1), 1999, pp. 27-76
Citations number
94
Categorie Soggetti
Microbiology
Journal title
ANTONIE VAN LEEUWENHOEK INTERNATIONAL JOURNAL OF GENERAL AND MOLECULAR MICROBIOLOGY
ISSN journal
00036072 → ACNP
Volume
76
Issue
1
Year of publication
1999
Pages
27 - 76
Database
ISI
SICI code
0003-6072(199911)76:1<27:LSOTEL>2.0.ZU;2-P
Abstract
Lactococcus lactis is an AT-rich gram positive bacterium phylogenetically c lose to the genus Streptococcus. Various strains of L. lactis are used in d airy industry as starters for cheese making. L. lactis is also one of the w ell characterized laboratory microorganisms, widely used for studies on phy siology of lactic acid bacteria. We describe here a low redundancy sequence of the genome of the strain L. lactis IL1403. The strategy which we follow ed to determine the sequence consists of two main steps. First, a limited n umber of plasmids and lambda-phages that carry random segments of the genom e were sequenced. Second, sequences of the inserts were used for production of novel sequencing templates by applying Multiplex Long Accurate PCR prot ocols. Using of these PCR products allowed to determine the sequence of the entire 2.35 Mb genome with a very low redundancy, close to 2. The error ra te of the sequence is estimated to be below 1%. The correctness of the sequ ence assembly was confirmed by PCR amplification of the entire L. lactis IL 1403 genome, using a set of 266 oligonucleotides. Anotation of the sequence was undertaken by using automatic gene prediction computer tools. This all owed to identify 1495 protein-encoding genes, to locate them on the genome map and to classify their functions on the basis of homology to known prote ins. The function of about 700 genes expected to encode proteins that lack homologs in data bases cannot be reliably predicted in this way. The approa ch which we used eliminates high redundancy sequencing and mapping efforts, needed to obtain detailed and comprehensive genetic and physical maps of a bacterium. Availability of detailed genetic and physical maps of the L. lactis IL1403 genome provides many entries to study metabolism and physiology of bacteria from this group. The presence of 42 copies of five different IS elements i n the IL1403 genome confirms the importance of these elements for genetic e xchange in Lactococci. These include two previously unknown elements, prese nt at seven and fifteen copies and designated IS1077 and IS983, respectivel y. Five potential or rudimentary prophages were identified in the genome by detecting clusters of phage-related genes. The metabolic and regulatory po tential of L. lactis was evaluated by inspecting gene sets classified into different functional categories. L. lactis has the genetic potential to syn thesise 20 standard amino acids, purine and pyrimidine nucleotides and at l east four cofactors. Some of these metabolites, which are usually present i n chemically defined media, can probably be omitted. About twenty compounds can be used by L. lactis as a sole carbon source. Some 83 regulators were revealed, indicating a regulatory potential close to that of Haemophilus in fluenzae, a bacterium with a similar genome size. Unexpectedly, L. lactis h as a complete set of late competence genes, which may have concerted transc riptional regulation and unleadered polycistronic mRNAs. These findings ope n new possibilities for developing genetic tools, useful for studies of gen e regulation in AT-rich gram positive bacteria and for engineering of new s trains for the diary industry.