THE MODULAR STRUCTURE OF INFORMATIONAL SEQUENCES

Citation
Ao. Schmitt et al., THE MODULAR STRUCTURE OF INFORMATIONAL SEQUENCES, Biosystems, 37(3), 1996, pp. 199-210
Number of citations
12
Subject Categories
Biology
Journal title
Biosystems
Journal ISSN
0303-2647
Volume
37
Issue
3
Year of publication
1996
Pages
199 - 210
Database
ISI
SICI code
0303-2647(1996)37:3<199:TMSOIS>2.0.ZU;2-F
Abstract
It is shown that DNA sequences can be decomposed into smaller units, much as texts can be decomposed into syllables, words, or groups of words. These smaller units (modules) are extracted from DNA sequences according to statistical criteria. Tests were performed with sequences of known modular structure (two novels and a FORTRAN source code). The degree to which a DNA sequence can be decomposed into modules (its modularity) turns out to be a very sensitive measure for distinguishing DNA sequences from random sequences.