INFORMATION-CONTENT IN NUCLEOTIDE-SEQUENCES

Citation
N.N. Bugaenko et al., INFORMATION-CONTENT IN NUCLEOTIDE-SEQUENCES, Molecular biology, 30(3), 1996, pp. 313-320
Number of citations
29
Subject categories
Biology
Journal title
Molecular biology
ISSN journal
00268933
Volume
30
Issue
3
Year of publication
1996
Part
1
Pages
313 - 320
Database
ISI
SICI code
0026-8933(1996)30:3<313:IIN>2.0.ZU;2-7
Abstract
We assessed the information content in nucleotide sequences through the efficiency of complete nucleotide sequence reconstruction from a set of its fragments (frequency-correlation dictionary), using the increase in the reconstructed sequence entropy for frequency-correlation dictionaries of q-letter-long words as a measure of efficiency. Human genes have a maximum increase with q = 5, 6, and 7. We also revealed abnormally efficient reconstruction using dictionaries with q = 3 and 2, which distinguishes the natural nucleotide sequences from random ones.
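
As a rough illustration of the objects the abstract refers to, the sketch below builds a frequency dictionary of overlapping q-letter words for a sequence and computes its Shannon entropy. It is not the authors' reconstruction procedure, which the abstract does not spell out. The function names `frequency_dictionary` and `shannon_entropy`, the toy sequence, and the choice of overlapping words with entropy measured in bits are all assumptions made for this example.

```python
from collections import Counter
from math import log2


def frequency_dictionary(seq: str, q: int) -> dict[str, float]:
    """Relative frequencies of all overlapping q-letter words in seq.

    Assumption: words are read with a step of one letter (overlapping).
    """
    if q < 1 or q > len(seq):
        raise ValueError("q must be between 1 and len(seq)")
    counts = Counter(seq[i:i + q] for i in range(len(seq) - q + 1))
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}


def shannon_entropy(freqs: dict[str, float]) -> float:
    """Shannon entropy, in bits, of a frequency dictionary."""
    return -sum(p * log2(p) for p in freqs.values() if p > 0)


if __name__ == "__main__":
    seq = "ACGTACGTTTGACA"  # hypothetical toy sequence, not real data
    for q in (2, 3):
        d = frequency_dictionary(seq, q)
        print(f"q={q}: {len(d)} distinct words, "
              f"entropy={shannon_entropy(d):.3f} bits")
```

Comparing such entropies across word lengths q, for a natural sequence and for a shuffled copy of it, gives a feel for the kind of contrast between natural and random sequences that the abstract reports, though the paper's own measure is the entropy increase of the reconstructed dictionary.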