Evaluation of the genetic text redundancy by the high-frequency component of the l-gram graph

Citation
Os. Kislyuk et al., Evaluation of the genetic text redundancy by the high-frequency component of the l-gram graph, BIOFIZIKA, 44(4), 1999, pp. 639-648
Citations number
15
Categorie Soggetti
Biochemistry & Biophysics
Journal title
BIOFIZIKA
ISSN journal
00063029 → ACNP
Volume
44
Issue
4
Year of publication
1999
Pages
639 - 648
Database
ISI
SICI code
0006-3029(199907/08)44:4<639:EOTGTR>2.0.ZU;2-4
Abstract
Various approaches to the estimation of DNA redundancy are compared: the Sh annon entropy, the Lempel-Ziv complexity, and a new method, the computation of the low-frequency component of the l-gram graph. Although these methods are based on different ideas, they satisfy some reasonable requirements, T he ability of these methods to find various kinds of repeats in genetic tex ts is compared. The resolution of the new method for calculation of DNA red undancy is discussed on the example of well-known repeats in the Epstein-Ba rr virus genome, The intrinsic discrepancy of high-frequency profile and Sh annon redundancy profile were observed in some functionally significant reg ions of sequences being investigated.