The stationary statistical properties of human coding sequences

Citation
D.C. Torney et al., The stationary statistical properties of human coding sequences, J. Mol. Biol., 286(5), 1999, pp. 1461-1469
Number of citations
17
Subject categories
Molecular Biology & Genetics
Journal title
JOURNAL OF MOLECULAR BIOLOGY
ISSN journal
0022-2836
Volume
286
Issue
5
Year of publication
1999
Pages
1461 - 1469
Database
ISI
SICI code
0022-2836(19990312)286:5<1461:TSSPOH>2.0.ZU;2-O
Abstract
We introduce a generally applicable method for the discovery and quantitation of all of the characteristic statistical properties of a class of biological sequences, given examples from the class. This method employs a reversible binary encoding of sequences into the binary digits -1 and +1. Then, provided that the sample is sufficient, the sample cumulants on the subsets of digit positions will manifest all of the statistical properties of the class. As an illustration, we present the main results of a complete characterization of the stationary statistical properties of human coding sequences, in terms of their sample cumulants. Many of the telling sample cumulants are described.
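Illustrative sketch
The two steps named in the abstract (a reversible encoding into the digits -1 and +1, then sample cumulants on subsets of digit positions) can be sketched roughly as follows. This is not the authors' implementation. The two-digit nucleotide mapping in ENCODE is an assumption, since the abstract does not state which encoding was used, and all function names are hypothetical. The cumulant is computed with the standard partition formula for joint cumulants, applied to sample moments over equal-length sequences; the stationary (position-averaged) treatment mentioned in the abstract is not attempted here.

```python
from math import factorial, prod

# ASSUMED mapping: two +/-1 digits per nucleotide. The abstract does not
# specify the encoding; any one-to-one mapping keeps it reversible.
ENCODE = {"A": (-1, -1), "C": (-1, 1), "G": (1, -1), "T": (1, 1)}
DECODE = {digits: base for base, digits in ENCODE.items()}


def encode(seq):
    """Turn a nucleotide string into a flat list of -1/+1 digits."""
    return [d for base in seq for d in ENCODE[base]]


def decode(digits):
    """Invert encode(): read the digits back two at a time."""
    pairs = zip(digits[0::2], digits[1::2])
    return "".join(DECODE[pair] for pair in pairs)


def set_partitions(items):
    """Yield every partition of items into non-empty blocks."""
    items = list(items)
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in set_partitions(rest):
        # first forms its own block ...
        yield [[first]] + part
        # ... or joins one of the existing blocks
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]


def sample_moment(rows, positions):
    """Sample mean of the product of the digits at the given positions."""
    return sum(prod(row[p] for p in positions) for row in rows) / len(rows)


def sample_cumulant(rows, positions):
    """Joint sample cumulant of the digits at the given positions.

    Sum over partitions pi of (-1)^(|pi|-1) * (|pi|-1)! times the
    product of the sample moments of the blocks of pi.
    """
    total = 0.0
    for partition in set_partitions(positions):
        k = len(partition)
        term = (-1) ** (k - 1) * factorial(k - 1)
        for block in partition:
            term *= sample_moment(rows, block)
        total += term
    return total


# Toy sample of equal-length sequences (made up for illustration only).
sample = [encode(s) for s in ["AA", "AT", "TA", "TT"]]
pair_cumulant = sample_cumulant(sample, (0, 1))
```

For a subset of two positions the cumulant reduces to the sample covariance of the two digits, so a nonzero value indicates that those digit positions are statistically dependent in the sample. The number of partitions grows quickly with subset size, so this direct enumeration is only practical for small subsets.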