Finding borders between coding and noncoding DNA regions by an entropic segmentation method

Citation
P. Bernaola-galvan et al., Finding borders between coding and noncoding DNA regions by an entropic segmentation method, PHYS REV L, 85(6), 2000, pp. 1342-1345
Citations number
19
Categorie Soggetti
Physics
Journal title
PHYSICAL REVIEW LETTERS
ISSN journal
00319007 → ACNP
Volume
85
Issue
6
Year of publication
2000
Pages
1342 - 1345
Database
ISI
SICI code
0031-9007(20000807)85:6<1342:FBBCAN>2.0.ZU;2-Q
Abstract
We present a new computational approach to finding borders between coding a nd noncoding DNA. This approach has two features: (i) DNA sequences are des cribed by a 12-letter alphabet that captures the differential base composit ion at each codon position, and (ii) the search for the borders is carried out by means of an entropic;segmentation method which uses only the general statistical properties of coding DNA. We find that this method is highly a ccurate in finding borders between coding and noncoding regions and require s no "prior training" on known data sets. Our results appear to be more acc urate than those obtained with moving windows in the discrimination of codi ng from noncoding DNA.