A continuous-index Bayesian hidden Markov model for prediction of nucleosome positioning in genomic DNA

Citation
Mitra, Ritendranath et Gupta, Mayetri, A continuous-index Bayesian hidden Markov model for prediction of nucleosome positioning in genomic DNA, Biostatistics (Oxford. Print) , 12(3), 2011, pp. 462-477
ISSN journal
14654644
Volume
12
Issue
3
Year of publication
2011
Pages
462 - 477
Database
ACNP
SICI code
Abstract
Nucleosomes are units of chromatin structure, consisting of DNA sequence wrapped around proteins called 'histones'.Nucleosomes occur at variable intervals throughout genomic DNA and prevent transcription factor (TF) binding by blocking TF access to the DNA.A map of nucleosomal locations would enable researchers to detect TF binding sites with greater efficiency.Our objective is to construct an accurate genomic map of nucleosome-free regions (NFRs) based on data from high-throughput genomic tiling arrays in yeast.These high-volume data typically have a complex structure in the form of dependence on neighboring probes as well as underlying DNA sequence, variable-sized gaps, and missing data.We propose a novel continuous-index model appropriate for non-equispaced tiling array data that simultaneously incorporates DNA sequence features relevant to nucleosome formation.Simulation studies and an application to a yeast nucleosomal assay demonstrate the advantages of using the new modeling framework, as well as its robustness to distributional misspecifications.Our results reinforce the previous biological hypothesis that higher-order nucleotide combinations are important in distinguishing nucleosomal regions from NFRs.