A fully Bayesian hidden Ising model for ChIP-seq data analysis

Authors
Citation
Mo, Qianxing, A fully Bayesian hidden Ising model for ChIP-seq data analysis, Biostatistics (Oxford. Print) , 13(1), 2012, pp. 113-128
ISSN journal
14654644
Volume
13
Issue
1
Year of publication
2012
Pages
113 - 128
Database
ACNP
SICI code
Abstract
Chromatin immunoprecipitation followed by next generation sequencing (ChIP-seq) is a powerful technique that is being used in a wide range of biological studies including genome-wide measurements of protein.DNA interactions, DNA methylation, and histone modifications.The vast amount of data and biases introduced by sequencing and/or genome mapping pose new challenges and call for effective methods and fast computer programs for statistical analysis.To systematically model ChIP-seq data, we build a dynamic signal profile for each chromosome and then model the profile using a fully Bayesian hidden Ising model.The proposed model naturally takes into account spatial dependency and global and local distributions of sequence tags.It can be used for one-sample and two-sample analyses.Through model diagnosis, the proposed method can detect falsely enriched regions caused by sequencing and/or mapping errors, which is usually not offered by the existing hypothesis.testing-based methods.The proposed method is illustrated using 3 transcription factor (TF) ChIP-seq data sets and 2 mixed ChIP-seq data sets and compared with 4 popular and/or well-documented methods: MACS, CisGenome, BayesPeak, and SISSRs.The results indicate that the proposed method achieves equivalent or higher sensitivity and spatial resolution in detecting TF binding sites with false discovery rate at a much lower level.