Motivation: Transcriptional activation in eukaryotic organisms normally req
uires combinatorial interactions of multiple transcription factors. Though
several methods exist for identification of individual protein binding site
patterns in DNA sequences, there are few methods for discovery of binding
site patterns for cooperatively acting factors. Here we present an algorith
m, Co-Bind (for COperative BINDing), for discovering DNA target sites for c
ooperatively acting transcription factors. The method utilizes a Gibbs samp
ling strategy to model the cooperativity between two transcription factors
and defines position weight matrices for the binding sites. Sequences from
both the training set and the entire genome are taken into account, in orde
r to discriminate against commonly occurring patterns in the genome, and pr
oduce patterns which are significant only in the training set.
Results: We have tested Co-Bind on semi-synthetic and real data sets to sho
w it can efficiently identify DNA target site patterns for cooperatively bi
nding transcription factors. In cases where binding site patterns are weak
and cannot be identified by other available methods, Co-Bind, by virtue of
modeling the cooperativity between factors, can identify those sites effici
ently. Though developed to model protein-DNA interactions, the scope of Co-
Bind may be extended to combinatorial, sequence specific, interactions in o
ther macromolecules.