Clustering gene expression patterns

Citation
A. Ben-Dor et al., Clustering gene expression patterns, Journal of Computational Biology, 6(3-4), 1999, pp. 281-297
Number of citations
34
Subject categories
Biochemistry & Biophysics
Journal title
JOURNAL OF COMPUTATIONAL BIOLOGY
ISSN journal
1066-5277
Volume
6
Issue
3-4
Year of publication
1999
Pages
281 - 297
Database
ISI
SICI code
1066-5277(199923)6:3-4<281:CGEP>2.0.ZU;2-H
Abstract
Recent advances in biotechnology allow researchers to measure expression levels for thousands of genes simultaneously, across different conditions and over time. Analysis of data produced by such experiments offers potential insight into gene function and regulatory mechanisms. A key step in the analysis of gene expression data is the detection of groups of genes that manifest similar expression patterns. The corresponding algorithmic problem is to cluster multicondition gene expression patterns. In this paper we describe a novel clustering algorithm that was developed for analysis of gene expression data. We define an appropriate stochastic error model on the input, and prove that under the conditions of the model, the algorithm recovers the cluster structure with high probability. The running time of the algorithm on an n-gene dataset is O(n^2 [log(n)]^c). We also present a practical heuristic based on the same algorithmic ideas. The heuristic was implemented and its performance is demonstrated on simulated data and on real gene expression data, with very promising results.