ITA
ENG

CerBeruS: A system supporting the sequential screening process

Authors

Engels, MFM Thielemans, T Verbinnen, D Tollenaere, JP Verbeeck, R

Citation

Mfm. Engels et al., CerBeruS: A system supporting the sequential screening process, J CHEM INF, 40(2), 2000, pp. 241-245

Citations number

Categorie Soggetti

Chemistry

Journal title

JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES

ISSN journal

00952338 → ACNP

Volume

Issue

Year of publication

2000

Pages

241 - 245

Database

ISI

SICI code

0095-2338(200003/04)40:2<241:CASSTS>2.0.ZU;2-E

Abstract

This paper describes the general design and application of CerBeruS, a comp uter-based system for supporting the process of sequential screening. CerBe ruS stands for cluster-based selection, with cluster analysis forming the p ivotal part of the system. CerBeruS uses the Ward's clustering method for p artitioning the data set to be screened into smaller, more homogeneous subs ets. One representative is picked from each subset and suggested as a scree ning candidate. Although the number of compounds submitted to screening is most often driven by the capacity of the assay, CerBeruS provides a statist ical measure that computes the optimal number of clusters in the data set. This measure forms a point of reference for all screening experiments. Diff erent hierarchies of subsets are stored in an Oracle database. Information about the size and content of a cluster can be retrieved from this database via a Visual Basic application. How these components work together in the CerBeruS system is demonstrated on a large data set. In addition, we show t hat, using the statistical measure, one can find an optimal trade-off betwe en screening effort and number of hits.