CerBeruS: A system supporting the sequential screening process

Citation
Mfm. Engels et al., CerBeruS: A system supporting the sequential screening process, J CHEM INF, 40(2), 2000, pp. 241-245
Citations number
17
Categorie Soggetti
Chemistry
Journal title
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES
ISSN journal
00952338 → ACNP
Volume
40
Issue
2
Year of publication
2000
Pages
241 - 245
Database
ISI
SICI code
0095-2338(200003/04)40:2<241:CASSTS>2.0.ZU;2-E
Abstract
This paper describes the general design and application of CerBeruS, a comp uter-based system for supporting the process of sequential screening. CerBe ruS stands for cluster-based selection, with cluster analysis forming the p ivotal part of the system. CerBeruS uses the Ward's clustering method for p artitioning the data set to be screened into smaller, more homogeneous subs ets. One representative is picked from each subset and suggested as a scree ning candidate. Although the number of compounds submitted to screening is most often driven by the capacity of the assay, CerBeruS provides a statist ical measure that computes the optimal number of clusters in the data set. This measure forms a point of reference for all screening experiments. Diff erent hierarchies of subsets are stored in an Oracle database. Information about the size and content of a cluster can be retrieved from this database via a Visual Basic application. How these components work together in the CerBeruS system is demonstrated on a large data set. In addition, we show t hat, using the statistical measure, one can find an optimal trade-off betwe en screening effort and number of hits.