EXPERIMENTAL-DESIGNS FOR SELECTING MOLECULES FROM LARGE CHEMICAL DATABASES

Citation
Re. Higgs et al., EXPERIMENTAL-DESIGNS FOR SELECTING MOLECULES FROM LARGE CHEMICAL DATABASES, Journal of chemical information and computer sciences, 37(5), 1997, pp. 861-870
Citations number
23
Categorie Soggetti
Information Science & Library Science","Computer Application, Chemistry & Engineering","Computer Science Interdisciplinary Applications",Chemistry,"Computer Science Information Systems
ISSN journal
00952338
Volume
37
Issue
5
Year of publication
1997
Pages
861 - 870
Database
ISI
SICI code
0095-2338(1997)37:5<861:EFSMFL>2.0.ZU;2-B
Abstract
Recent developments in high-throughput screening and combinatorial che mistry have generated interest in experimental design methods to selec t subsets of molecules from large chemical databases. In this manuscri pt three methods for selecting molecules from large databases are desc ribed: edge designs, spread designs, and coverage designs. Two algorit hms with linear time complexity that approximate spread and coverage d esigns are described. These algorithms can be threaded for multiproces sor systems, are compatible with any definition of molecular distance, and may be applied to very large chemical databases. For example, ten thousand molecules were selected using the maximum dissimilarity appr oximation to a spread design from a sixty-dimensional simulated molecu lar database of one million molecules in approximately 6 h on a UNIX w orkstation.