Re. Higgs et al., EXPERIMENTAL-DESIGNS FOR SELECTING MOLECULES FROM LARGE CHEMICAL DATABASES, Journal of chemical information and computer sciences, 37(5), 1997, pp. 861-870
Citations number
23
Categorie Soggetti
Information Science & Library Science","Computer Application, Chemistry & Engineering","Computer Science Interdisciplinary Applications",Chemistry,"Computer Science Information Systems
Recent developments in high-throughput screening and combinatorial che
mistry have generated interest in experimental design methods to selec
t subsets of molecules from large chemical databases. In this manuscri
pt three methods for selecting molecules from large databases are desc
ribed: edge designs, spread designs, and coverage designs. Two algorit
hms with linear time complexity that approximate spread and coverage d
esigns are described. These algorithms can be threaded for multiproces
sor systems, are compatible with any definition of molecular distance,
and may be applied to very large chemical databases. For example, ten
thousand molecules were selected using the maximum dissimilarity appr
oximation to a spread design from a sixty-dimensional simulated molecu
lar database of one million molecules in approximately 6 h on a UNIX w
orkstation.