Jn. Weinstein et al., PREDICTIVE STATISTICS AND ARTIFICIAL-INTELLIGENCE IN THE TITUTES-DRUG-DISCOVERY-PROGRAM-FOR-CANCER-AND-AIDS, Stem cells, 12(1), 1994, pp. 13-22
The National Cancer Institute's drug discovery program screens more th
an 20,000 chemical compounds and natural products a year for activity
against a panel of 60 tumor cell lines in vitro. The result is an info
rmation-rich database of patterns that form the basis for what we term
an ''information-intensive'' approach to the process of drug discover
y. The first step was a demonstration, both by statistical methods (in
cluding the program COMPARE) and by neural networks, that patterns of
activity in the screen can be used to predict a compound's mechanism o
f action. Given this finding, the overall plan has been to develop thr
ee large matrices of information: the first (designated A) gives the p
attern of activity for each compound tested against each cell line in
the screen; the second (S) encodes any of a number of types of 2-D or
3-D structural motifs for each compound; the third (T) indicates each
cell's expression of molecular targets (e.g., from 2-dimensional prote
in gel electrophoresis). Construction and updating of these matrices i
s an ongoing process. The matrices can be concatenated in various ways
to test a variety of specific hypotheses about compounds screened, as
well as to ''prioritize'' candidate compounds for testing. To aid in
these efforts, we have developed the DISCOVERY program package, which
integrates the matrix data for visual pattern recognition. The ''infor
mation-intensive'' approach summarized here in some senses serves to b
ridge the perceived gap between screening and structure-based drug des
ign.