ITA
ENG

MINING THE NCI ANTICANCER DRUG DISCOVERY DATABASES - GENETIC FUNCTIONAPPROXIMATION FOR THE QSAR STUDY OF ANTICANCER ELLIPTICINE ANALOGS

Authors

SHI LM FAN Y MYERS TG OCONNOR PM PAULL KD FRIEND SH WEINSTEIN JN

Citation

Lm. Shi et al., MINING THE NCI ANTICANCER DRUG DISCOVERY DATABASES - GENETIC FUNCTIONAPPROXIMATION FOR THE QSAR STUDY OF ANTICANCER ELLIPTICINE ANALOGS, Journal of chemical information and computer sciences, 38(2), 1998, pp. 189-199

Citations number

Categorie Soggetti

Computer Science Interdisciplinary Applications","Computer Science Information Systems","Computer Science Interdisciplinary Applications",Chemistry,"Computer Science Information Systems

Journal title

Journal of chemical information and computer sciences → ACNP

ISSN journal

00952338

Volume

Issue

Year of publication

1998

Pages

189 - 199

Database

ISI

SICI code

0095-2338(1998)38:2<189:MTNADD>2.0.ZU;2-K

Abstract

The U.S. National Cancer Institute (NCI) conducts a drug discovery pro gram in which similar to 10 000 compounds are screened every year in v itro against a panel of 60 human cancer cell lines from different orga ns of origin. Since 1990, similar to 63 000 compounds have been tested , and their patterns of activity profiled. Recently, we analyzed the a ntitumor activity patterns of 112 ellipticine analogues using a hierar chical clustering algorithm. Dramatic coherence between molecular stru ctures and activity patterns was observed qualitatively from the clust er tree. In the present study, we further investigate the quantitative structure-activity relationships (QSAR) of these compounds, in partic ular with respect to the influence of p53-status and the CNS cell sele ctivity of the activity patterns. Independent variables (i.e., chemica l structural descriptors of the ellipticine analogues) were calculated from the Cerius(2) molecular modeling package. Important structural d escriptors, including partial atomic charges on the ellipticine ring-f orming atoms, were identified by the recently developed genetic functi on approximation (GFA) method. For our data set, the GFA method gave b etter correlation and cross-validation results (R-2 and CVR2 were usua lly similar to 0.3 higher) than did classical stepwise linear regressi on. A procedure for improving the performance of GFA is proposed, and the relative advantages and disadvantages of using GFA for QSAR studie s are discussed.