Multivariate data analysis using D-optimal designs, partial least squares,and response surface modeling: A directional approach for the analysis of farnesyltransferase inhibitors

Citation
E. Giraud et al., Multivariate data analysis using D-optimal designs, partial least squares,and response surface modeling: A directional approach for the analysis of farnesyltransferase inhibitors, J MED CHEM, 43(9), 2000, pp. 1807-1816
Citations number
40
Categorie Soggetti
Chemistry & Analysis
Journal title
JOURNAL OF MEDICINAL CHEMISTRY
ISSN journal
00222623 → ACNP
Volume
43
Issue
9
Year of publication
2000
Pages
1807 - 1816
Database
ISI
SICI code
0022-2623(20000504)43:9<1807:MDAUDD>2.0.ZU;2-Y
Abstract
We have investigated the combined use of partial least squares (PLS) and st atistical design principles in principal property space (PP-space), derived from principal component analysis (PCA), to analyze farnesyltransferase in hibitors in order to identify "activity trends" (an approach we call a "dir ectional" approach) and quantitative structure-activity relationships (QSAR ) for a congeneric series of inhibitors: the benzo[f]perhydroisoindole (BPH I) series. Trends observed in the PCA showed that the descriptors used were relevant to describe our structural data set by clearly identifying two we ll-defined structural subclasses of inhibitors. D-Optimal design techniques allowed us to define a training set for PLS study in PP-space. Models were derived for each biological assay under evaluation: the in vitro Ki-Ras an d cellular HCT116 tests. Each of these assay-based sets was subdivided once more into two subsets according to two structural classes in this BPHI ser ies as revealed by the PCA model. The response surface modeling (RSM) metho dology was used for each subset, and the corresponding RSM plots helped us identify "activity trends" exploited to guide further analogue design. For more precise activity predictions more refined PLS models on constrained PP -spaces were developed for each subset. This approach was validated with pr edicted sets and demonstrates that useful information can be extracted from just a few very informative and representative compounds. Finally, we also showed the potential use of such a strategy at an early stage of an optimi zation process to extract the first "activity trends" that might support de cision making and guide medicinal chemists in the initial design of new ana logues and/or lead followup libraries.