Identification of the descriptor pharmacophores using variable selection QSAR: Applications to database mining

Citation
A. Tropsha et Wf. Zhang, Identification of the descriptor pharmacophores using variable selection QSAR: Applications to database mining, CUR PHARM D, 7(7), 2001, pp. 599-612
Citations number
75
Categorie Soggetti
Pharmacology & Toxicology
Journal title
CURRENT PHARMACEUTICAL DESIGN
ISSN journal
13816128 → ACNP
Volume
7
Issue
7
Year of publication
2001
Pages
599 - 612
Database
ISI
SICI code
1381-6128(200105)7:7<599:IOTDPU>2.0.ZU;2-V
Abstract
The pharmacophore concept is central to the rational drug design and discov ery process. Traditionally, a pharmacophore is defined as a specific three- dimensional (3D) arrangement of chemical functional groups found in active molecules, which are characteristic of a certain pharmacological class of c ompounds. Herein, by analogy with 3D pharmacophores, a more general concept of descriptor pharmacophore is introduced. The descriptor pharmacophores a re defined by the means of variable selection QSAR as a subset of molecular descriptors that afford the most statistically significant structure-activ ity correlation. The two variable selection QSAR methods developed in this laboratory are discussed; these include Genetic Algorithms - Partial Least Squares (GA-PLS) and K-Nearest Neighbors (KNN). Both methods employ multipl e topological descriptors of chemical structures such as molecular connecti vity indices or atom pairs (AP), and stochastic optimization algorithms to achieve a robust QSAR model, which is characterized by the highest value of cross-validated R-2 (q(2)). By default, the descriptor pharmacophore repre sents an invariant selection of descriptor types however, descriptor values are generally different for different molecules. We demonstrate that chemi cal similarity searches using descriptor pharmacophores as opposed to using all descriptors afford more efficient mining of chemical databases or virt ual libraries to discover compounds with a desired biological activity.