ITA
ENG

Similarity searching in large combinatorial chemistry spaces

Authors

Rarey, M Stahl, M

Citation

M. Rarey et M. Stahl, Similarity searching in large combinatorial chemistry spaces, J COMPUT A, 15(6), 2001, pp. 497-520

Citations number

Categorie Soggetti

Chemistry & Analysis

Journal title

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN

ISSN journal

0920654X → ACNP

Volume

Issue

Year of publication

2001

Pages

497 - 520

Database

ISI

SICI code

0920-654X(2001)15:6<497:SSILCC>2.0.ZU;2-3

Abstract

We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compoun d, the algorithm generates sets of compounds from a given chemistry space t hat are similar to the query. The similarity search is based on the feature tree similarity measure representing molecules by tree structures. This de scriptor allows handling combinatorial chemistry spaces as a whole instead of looking at subsets of enumerated compounds. Within few minutes of comput ing time, the algorithm is able to find the most similar compound in very l arge spaces as well as sets of compounds at an arbitrary similarity level. In addition, the diversity among the generated compounds can be controlled. A set of 17 000 fragments of known drugs, generated by the RECAP procedure from the World Drug Index, was used as the search chemistry space. These f ragments can be combined to more than 10(18) compounds of reasonable size. For validation, known antagonists/inhibitors of several targets including d opamine D4, histamine H1, and COX2 are used as queries. Comparison of the c ompounds created by Ftrees-FS to other known actives demonstrates the abili ty of the method to jump between structurally unrelated molecule classes.