Mining the chemical quarry with joint chemical probes: An application of latent semantic structure indexing (LaSSI) and TOPOSIM (dice) to chemical database mining

Citation
Sb. Singh et al., Mining the chemical quarry with joint chemical probes: An application of latent semantic structure indexing (LaSSI) and TOPOSIM (dice) to chemical database mining, J MED CHEM, 44(10), 2001, pp. 1564-1575
Citations number
13
Categorie Soggetti
Chemistry & Analysis
Journal title
JOURNAL OF MEDICINAL CHEMISTRY
ISSN journal
00222623 → ACNP
Volume
44
Issue
10
Year of publication
2001
Pages
1564 - 1575
Database
ISI
SICI code
0022-2623(20010510)44:10<1564:MTCQWJ>2.0.ZU;2-B
Abstract
In this study we use a novel similarity search technique called latent sema ntic structure indexing (LaSSI) with joint chemical probes as queries to mi ne the MDL drug data report database. LaSSI is based on latent semantic ind exing developed for searching textual databases. We use atom pair and topol ogical torsion descriptors in our calculations. The results obtained with L aSSI are compared with another in-house similarity search technique TOPOSIM . The results from the similarity searches using joint chemical probes are significantly better than searches using single chemical probes for both La SSI and TOPOSIM. The selected molecules are closely related in activity to their queries and are ranked among the top 300 scoring molecules of the 82 860 entries in the database. Our implementation of LaSSI is very fast and e fficient in finding active compounds. The results also show that LaSSI cons istently retrieves more diverse chemical structures representative of the j oint chemical probes in comparison to TOPOSIM. The use of multimolecule top ological probes to identify compounds complements the use of searching data bases with 3D pharmacophore hypotheses.