TOWARD SIMILARITY MEASURES FOR MACROMOLECULAR BODIES - MEDLA TEST CALCULATIONS FOR SUBSTITUTED BENZENE SYSTEMS

Citation
Pd. Walker et Pg. Mezey, TOWARD SIMILARITY MEASURES FOR MACROMOLECULAR BODIES - MEDLA TEST CALCULATIONS FOR SUBSTITUTED BENZENE SYSTEMS, Journal of computational chemistry, 16(10), 1995, pp. 1238-1249
Citations number
22
Categorie Soggetti
Chemistry
ISSN journal
01928651
Volume
16
Issue
10
Year of publication
1995
Pages
1238 - 1249
Database
ISI
SICI code
0192-8651(1995)16:10<1238:TSMFMB>2.0.ZU;2-D
Abstract
A new method is proposed for the evaluation of numerical similarity me asures for large molecules, defined in terms of their electron density (ED) distributions. The technique is based on the Molecular Electron Density Lego Assembler (MEDLA) approach, proposed earlier for the gene ration of ab initio quality electron densities for proteins and other macromolecules. The reliability of the approach is tested using a fami ly of 13 substituted aromatic systems for which both standard ab initi o electron density computations and the MEDLA technique are applicable . These tests also provide additional examples for evaluating the accu racy of the MEDLA technique. Electron densities for a series of 13 sub stituted benzenes were calculated using the standard ab initio method with STO-3G, 3-21G, and 6-31G* basis sets as well as the MEDLA approa ch with a 6-31G* database of electron density fragments. For each typ e of calculation, pairwise similarity measures of these compounds were calculated using a point-by-point numerical comparison of the EDs. Fr om these results, 2D similarity maps were constructed, serving as an a id for quick visual comparisons for the entire molecular family. The M EDLA approach is shown to give virtually equivalent numerical similari ty measures and similarity maps as the standard ab initio method using a 6-31G* basis set. By contrast, significant differences are found b etween the standard ab initio 6-31G* results and the standard ab init io results obtained with smaller STO-SG and 3-21G basis sets. These te sts indicate that the MEDLA-based similarity measures faithfully mimic the actual, standard ab initio 6-31G* similarity measures, suggestin g the MEDLA method as a reliable technique to assess the shape similar ities of proteins and other macromolecules. The speed of the MEDLA com putations allows rapid, pairwise comparisons of the actual EDs for a s eries of molecules, requiring no more computer time than other simplif ied, less detailed representations of molecular shape. The MEDLA metho d also reduces the need to store large volumes of numerical density da ta on disk, as these densities can be quickly recomputed when needed. For these reasons, the proposed MEDLA similarity analysis technique is likely to become a useful tool in computational drug design. (C) 1995 by John Wiley & Sons, Inc.