ON CHARACTERIZATION OF MOLECULAR SHAPES

Citation
M. Randic et M. Razinger, ON CHARACTERIZATION OF MOLECULAR SHAPES, Journal of chemical information and computer sciences, 35(3), 1995, pp. 594-606
Citations number
73
Categorie Soggetti
Information Science & Library Science","Computer Application, Chemistry & Engineering","Computer Science Interdisciplinary Applications",Chemistry,"Computer Science Information Systems
ISSN journal
00952338
Volume
35
Issue
3
Year of publication
1995
Pages
594 - 606
Database
ISI
SICI code
0095-2338(1995)35:3<594:OCOMS>2.0.ZU;2-T
Abstract
We consider the problem of characterization of molecular shape. In par ticular we consider shapes of planar objects derived by fused regular hexagons, i.e., shapes that can be embedded on a regular hexagonal gri d. We have introduced binary codes for the representation of molecular shapes and have limited our attention to planar benzenoidal forms of a constant perimeter. The form of the code depends on the selection of the starting bond and the sense of circling around the molecular peri meter. We selected as the canonical code one that results in the small est binary number for the periphery code. The binary canonical codes o ffer a systematic catalog of all possible shapes defined on the graphi te lattice. The binary codes also fully reflect the symmetry of shapes , while chiral structures show different codes for their mirror forms. Finally, we discussed similarity of shapes (using Hamming distance be tween the codes as the measure of dissimilarity). The numerical magnit ude of the similarity of codes depends on the length of the fragment o f the code to be compared. With more entries (longer the local sequenc es of comparison) a better discrimination between similar objects is o btained. Calculating similarity by comparing codes using a longer ''wi ndow'' (longer local subsequence) was accomplished by a computer progr am written for that purpose (in FORTRAN).