ANATOMY OF PROTEIN POCKETS AND CAVITIES - MEASUREMENT OF BINDING-SITEGEOMETRY AND IMPLICATIONS FOR LIGAND DESIGN

Citation
J. Liang et al., ANATOMY OF PROTEIN POCKETS AND CAVITIES - MEASUREMENT OF BINDING-SITEGEOMETRY AND IMPLICATIONS FOR LIGAND DESIGN, Protein science, 7(9), 1998, pp. 1884-1897
Citations number
59
Categorie Soggetti
Biology
Journal title
ISSN journal
09618368
Volume
7
Issue
9
Year of publication
1998
Pages
1884 - 1897
Database
ISI
SICI code
0961-8368(1998)7:9<1884:AOPPAC>2.0.ZU;2-3
Abstract
Identification and size characterization of surface pockets and occlud ed cavities are initial steps in protein structure-based ligand design . A new program, CAST, for automatically locating and measuring protei n pockets and cavities, is based on precise computational geometry met hods, including alpha shape and discrete flow theory. CAST identifies and measures pockets and pocket mouth openings, as well as cavities. T he program specifies the atoms lining pockets, pocket openings. and bu ried cavities; the volume and area of pockets and cavities; and the ar ea and circumference of mouth openings. CAST analysis of over 100 prot eins has been carried out; proteins examined include a set of 51 monom eric enzyme-ligand structures, several elastase-inhibitor complexes, t he FK506 binding protein, 30 HIV-1 protease-inhibitor complexes, and a number of small and large protein inhibitors, Medium-sized globular p roteins typically have 10-20 pockets/cavities. Most often, binding sit es are pockets with 1-2 mouth openings; much less frequently they are cavities. Ligand binding pockets vary widely in size, most within the range 10(2)-10(3) Angstrom(3). Statistical analysis reveals that the n umber of pockets and cavities is correlated with protein size, but the re is no correlation between the size of the protein and the size of b inding sites. Most frequently, the largest pocket/cavity is thp active site, but there are a number of instructive exceptions. Ligand volume and binding site volume are somewhat correlated when binding site vol ume is less than or equal to 700 Angstrom(3), but the ligand seldom oc cupies the entire site. Auxiliary pockets near the active site have be en suggested as additional binding surface for designed ligands (Matto s C ct al., 1993, Nat Struct Biol 1:55-58). Analysis of elastase-inhib itor complexes suggests that CAST can identify ancillary pockets, suit able for recruitment in ligand design strategies. Analysis of the FK50 6 binding protein, and of compounds developed in SAR by NMR (Shuker SE et al.. 1996, Science 274:1531-1534), indicates that CAST pocket comp utation may provide a priori identification of target proteins for Lin ked-fragment design. CAST analysis of 30 HIV-1 protease-inhibitor comp lexes shows that the flexible active site pocket can vary over a range of 853-1,566 Angstrom(3), and that there are two pockets near or adjo ining the active site that may be recruited for ligand design.