A rapid method for exploring the protein structure universe

Citation
Mm. Young et al., A rapid method for exploring the protein structure universe, PROTEINS, 34(3), 1999, pp. 317-332
Citations number
39
Categorie Soggetti
Biochemistry & Biophysics
Journal title
PROTEINS-STRUCTURE FUNCTION AND GENETICS
ISSN journal
08873585 → ACNP
Volume
34
Issue
3
Year of publication
1999
Pages
317 - 332
Database
ISI
SICI code
0887-3585(19990215)34:3<317:ARMFET>2.0.ZU;2-2
Abstract
We have developed an automatic protein fingerprinting method for the evalua tion of protein structural similarities based on secondary structure elemen t compositions, spatial arrangements, lengths, and topologies, This method can rapidly identify proteins sharing structural homologies as we demonstra te with five test cases: the globins, the mammalian trypsinlike serine prot eases, the immunoglobulins, the cupredoxins, and the actinlike ATPase domai n-containing proteins. Principal components analysis of the similarity dist ance matrix calculated from an all-by-all comparison of 1,031 unique chains in the Protein Data Bank has produced a distribution of structures within a high-dimensional structural space. Fifty percent of the variance observed for this distribution is bounded by six axes, two of which encode structur al variability within two large families, the immunoglobulins and the tryps inlike serine proteases, Many aspects of the spatial distribution remain st able upon reduction of the database to 140 proteins with minimal family ove rlap. The axes correlated with specific structural families are no longer o bserved. A clear hierarchy of organization is seen in the arrangement of pr otein structures in the universe. At the highest level, protein structures populate regions corresponding to the all-alpha, all-beta, and alpha/beta s uperfamilies, Large protein families are arranged along family-specific axe s, forming local densely populated regions within the space. The lowest lev el of organization is intrafamilial; homologous structures are ordered by v ariations in peripheral secondary structure elements or by conformational s hifts in the tertiary structure. Proteins 1999; 34:317-332, (C) 1999Wiley-L iss, Inc.