SIZE-INDEPENDENT COMPARISON OF PROTEIN 3-DIMENSIONAL STRUCTURES

Citation
Vn. Maiorov et Gm. Crippen, SIZE-INDEPENDENT COMPARISON OF PROTEIN 3-DIMENSIONAL STRUCTURES, Proteins, 22(3), 1995, pp. 273-283
Citations number
19
Categorie Soggetti
Biology
Journal title
ISSN journal
08873585
Volume
22
Issue
3
Year of publication
1995
Pages
273 - 283
Database
ISI
SICI code
0887-3585(1995)22:3<273:SCOP3S>2.0.ZU;2-0
Abstract
Protein structures are routinely compared by their root-mean-square de viation (RMSD) in atomic coordinates after optimal rigid body superpos ition. What is not so clear is the significance of different RMSD valu es, particularly above the customary arbitrary cutoff for obvious simi larity of 2-3 Angstrom. Our earlier work argued for an intrinsic cutof f for protein similarity that varied with the number of residues in th e polypeptide chains being compared. Here we introduce a new measure, rho, of structural similarity based on RMSD that is independent of the sizes of the molecules involved, or of any other special properties o f molecules. When rho is less than 0.4-0.5, protein structures are vis ually recognized to be obviously similar, but the mathematically pleas ing intrinsic cutoff of rho < 1.0 corresponds to overall similarity in folding motif at a level not usually recognized until smoothing of th e polypeptide chain path makes it striking. When the structures are sc aled to unit radius of gyration and equal principle moments of inertia , the comparisons are even more universal, since they are no longer ob scured by differences in overall size and ellipticity, With increasing chain length, the distribution of rho for pairs of random structures is skewed to higher values, but the value for the best 1% of the compa risons rises only slowly with the number of residues. This level is cl ose to an intrinsic cutoff between similar and dissimilar comparisons, namely the maximal scaled rho possible for the two structures to be m ore similar to each other than one is to the other's mirror image. The intrinsic cutoff is independent of the number of residues or points b eing compared. For proteins having fewer than 100 residues, the 1% rho falls below the intrinsic cutoff, so that for very small proteins, ge ometrically significant similarity can often occur by chance. We belie ve these ideas will be helpful in judging success in NMR structure det ermination and protein folding modeling. (C) 1995 Wiley-Liss, Inc.