Estimating the total number of protein folds

Citation
S. Govindarajan et al., Estimating the total number of protein folds, PROTEINS, 35(4), 1999, pp. 408-414
Citations number
21
Categorie Soggetti
Biochemistry & Biophysics
Journal title
PROTEINS-STRUCTURE FUNCTION AND GENETICS
ISSN journal
08873585 → ACNP
Volume
35
Issue
4
Year of publication
1999
Pages
408 - 414
Database
ISI
SICI code
0887-3585(19990601)35:4<408:ETTNOP>2.0.ZU;2-0
Abstract
Many seemingly unrelated protein families share common folds. Theoretical m odels based on structure designability have suggested that a few folds shou ld be very common while many others have low probability. In agreement with the predictions of these models, we show that the distribution of observed protein families over different folds can be modeled with a highly-stretch ed exponential. Our results suggest that there are approximately 4,000 poss ible folds, some so unlikely that only approximately 2,000 folds existing a mong naturally-occurring proteins. Due to the large number of extremely rar e folds, constructing a comprehensive database of all existent folds would be difficult. Constructing a database of the most-likely folds representing the vast majority of protein families would be considerably easier. Protei ns 1999;35:408-414. (C) 1999 Wiley-Liss, Inc.