Estimating the total number of protein folds

Authors

Govindarajan, S; Recabarren, R; Goldstein, RK

Citation

S. Govindarajan et al., Estimating the total number of protein folds, PROTEINS, 35(4), 1999, pp. 408-414

Citations number

Subject categories

Biochemistry & Biophysics

Journal title

PROTEINS-STRUCTURE FUNCTION AND GENETICS

ISSN journal

0887-3585

Volume

35

Issue

4

Year of publication

1999

Pages

408 - 414

Database

ISI

SICI code

0887-3585(19990601)35:4<408:ETTNOP>2.0.ZU;2-0

Abstract

Many seemingly unrelated protein families share common folds. Theoretical models based on structure designability have suggested that a few folds should be very common while many others have low probability. In agreement with the predictions of these models, we show that the distribution of observed protein families over different folds can be modeled with a highly-stretched exponential. Our results suggest that there are approximately 4,000 possible folds, some so unlikely that only approximately 2,000 folds exist among naturally-occurring proteins. Due to the large number of extremely rare folds, constructing a comprehensive database of all existent folds would be difficult. Constructing a database of the most-likely folds representing the vast majority of protein families would be considerably easier. Proteins 1999;35:408-414. © 1999 Wiley-Liss, Inc.