De novo protein design. II. Plasticity in sequence space

Citation
P. Koehl et M. Levitt, De novo protein design. II. Plasticity in sequence space, J MOL BIOL, 293(5), 1999, pp. 1183-1193
Citations number
59
Categorie Soggetti
Molecular Biology & Genetics
Journal title
JOURNAL OF MOLECULAR BIOLOGY
ISSN journal
00222836 → ACNP
Volume
293
Issue
5
Year of publication
1999
Pages
1183 - 1193
Database
ISI
SICI code
0022-2836(19991112)293:5<1183:DNPDIP>2.0.ZU;2-A
Abstract
It is generally accepted that many different protein sequences have similar folded structures, and that there is a relatively high probability that a new sequence possesses a previously observed fold. An indirect consequence of this is that protein design should define the sequence space accessible to a given structure, rather than providing a single optimized sequence. We have recently developed a new approach for protein sequence design, which optimizes the complete sequence of a protein based on the knowledge of its backbone structure, its amino acid composition and a physical energy functi on including van der Waals interactions, electrostatics, and environment fr ee energy. The specificity of the designed sequence for its template backbo ne is imposed by keeping the amino acid composition fixed. Here, we show th at our procedure converges in sequence space, albeit not to the native sequ ence of the protein. We observe that while polar residues are well conserve d in our designed sequences, non-polar amino acids at the surface of a prot ein are often replaced by polar residues. The designed sequences provide a multiple alignment of sequences that all adopt the same three-dimensional f old. This alignment is used to derive a profile matrix for chicken triose p hosphate isomerase, TIM. The matrix is found to recognize significantly the native sequence for TIM, as well as closely related sequences. Possible ap plication of this approach to protein fold recognition is discussed. (C) 19 99 Academic Press.