ALL-IN-ONE - A HIGHLY DETAILED ROTAMER LIBRARY IMPROVES BOTH ACCURACYAND SPEED IN THE MODELING OF SIDE-CHAINS BY DEAD-END ELIMINATION

Citation
M. Demaeyer et al., ALL-IN-ONE - A HIGHLY DETAILED ROTAMER LIBRARY IMPROVES BOTH ACCURACYAND SPEED IN THE MODELING OF SIDE-CHAINS BY DEAD-END ELIMINATION, Folding & design, 2(1), 1997, pp. 53-66
Citations number
65
Categorie Soggetti
Biology,Biophysics
Journal title
ISSN journal
13590278
Volume
2
Issue
1
Year of publication
1997
Pages
53 - 66
Database
ISI
SICI code
1359-0278(1997)2:1<53:A-AHDR>2.0.ZU;2-0
Abstract
Background: About a decade ago, the concept of rotamer libraries was i ntroduced to model sidechains given known mainchain coordinates. Since then, several groups have developed methods to handle the challenging combinatorial problem that is faced when searching rotamer libraries. To avoid a combinatorial explosion, the dead-end elimination method d etects and eliminates rotamers that cannot be members of the global mi nimum energy conformation (GMEC). Several groups have applied and furt her developed this method in the fields of homology modelling and prot ein design. Results: This work addresses at the same time increased pr ediction accuracy and calculation speed improvements. The proposed enh ancements allow the elimination of more than one-third of the possible rotameric states before applying the dead-end elimination method. Thi s is achieved by using a highly detailed rotamer library allowing the safe application of an energy-based rejection criterion without riskin g the elimination of a GMEC rotamer. As a result, we gain both in mode lling accuracy and in computational speed. Being completely automated, the current implementation of the dead-end elimination prediction of protein sidechains can be applied to the modelling of sidechains of pr oteins of any size on the high-end computer systems currently used in molecular modelling. The improved accuracy is highlighted in a compara tive study on a collection of proteins of varying size for which score results have previously been published by multiple groups. Furthermor e, we propose a new validation method for the scoring of the modelled structure versus the experimental data based upon the volume overlap o f the predicted and observed sidechains. This overlap criterion is dis cussed in relation to the classic RMSD and the frequently used +/-40 d egrees window in comparing chi 1 and chi 2 angles. Conclusions: We hav e shown that a very detailed library allows the introduction of a safe energy threshold rejection criterion, thereby increasing both the exe cution speed and the accuracy of the modelling program. We speculate t hat the current method will allow the sidechain prediction of medium-s ized proteins and complex protein interfaces involving up to 150 resid ues on low-end desktop computers.