PROGEN - AN AUTOMATED MODELING ALGORITHM FOR THE GENERATION OF COMPLETE PROTEIN STRUCTURES FROM THE ALPHA-CARBON ATOMIC COORDINATES

Citation
C. Mandal et Ds. Linthicum, PROGEN - AN AUTOMATED MODELING ALGORITHM FOR THE GENERATION OF COMPLETE PROTEIN STRUCTURES FROM THE ALPHA-CARBON ATOMIC COORDINATES, Journal of computer-aided molecular design, 7(2), 1993, pp. 199-224
Citations number
17
Categorie Soggetti
Biology
ISSN journal
0920654X
Volume
7
Issue
2
Year of publication
1993
Pages
199 - 224
Database
ISI
SICI code
0920-654X(1993)7:2<199:P-AAMA>2.0.ZU;2-M
Abstract
A modelling algorithm (PROGEN) for the generation of complete protein atomic coordinates from only the alpha-carbon coordinates is described . PROGEN utilizes an optimal geometry parameter (OGP) database for the positioning of atoms for each amino acid of the polypeptide model. Th e OGP database was established by examining the statistical correlatio ns between 23 different intra-peptide and inter-peptide geometric para meters relative to the alpha-carbon distances for each amino acid in a library of 19 known proteins from the Brookhaven Protein Database (BP DB). The OGP files for specific amino acids and peptides were used to generate the atomic positions, with respect to alpha-carbons, for main -chain and side-chain atoms in the modelled structure. Refinement of t he initial model was accomplished using energy minimization (EM) and m olecular dynamics techniques. PROGEN was tested using 60 known protein s in the BPDB, representing a wide spectrum of primary and secondary s tructures. Comparison between PROGEN models and BPDB crystal reference structures gave r.m.s.d. values for peptide main-chain atoms between 0.29 and 0.76 angstrom, with a grand average of 0.53 angstrom for all 60 models. The r.m.s.d. for all non-hydrogen atoms ranged between 1.44 and 1.93 angstrom for the 60 polypeptide models. PROGEN was also able to make the correct assignment of cis- or trans-proline configuration s in the protein structures examined. PROGEN offers a fully automatic building and refinement procedure and requires no special or specific structural considerations for the protein to be modelled.