N. Colloch et al., COMPARISON OF 3 ALGORITHMS FOR THE ASSIGNMENT OF SECONDARY STRUCTURE IN PROTEINS - THE ADVANTAGES OF A CONSENSUS ASSIGNMENT, Protein engineering, 6(4), 1993, pp. 377-382
Accurate assignments of secondary structures in proteins are crucial f
or a useful comparison with theoretical predictions. Three major progr
ams which automatically determine the location of helices and strands
are used for this purpose, namely DSSP, P-Curve and Define. Their resu
lts have been compared for a non-redundant database of 154 proteins. O
n a residue per residue basis, the percentage match score is only 63%
between the three methods. While these methods agree on the overall nu
mber of residues in each of the three states (helix, strand or coil),
they differ on the number of helices or strands, thus implying a wide
discrepancy in the length of assigned structural elements. Moreover, t
he length distribution of helices and strands points to the existence
of artefacts inherent to each assignment algorithm. To overcome these
difficulties a consensus assignment is proposed where each residue is
assigned to the state determined by at least two of the three methods.
With this assignment the artefacts of each algorithm are attenuated.
The residues assigned in the same state by the three methods are bette
r predicted than the others. This assignment will thus be useful for a
nalysing the success rate of prediction methods more accurately.