Structure-based evaluation of sequence comparison and fold recognition alignment accuracy

Citation
Fs. Domingues et al., Structure-based evaluation of sequence comparison and fold recognition alignment accuracy, J MOL BIOL, 297(4), 2000, pp. 1003-1013
Citations number
40
Categorie Soggetti
Molecular Biology & Genetics
Journal title
JOURNAL OF MOLECULAR BIOLOGY
ISSN journal
00222836 → ACNP
Volume
297
Issue
4
Year of publication
2000
Pages
1003 - 1013
Database
ISI
SICI code
0022-2836(20000407)297:4<1003:SEOSCA>2.0.ZU;2-B
Abstract
The biological role, biochemical function, and structure of uncharacterized protein sequences is often inferred from their similarity to known protein s. A constant goal is to increase the reliability, sensitivity, and accurac y of alignment techniques to enable the detection of increasingly distant r elationships. Development, tuning, and testing of these methods benefit fro m appropriate benchmarks for the assessment of alignment accuracy. Here, we describe a benchmark protocol to estimate sequence-to-sequence and sequence-to-structure alignment accuracy. The protocol consists of structu rally related pairs of proteins and procedures to evaluate alignment accura cy over the whole set. The set of protein pairs covers all the currently kn own fold types. The benchmark is challenging in the sense that it consists of proteins lacking clear sequence similarity. Correct target alignments are derived from the three-dimensional structures of these pairs by rigid body superposition. An evaluation engine computes the accuracy of alignments obtained from a particular algorithm in terms of alignment shifts with respect to the structure derived alignments. Using t his benchmark we estimate that the best results can be obtained from a comb ination of amino acid residue substitution matrices and knowledge-based pot entials. (C) 2000 Academic Press.