A METHOD TO RECOGNIZE DISTANT REPEATS IN PROTEIN SEQUENCES

Authors
J. Heringa, P. Argos
Citation
J. Heringa and P. Argos, A METHOD TO RECOGNIZE DISTANT REPEATS IN PROTEIN SEQUENCES, Proteins, 17(4), 1993, pp. 391-411
Number of citations
42
Subject categories
Biology
Journal title
Proteins
ISSN journal
08873585
Volume
17
Issue
4
Year of publication
1993
Pages
391 - 411
Database
ISI
SICI code
0887-3585(1993)17:4<391:AMTRDR>2.0.ZU;2-5
Abstract
An automated algorithm is presented that delineates protein sequence fragments which display similarity. The method incorporates a selection of a number of local nonoverlapping sequence alignments with the highest similarity scores and a graph-theoretical approach to elucidate the consistent start and end points of the fragments comprising one or more ensembles of related subsequences. The procedure allows the simultaneous identification of different types of repeats within one sequence. A multiple alignment of the resulting fragments is performed and a consensus sequence derived from the ensemble(s). Finally, a profile is constructed from the multiple alignment to detect possible and more distant members within the sequence. The method tolerates mutations in the repeats as well as insertions and deletions. The sequence spans between the various repeats or repeat clusters may be of different lengths. The technique has been applied to a number of proteins where the repeating fragments have been derived from information additional to the protein sequences. (C) 1993 Wiley-Liss, Inc.
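
Illustrative sketch (not part of the original record)
The abstract's first step, finding high-scoring local alignments of a sequence against itself, can be illustrated with a minimal sketch. This is not the authors' algorithm: it is a plain Smith-Waterman-style self-comparison that returns only the single best off-diagonal alignment. The function name `best_repeat_pair` and the scoring values (match +2, mismatch -1, linear gap -2) are assumptions chosen for illustration; a real application would more likely use an amino acid exchange matrix. The sketch does not enforce nonoverlapping fragments, and it omits the graph-theoretical, multiple-alignment, and profile steps.

```python
def best_repeat_pair(seq, match=2, mismatch=-1, gap=-2):
    """Best local alignment of seq against itself, off the main diagonal.

    Returns (score, span_a, span_b) with 0-based half-open spans,
    or (0, None, None) if no positive-scoring alignment exists.
    Scoring values are illustrative assumptions, not from the paper.
    """
    n = len(seq)
    # H[i][j]: best score of a local alignment ending at seq[i-1], seq[j-1]
    H = [[0] * (n + 1) for _ in range(n + 1)]
    best, best_cell = 0, None
    for i in range(1, n + 1):
        # Only fill cells with j > i so the trivial self-match is excluded
        for j in range(i + 1, n + 1):
            s = match if seq[i - 1] == seq[j - 1] else mismatch
            H[i][j] = max(
                0,
                H[i - 1][j - 1] + s,  # align the two residues
                H[i - 1][j] + gap,    # gap in the second fragment
                H[i][j - 1] + gap,    # gap in the first fragment
            )
            if H[i][j] > best:
                best, best_cell = H[i][j], (i, j)
    if best_cell is None:
        return 0, None, None

    # Trace back from the best cell until the score drops to zero
    i, j = best_cell
    end_i, end_j = i, j
    while i > 0 and j > 0 and H[i][j] > 0:
        s = match if seq[i - 1] == seq[j - 1] else mismatch
        if H[i][j] == H[i - 1][j - 1] + s:
            i, j = i - 1, j - 1
        elif H[i][j] == H[i - 1][j] + gap:
            i -= 1
        else:
            j -= 1
    return best, (i, end_i), (j, end_j)


if __name__ == "__main__":
    toy = "MKTAYWQLLNNPGMKTAYWQLL"  # made-up sequence with one exact repeat
    score, a, b = best_repeat_pair(toy)
    print(score, toy[a[0]:a[1]], toy[b[0]:b[1]])
```

For the made-up toy sequence above, the two copies of MKTAYWQLL are recovered as spans (0, 9) and (13, 22) with a score of 18. The method described in the abstract goes further by keeping several top-scoring nonoverlapping alignments and reconciling their start and end points.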