J.D. Thompson et al., DbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches, Nucleic Acids Research, 28(15), 2000, pp. 2919-2926
DbClustal addresses the important problem of automatically producing a multiple alignment of the top-scoring full-length sequences detected by a database homology search. By combining the advantages of both local and global alignment algorithms in a single system, DbClustal is able to provide accurate global alignments of highly divergent, complex sequence sets. Local alignment information is incorporated into a ClustalW global alignment in the form of a list of anchor points between pairs of sequences. The method is demonstrated using anchors supplied by the Blast post-processing program, Ballast. The rapidity and reliability of DbClustal have been demonstrated using the recently annotated Pyrococcus abyssi proteome, where the proportion of alignments with totally misaligned sequences was reduced from 20% to <2%. A web site has been set up that offers BlastP database searches with automatic alignment of the top hits by DbClustal.
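
The idea of feeding local anchor points into a global alignment can be illustrated with a minimal sketch. It is not DbClustal's implementation. It assumes a plain Needleman-Wunsch pairwise alignment with a linear gap penalty, in which each anchored residue pair receives an additive score bonus. The anchor layout `(start_a, start_b, length, weight)`, the function names, and the scoring values are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical anchor layout (an assumption, not DbClustal's format):
# (start_in_a, start_in_b, length, weight), with 0-based positions.
Anchor = Tuple[int, int, int, float]


def anchor_bonus(anchors: List[Anchor]) -> Dict[Tuple[int, int], float]:
    """Expand anchor segments into a bonus for each anchored residue pair."""
    bonus: Dict[Tuple[int, int], float] = {}
    for start_a, start_b, length, weight in anchors:
        for k in range(length):
            key = (start_a + k, start_b + k)
            bonus[key] = max(bonus.get(key, 0.0), weight)
    return bonus


def anchored_global_align(
    a: str,
    b: str,
    anchors: List[Anchor],
    match: float = 1.0,
    mismatch: float = -1.0,
    gap: float = -2.0,
) -> Tuple[str, str, float]:
    """Needleman-Wunsch global alignment with an additive anchor bonus."""
    bonus = anchor_bonus(anchors)

    def sub(i: int, j: int) -> float:
        # Substitution score for a[i] vs b[j], plus any anchor bonus.
        base = match if a[i] == b[j] else mismatch
        return base + bonus.get((i, j), 0.0)

    n, m = len(a), len(b)
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = score[i - 1][0] + gap
    for j in range(1, m + 1):
        score[0][j] = score[0][j - 1] + gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i - 1, j - 1),  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,                    # gap in b
                score[i][j - 1] + gap,                    # gap in a
            )

    # Trace back from the bottom-right corner, preferring diagonal moves.
    out_a: List[str] = []
    out_b: List[str] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i - 1, j - 1):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]


if __name__ == "__main__":
    # Without anchors, the T of "TACA" pairs with the second T of "GATTACA".
    print(anchored_global_align("GATTACA", "TACA", []))
    # An anchor pairing a[2] with b[0] shifts it to the first T.
    print(anchored_global_align("GATTACA", "TACA", [(2, 0, 1, 5.0)]))
```

In this toy case the unanchored alignment is `GATTACA` / `---TACA`, while the anchor moves the `T` to give `GATTACA` / `--T-ACA`. This shows how a local match can steer an otherwise ambiguous global alignment. A real system would use a protein substitution matrix and affine gap penalties rather than the fixed scores assumed here.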