Domain identification by clustering sequence alignments

Authors
Citation
Xj. Guan et L. Du, Domain identification by clustering sequence alignments, BIOINFORMAT, 14(9), 1998, pp. 783-788
Citations number
13
Categorie Soggetti
Multidisciplinary
Journal title
BIOINFORMATICS
ISSN journal
13674803 → ACNP
Volume
14
Issue
9
Year of publication
1998
Pages
783 - 788
Database
ISI
SICI code
1367-4803(1998)14:9<783:DIBCSA>2.0.ZU;2-I
Abstract
Motivation: As sequence databases grow rapidly results from sequence compar ison searches using fast search methods such as BLAST and FASTA tend to be long and difficult to digest. Results: In this paper, we present a new method to extract domain informati on from sequence comparison searches by clustering the resulting alignments according to their similarity to the query sequence. Efficient tree struct ures and algorithms are used to organize the alignment data such that struc turally conserved elements can be easily identified. The hierarchical natur e of the data structures used and the flexible X-Window-based interface pro vide an efficient and intuitive means to explore the alignment data at diff erent levels so that the common domains, as well as distantly related featu res, can be explored.