Secator: A program for inferring protein subfamilies from phylogenetic trees

Citation
N. Wicker et al., Secator: A program for inferring protein subfamilies from phylogenetic trees, MOL BIOL EV, 18(8), 2001, pp. 1435-1441
Number of citations
25
Subject Categories
Biology, Experimental Biology
Journal title
MOLECULAR BIOLOGY AND EVOLUTION
ISSN journal
0737-4038
Volume
18
Issue
8
Year of publication
2001
Pages
1435 - 1441
Database
ISI
SICI code
0737-4038(200108)18:8<1435:SAPFIP>2.0.ZU;2-#
Abstract
With the huge increase of protein data, an important problem is to estimate, within a large protein family, the number of sensible subsets for subsequent in-depth structural, functional, and evolutionary analyses. To tackle this problem, we developed a new program, Secator, which implements the principle of an ascending hierarchical method using a distance matrix based on a multiple alignment of protein sequences. Dissimilarity values assigned to the nodes of a deduced phylogenetic tree are partitioned by a new stopping rule introduced to automatically determine the significant dissimilarity values. The quality of the clusters obtained by Secator is verified by a separate Jackknife study. The method is demonstrated on 24 large protein families covering a wide spectrum of structural and sequence conservation and its usefulness and accuracy with real biological data is illustrated on two well-studied protein families (the Sm proteins and the nuclear receptors).
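
Illustrative sketch
The general idea described in the abstract (ascending hierarchical clustering on a distance matrix, followed by a rule that separates significant node dissimilarities from the rest) can be sketched as below. This is not Secator's implementation: the average-linkage criterion, the "largest gap" threshold used in place of the paper's stopping rule, the function names, and the toy distance matrix are all assumptions made for illustration only.

```python
from itertools import combinations


def agglomerate(dist):
    """Ascending (agglomerative) clustering with average linkage (an assumption).

    dist is a symmetric list-of-lists of pairwise distances.
    Returns the merges in order as (members_a, members_b, dissimilarity).
    """
    clusters = {i: [i] for i in range(len(dist))}
    merges = []
    next_id = len(dist)

    def avg(a, b):
        # Mean pairwise distance between the members of two clusters.
        total = sum(dist[i][j] for i in clusters[a] for j in clusters[b])
        return total / (len(clusters[a]) * len(clusters[b]))

    while len(clusters) > 1:
        a, b = min(combinations(sorted(clusters), 2), key=lambda p: avg(*p))
        height = avg(a, b)
        merges.append((clusters[a], clusters[b], height))
        clusters[next_id] = clusters.pop(a) + clusters.pop(b)
        next_id += 1
    return merges


def largest_gap_threshold(heights):
    """Stand-in stopping rule (NOT the rule from the paper).

    Splits the sorted node dissimilarities at their widest gap and returns
    the midpoint of that gap. Needs at least two values.
    """
    hs = sorted(heights)
    _, k = max((hs[i + 1] - hs[i], i) for i in range(len(hs) - 1))
    return (hs[k] + hs[k + 1]) / 2


def subfamilies(dist):
    """Keep only merges below the threshold; the surviving groups are the subsets."""
    merges = agglomerate(dist)
    threshold = largest_gap_threshold([h for _, _, h in merges])
    groups = {i: {i} for i in range(len(dist))}
    for a, b, height in merges:
        if height < threshold:
            merged = groups[a[0]] | groups[b[0]]
            for leaf in merged:
                groups[leaf] = merged
    return sorted({tuple(sorted(g)) for g in groups.values()})


# Hypothetical distances for five sequences: 0-2 are close, 3-4 are close.
example = [
    [0.0, 0.1, 0.2, 0.9, 0.9],
    [0.1, 0.0, 0.2, 0.9, 0.9],
    [0.2, 0.2, 0.0, 0.9, 0.9],
    [0.9, 0.9, 0.9, 0.0, 0.1],
    [0.9, 0.9, 0.9, 0.1, 0.0],
]
print(subfamilies(example))
```

In a real analysis the distance matrix would be derived from a multiple alignment of the protein sequences, as the abstract states; the toy matrix above only shows how low-dissimilarity merges are kept while the high-dissimilarity merge is treated as a boundary between subsets.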