ITA
ENG

Secator: A program for inferring protein subfamilies from phylogenetic trees

Authors

Wicker, N Perrin, GR Thierry, JC Poch, O

Citation

N. Wicker et al., Secator: A program for inferring protein subfamilies from phylogenetic trees, MOL BIOL EV, 18(8), 2001, pp. 1435-1441

Citations number

Categorie Soggetti

Biology,"Experimental Biology

Journal title

MOLECULAR BIOLOGY AND EVOLUTION

ISSN journal

07374038 → ACNP

Volume

Issue

Year of publication

2001

Pages

1435 - 1441

Database

ISI

SICI code

0737-4038(200108)18:8<1435:SAPFIP>2.0.ZU;2-#

Abstract

With the huge increase of protein data, an important problem is to estimate , within a large protein family, the number of sensible subsets for subsequ ent in-depth structural, functional, and evolutionary analyses. To tackle t his problem, we developed a new program, Secator, which implements the prin ciple of an ascending hierarchical method using a distance matrix based on a multiple alignment of protein sequences. Dissimilarity values assigned to the nodes of a deduced phylogenetic tree are partitioned by a new stopping rule introduced to automatically determine the significant dissimilarity v alues. The quality of the clusters obtained by Secator is verified by a sep arate Jackknife study. The method is demonstrated on 24 large protein famil ies covering a wide spectrum of structural and sequence conservation and it s usefulness and accuracy with real biological data is illustrated on two w ell-studied protein families (the Sm proteins and the nuclear receptors).