SPECTRAL-ANALYSIS OF PHYLOGENETIC DATA

Authors
Citation
Md. Hendy, SPECTRAL-ANALYSIS OF PHYLOGENETIC DATA, Journal of classification, 10(1), 1993, pp. 5-24
Citations number
25
Categorie Soggetti
Social Sciences, Mathematical Methods","Mathematical, Methods, Social Sciences
Journal title
ISSN journal
01764268
Volume
10
Issue
1
Year of publication
1993
Pages
5 - 24
Database
ISI
SICI code
0176-4268(1993)10:1<5:SOPD>2.0.ZU;2-0
Abstract
The spectral analysis of sequence and distance data is a new approach to phylogenetic analysis. For two-state character sequences, the chara cter values at a given site split the set of taxa into two subsets, a bipartition of the taxa set. The vector which counts the relative numb ers of each of these bipartitions over all sites is called a sequence spectrum. Applying a transformation called a Hadamard conjugation, the sequence spectrum is transformed to the conjugate spectrum. This conj ugation corrects for unobserved changes in the data, independently fro m the choice of phylogenetic tree. For any given phylogenetic tree wit h edge weights (probabilities of state change), we define a correspond ing tree spectrum. The selection of a weighted phylogenetic tree from the given sequence data is made by matching the conjugate spectrum wit h a tree spectrum. We develop an optimality selection procedure using a least squares best fit, to find the phylogenetic tree whose tree spe ctrum most closely matches the conjugate spectrum. An inferred sequenc e spectrum can be derived from the selected tree spectrum using the in verse Hadamard conjugation to allow a comparison with the original seq uence spectrum.