Phylogenetic mixtures: Concentration of measure in the large-tree limit

Citation
Mossel, Elchanan et Roch, Sebastien, Phylogenetic mixtures: Concentration of measure in the large-tree limit, Annals of applied probability , 22(6), 2012, pp. 2429-2459
ISSN journal
10505164
Volume
22
Issue
6
Year of publication
2012
Pages
2429 - 2459
Database
ACNP
SICI code
Abstract
The reconstruction of phylogenies from DNA or protein sequences is a major task of computational evolutionary biology. Common phenomena, notably variations in mutation rates across genomes and incongruences between gene lineage histories, often make it necessary to model molecular data as originating from a mixture of phylogenies. Such mixed models play an increasingly important role in practice. Using concentration of measure techniques, we show that mixtures of large trees are typically identifiable. We also derive sequence-length requirements for high-probability reconstruction.