Identifiability of a Markovian model of molecular evolution with gamma-distributed rates

Citation
S. Allman, Elizabeth et al., Identifiability of a Markovian model of molecular evolution with gamma-distributed rates, Advances in applied probability , 40(1), 2008, pp. 229-249
ISSN journal
00018678
Volume
40
Issue
1
Year of publication
2008
Pages
229 - 249
Database
ACNP
SICI code
Abstract
Inference of evolutionary trees and rates from biological sequences is commonly performed using continuous-time Markov models of character change. The Markov process evolves along an unknown tree while observations arise only from the tips of the tree. Rate heterogeneity is present in most real data sets and is accounted for by the use of flexible mixture models where each site is allowed its own rate. Very little has been rigorously established concerning the identifiability of the models currently in common use in data analysis, although nonidentifiability was proven for a semiparametric model and an incorrect proof of identifiability was published for a general parametric model (GTR + . + I). Here we prove that one of the most widely used models (GTR + .) is identifiable for generic parameters, and for all parameter choices in the case of four-state (DNA) models. This is the first proof of identifiability of a phylogenetic model with a continuous distribution of rates.