{"title":"Inconsistency of parsimony under the multispecies coalescent","authors":"Daniel Rickert, Wai-Tong Fan, Matthew Hahn","doi":"arxiv-2407.02634","DOIUrl":null,"url":null,"abstract":"While it is known that parsimony can be statistically inconsistent under\ncertain models of evolution due to high levels of homoplasy, the consistency of\nparsimony under the multispecies coalescent (MSC) is less well studied.\nPrevious studies have shown the consistency of concatenated parsimony\n(parsimony applied to concatenated alignments) under the MSC for the rooted\n4-taxa case under an infinite-sites model of mutation; on the other hand, other\nwork has also established the inconsistency of concatenated parsimony for the\nunrooted 6-taxa case. These seemingly contradictory results suggest that\nconcatenated parsimony may fail to be consistent for trees with more than 5\ntaxa, for all unrooted trees, or for some combination of the two. Here, we\npresent a technique for computing the expected internal branch lengths of gene\ntrees under the MSC. This technique allows us to determine the regions of the\nparameter space of the species tree under which concatenated parsimony fails\nfor different numbers of taxa, for rooted or unrooted trees. We use our new\napproach to demonstrate that there are always regions of statistical\ninconsistency for concatenated parsimony for the 5- and 6-taxa cases,\nregardless of rooting. Our results therefore suggest that parsimony is not\ngenerally dependable under the MSC.","PeriodicalId":501044,"journal":{"name":"arXiv - QuanBio - Populations and Evolution","volume":"25 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Populations and Evolution","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.02634","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
While it is known that parsimony can be statistically inconsistent under
certain models of evolution due to high levels of homoplasy, the consistency of
parsimony under the multispecies coalescent (MSC) is less well studied.
Previous studies have shown the consistency of concatenated parsimony
(parsimony applied to concatenated alignments) under the MSC for the rooted
4-taxa case under an infinite-sites model of mutation; on the other hand, other
work has also established the inconsistency of concatenated parsimony for the
unrooted 6-taxa case. These seemingly contradictory results suggest that
concatenated parsimony may fail to be consistent for trees with more than 5
taxa, for all unrooted trees, or for some combination of the two. Here, we
present a technique for computing the expected internal branch lengths of gene
trees under the MSC. This technique allows us to determine the regions of the
parameter space of the species tree under which concatenated parsimony fails
for different numbers of taxa, for rooted or unrooted trees. We use our new
approach to demonstrate that there are always regions of statistical
inconsistency for concatenated parsimony for the 5- and 6-taxa cases,
regardless of rooting. Our results therefore suggest that parsimony is not
generally dependable under the MSC.