Ian G Brennan, Alan R Lemmon, Emily Moriarty Lemmon, Conrad J Hoskin, Stephen C Donnellan, J Scott Keogh
{"title":"Populating a Continent: Phylogenomics Reveal the Timing of Australian Frog Diversification.","authors":"Ian G Brennan, Alan R Lemmon, Emily Moriarty Lemmon, Conrad J Hoskin, Stephen C Donnellan, J Scott Keogh","doi":"10.1093/sysbio/syad048","DOIUrl":"10.1093/sysbio/syad048","url":null,"abstract":"<p><p>The Australian continent's size and isolation make it an ideal place for studying the accumulation and evolution of biodiversity. Long separated from the ancient supercontinent Gondwana, most of Australia's plants and animals are unique and endemic, including the continent's frogs. Australian frogs comprise a remarkable ecological and morphological diversity categorized into a small number of distantly related radiations. We present a phylogenomic hypothesis based on an exon-capture dataset that spans the main clades of Australian myobatrachoid, pelodryadid hyloid, and microhylid frogs. Our time-calibrated phylogenomic-scale phylogeny identifies great disparity in the relative ages of these groups that vary from Gondwanan relics to recent immigrants from Asia and include arguably the continent's oldest living vertebrate radiation. This age stratification provides insight into the colonization of, and diversification on, the Australian continent through deep time, during periods of dramatic climatic and community changes. Contemporary Australian frog diversity highlights the adaptive capacity of anurans, particularly in response to heat and aridity, and explains why they are one of the continent's most visible faunas. [Anuran; adaptive radiation; Gondwana; phylogenetics].</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1-11"},"PeriodicalIF":6.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9917177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mark P Khurana, Neil Scheidwasser-Clow, Matthew J Penn, Samir Bhatt, David A Duchêne
{"title":"The Limits of the Constant-rate Birth-Death Prior for Phylogenetic Tree Topology Inference.","authors":"Mark P Khurana, Neil Scheidwasser-Clow, Matthew J Penn, Samir Bhatt, David A Duchêne","doi":"10.1093/sysbio/syad075","DOIUrl":"10.1093/sysbio/syad075","url":null,"abstract":"<p><p>Birth-death models are stochastic processes describing speciation and extinction through time and across taxa and are widely used in biology for inference of evolutionary timescales. Previous research has highlighted how the expected trees under the constant-rate birth-death (crBD) model tend to differ from empirical trees, for example, with respect to the amount of phylogenetic imbalance. However, our understanding of how trees differ between the crBD model and the signal in empirical data remains incomplete. In this Point of View, we aim to expose the degree to which the crBD model differs from empirically inferred phylogenies and test the limits of the model in practice. Using a wide range of topology indices to compare crBD expectations against a comprehensive dataset of 1189 empirically estimated trees, we confirm that crBD model trees frequently differ topologically compared with empirical trees. To place this in the context of standard practice in the field, we conducted a meta-analysis for a subset of the empirical studies. When comparing studies that used Bayesian methods and crBD priors with those that used other non-crBD priors and non-Bayesian methods (i.e., maximum likelihood methods), we do not find any significant differences in tree topology inferences. To scrutinize this finding for the case of highly imbalanced trees, we selected the 100 trees with the greatest imbalance from our dataset, simulated sequence data for these tree topologies under various evolutionary rates, and re-inferred the trees under maximum likelihood and using the crBD model in a Bayesian setting. We find that when the substitution rate is low, the crBD prior results in overly balanced trees, but the tendency is negligible when substitution rates are sufficiently high. Overall, our findings demonstrate the general robustness of crBD priors across a broad range of phylogenetic inference scenarios but also highlight that empirically observed phylogenetic imbalance is highly improbable under the crBD model, leading to systematic bias in data sets with limited information content.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"235-246"},"PeriodicalIF":6.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11129600/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139058708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Loïs Rancilhac, Erik D Enbody, Rebecca Harris, Takema Saitoh, Martin Irestedt, Yang Liu, Fumin Lei, Leif Andersson, Per Alström
{"title":"Introgression Underlies Phylogenetic Uncertainty But Not Parallel Plumage Evolution in a Recent Songbird Radiation.","authors":"Loïs Rancilhac, Erik D Enbody, Rebecca Harris, Takema Saitoh, Martin Irestedt, Yang Liu, Fumin Lei, Leif Andersson, Per Alström","doi":"10.1093/sysbio/syad062","DOIUrl":"10.1093/sysbio/syad062","url":null,"abstract":"<p><p>Instances of parallel phenotypic evolution offer great opportunities to understand the evolutionary processes underlying phenotypic changes. However, confirming parallel phenotypic evolution and studying its causes requires a robust phylogenetic framework. One such example is the \"black-and-white wagtails,\" a group of 5 species in the songbird genus Motacilla: 1 species, Motacilla alba, shows wide intra-specific plumage variation, while the 4r others form 2 pairs of very similar-looking species (M. aguimp + M. samveasnae and M. grandis + M. maderaspatensis, respectively). However, the 2 species in each of these pairs were not recovered as sisters in previous phylogenetic inferences. Their relationships varied depending on the markers used, suggesting that gene tree heterogeneity might have hampered accurate phylogenetic inference. Here, we use whole genome resequencing data to explore the phylogenetic relationships within this group, with a special emphasis on characterizing the extent of gene tree heterogeneity and its underlying causes. We first used multispecies coalescent methods to generate a \"complete evidence\" phylogenetic hypothesis based on genome-wide variants, while accounting for incomplete lineage sorting (ILS) and introgression. We then investigated the variation in phylogenetic signal across the genome to quantify the extent of discordance across genomic regions and test its underlying causes. We found that wagtail genomes are mosaics of regions supporting variable genealogies, because of ILS and inter-specific introgression. The most common topology across the genome, supporting M. alba and M. aguimp as sister species, appears to be influenced by ancient introgression. Additionally, we inferred another ancient introgression event, between M. alba and M. grandis. By combining results from multiple analyses, we propose a phylogenetic network for the black-and-white wagtails that confirms that similar phenotypes evolved in non-sister lineages, supporting parallel plumage evolution. Furthermore, the inferred reticulations do not connect species with similar plumage coloration, suggesting that introgression does not underlie parallel plumage evolution in this group. Our results demonstrate the importance of investing genome-wide patterns of gene tree heterogeneity to help understand the mechanisms underlying phenotypic evolution. [Gene tree heterogeneity; incomplete lineage sorting; introgression; parallel evolution; phylogenomics; plumage evolution; wagtails.].</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"12-25"},"PeriodicalIF":6.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11129591/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41161583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jessica A Rick, Chad D Brock, Alexander L Lewanski, Jimena Golcher-Benavides, Catherine E Wagner
{"title":"Reference Genome Choice and Filtering Thresholds Jointly Influence Phylogenomic Analyses.","authors":"Jessica A Rick, Chad D Brock, Alexander L Lewanski, Jimena Golcher-Benavides, Catherine E Wagner","doi":"10.1093/sysbio/syad065","DOIUrl":"10.1093/sysbio/syad065","url":null,"abstract":"<p><p>Molecular phylogenies are a cornerstone of modern comparative biology and are commonly employed to investigate a range of biological phenomena, such as diversification rates, patterns in trait evolution, biogeography, and community assembly. Recent work has demonstrated that significant biases may be introduced into downstream phylogenetic analyses from processing genomic data; however, it remains unclear whether there are interactions among bioinformatic parameters or biases introduced through the choice of reference genome for sequence alignment and variant calling. We address these knowledge gaps by employing a combination of simulated and empirical data sets to investigate the extent to which the choice of reference genome in upstream bioinformatic processing of genomic data influences phylogenetic inference, as well as the way that reference genome choice interacts with bioinformatic filtering choices and phylogenetic inference method. We demonstrate that more stringent minor allele filters bias inferred trees away from the true species tree topology, and that these biased trees tend to be more imbalanced and have a higher center of gravity than the true trees. We find the greatest topological accuracy when filtering sites for minor allele count (MAC) >3-4 in our 51-taxa data sets, while tree center of gravity was closest to the true value when filtering for sites with MAC >1-2. In contrast, filtering for missing data increased accuracy in the inferred topologies; however, this effect was small in comparison to the effect of minor allele filters and may be undesirable due to a subsequent mutation spectrum distortion. The bias introduced by these filters differs based on the reference genome used in short read alignment, providing further support that choosing a reference genome for alignment is an important bioinformatic decision with implications for downstream analyses. These results demonstrate that attributes of the study system and dataset (and their interaction) add important nuance for how best to assemble and filter short-read genomic data for phylogenetic inference.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"76-101"},"PeriodicalIF":6.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50162993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Is Over-parameterization a Problem for Profile Mixture Models?","authors":"Hector Baños, Edward Susko, Andrew J Roger","doi":"10.1093/sysbio/syad063","DOIUrl":"10.1093/sysbio/syad063","url":null,"abstract":"<p><p>Biochemical constraints on the admissible amino acids at specific sites in proteins lead to heterogeneity of the amino acid substitution process over sites in alignments. It is well known that phylogenetic models of protein sequence evolution that do not account for site heterogeneity are prone to long-branch attraction (LBA) artifacts. Profile mixture models were developed to model heterogeneity of preferred amino acids at sites via a finite distribution of site classes each with a distinct set of equilibrium amino acid frequencies. However, it is unknown whether the large number of parameters in such models associated with the many amino acid frequency vectors can adversely affect tree topology estimates because of over-parameterization. Here, we demonstrate theoretically that for long sequences, over-parameterization does not create problems for estimation with profile mixture models. Under mild conditions, tree, amino acid frequencies, and other model parameters converge to true values as sequence length increases, even when there are large numbers of components in the frequency profile distributions. Because large sample theory does not necessarily imply good behavior for shorter alignments we explore the performance of these models with short alignments simulated with tree topologies that are prone to LBA artifacts. We find that over-parameterization is not a problem for complex profile mixture models even when there are many amino acid frequency vectors. In fact, simple models with few site classes behave poorly. Interestingly, we also found that misspecification of the amino acid frequency vectors does not lead to increased LBA artifacts as long as the estimated cumulative distribution function of the amino acid frequencies at sites adequately approximates the true one. In contrast, misspecification of the amino acid exchangeability rates can severely negatively affect parameter estimation. Finally, we explore the effects of including in the profile mixture model an additional \"F-class\" representing the overall frequencies of amino acids in the data set. Surprisingly, the F-class does not help parameter estimation significantly and can decrease the probability of correct tree estimation, depending on the scenario, even though it tends to improve likelihood scores.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"53-75"},"PeriodicalIF":6.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11129589/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41238682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Todd W Pierson, Kenneth H Kozak, Travis C Glenn, Benjamin M Fitzpatrick
{"title":"River Drainage Reorganization and Reticulate Evolution in the Two-Lined Salamander (Eurycea bislineata) Species Complex.","authors":"Todd W Pierson, Kenneth H Kozak, Travis C Glenn, Benjamin M Fitzpatrick","doi":"10.1093/sysbio/syad064","DOIUrl":"10.1093/sysbio/syad064","url":null,"abstract":"<p><p>The origin and eventual loss of biogeographic barriers can create alternating periods of allopatry and secondary contact, facilitating gene flow among distinct metapopulations and generating reticulate evolutionary histories that are not adequately described by a bifurcating evolutionary tree. One such example may exist in the two-lined salamander (Eurycea bislineata) species complex, where discordance among morphological and molecular datasets has created a \"vexing taxonomic challenge.\" Previous phylogeographic analyses of mitochondrial DNA (mtDNA) suggested that the reorganization of Miocene paleodrainages drove vicariance and dispersal, but the inherent limitations of a single-locus dataset precluded the evaluation of subsequent gene flow. Here, we generate triple-enzyme restriction site-associated DNA sequencing (3RAD) data for > 100 individuals representing all major mtDNA lineages and use a suite of complementary methods to demonstrate that discordance among earlier datasets is best explained by a reticulate evolutionary history influenced by river drainage reorganization. Systematics of such groups should acknowledge these complex histories and relationships that are not strictly hierarchical. [Amphibian; hybridization; introgression; Plethodontidae; stream capture.].</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"26-35"},"PeriodicalIF":6.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50163002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ammon Thompson, Benjamin J Liebeskind, Erik J Scully, Michael J Landis
{"title":"Deep Learning and Likelihood Approaches for Viral Phylogeography Converge on the Same Answers Whether the Inference Model Is Right or Wrong.","authors":"Ammon Thompson, Benjamin J Liebeskind, Erik J Scully, Michael J Landis","doi":"10.1093/sysbio/syad074","DOIUrl":"10.1093/sysbio/syad074","url":null,"abstract":"<p><p>Analysis of phylogenetic trees has become an essential tool in epidemiology. Likelihood-based methods fit models to phylogenies to draw inferences about the phylodynamics and history of viral transmission. However, these methods are often computationally expensive, which limits the complexity and realism of phylodynamic models and makes them ill-suited for informing policy decisions in real-time during rapidly developing outbreaks. Likelihood-free methods using deep learning are pushing the boundaries of inference beyond these constraints. In this paper, we extend, compare, and contrast a recently developed deep learning method for likelihood-free inference from trees. We trained multiple deep neural networks using phylogenies from simulated outbreaks that spread among 5 locations and found they achieve close to the same levels of accuracy as Bayesian inference under the true simulation model. We compared robustness to model misspecification of a trained neural network to that of a Bayesian method. We found that both models had comparable performance, converging on similar biases. We also implemented a method of uncertainty quantification called conformalized quantile regression that we demonstrate has similar patterns of sensitivity to model misspecification as Bayesian highest posterior density (HPD) and greatly overlap with HPDs, but have lower precision (more conservative). Finally, we trained and tested a neural network against phylogeographic data from a recent study of the SARS-Cov-2 pandemic in Europe and obtained similar estimates of region-specific epidemiological parameters and the location of the common ancestor in Europe. Along with being as accurate and robust as likelihood-based methods, our trained neural networks are on average over 3 orders of magnitude faster after training. Our results support the notion that neural networks can be trained with simulated data to accurately mimic the good and bad statistical properties of the likelihood functions of generative phylogenetic models.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"183-206"},"PeriodicalIF":6.1,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11249978/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139378301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
James Rosindell, Kerry Manson, Rikki Gumbs, William D Pearse, Mike Steel
{"title":"Phylogenetic Biodiversity Metrics Should Account for Both Accumulation and Attrition of Evolutionary Heritage.","authors":"James Rosindell, Kerry Manson, Rikki Gumbs, William D Pearse, Mike Steel","doi":"10.1093/sysbio/syad072","DOIUrl":"10.1093/sysbio/syad072","url":null,"abstract":"<p><p>Phylogenetic metrics are essential tools used in the study of ecology, evolution and conservation. Phylogenetic diversity (PD) in particular is one of the most prominent measures of biodiversity and is based on the idea that biological features accumulate along the edges of phylogenetic trees that are summed. We argue that PD and many other phylogenetic biodiversity metrics fail to capture an essential process that we term attrition. Attrition is the gradual loss of features through causes other than extinction. Here we introduce \"EvoHeritage\", a generalization of PD that is founded on the joint processes of accumulation and attrition of features. We argue that while PD measures evolutionary history, EvoHeritage is required to capture a more pertinent subset of evolutionary history including only components that have survived attrition. We show that EvoHeritage is not the same as PD on a tree with scaled edges; instead, accumulation and attrition interact in a more complex non-monophyletic way that cannot be captured by edge lengths alone. This leads us to speculate that the one-dimensional edge lengths of classic trees may be insufficiently flexible to capture the nuances of evolutionary processes. We derive a measure of EvoHeritage and show that it elegantly reproduces species richness and PD at opposite ends of a continuum based on the intensity of attrition. We demonstrate the utility of EvoHeritage in ecology as a predictor of community productivity compared with species richness and PD. We also show how EvoHeritage can quantify living fossils and resolve their associated controversy. We suggest how the existing calculus of PD-based metrics and other phylogenetic biodiversity metrics can and should be recast in terms of EvoHeritage accumulation and attrition.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"158-182"},"PeriodicalIF":6.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11129585/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138810808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lukas J Musher, Glaucia Del-Rio, Rafael S Marcondes, Robb T Brumfield, Gustavo A Bravo, Gregory Thom
{"title":"Geogenomic Predictors of Genetree Heterogeneity Explain Phylogeographic and Introgression History: A Case Study in an Amazonian Bird (Thamnophilus aethiops).","authors":"Lukas J Musher, Glaucia Del-Rio, Rafael S Marcondes, Robb T Brumfield, Gustavo A Bravo, Gregory Thom","doi":"10.1093/sysbio/syad061","DOIUrl":"10.1093/sysbio/syad061","url":null,"abstract":"<p><p>Can knowledge about genome architecture inform biogeographic and phylogenetic inference? Selection, drift, recombination, and gene flow interact to produce a genomic landscape of divergence wherein patterns of differentiation and genealogy vary nonrandomly across the genomes of diverging populations. For instance, genealogical patterns that arise due to gene flow should be more likely to occur on smaller chromosomes, which experience high recombination, whereas those tracking histories of geographic isolation (reduced gene flow caused by a barrier) and divergence should be more likely to occur on larger and sex chromosomes. In Amazonia, populations of many bird species diverge and introgress across rivers, resulting in reticulated genomic signals. Herein, we used reduced representation genomic data to disentangle the evolutionary history of 4 populations of an Amazonian antbird, Thamnophilus aethiops, whose biogeographic history was associated with the dynamic evolution of the Madeira River Basin. Specifically, we evaluate whether a large river capture event ca. 200 Ka, gave rise to reticulated genealogies in the genome by making spatially explicit predictions about isolation and gene flow based on knowledge about genomic processes. We first estimated chromosome-level phylogenies and recovered 2 primary topologies across the genome. The first topology (T1) was most consistent with predictions about population divergence and was recovered for the Z-chromosome. The second (T2), was consistent with predictions about gene flow upon secondary contact. To evaluate support for these topologies, we trained a convolutional neural network to classify our data into alternative diversification models and estimate demographic parameters. The best-fit model was concordant with T1 and included gene flow between non-sister taxa. Finally, we modeled levels of divergence and introgression as functions of chromosome length and found that smaller chromosomes experienced higher gene flow. Given that (1) genetrees supporting T2 were more likely to occur on smaller chromosomes and (2) we found lower levels of introgression on larger chromosomes (and especially the Z-chromosome), we argue that T1 represents the history of population divergence across rivers and T2 the history of secondary contact due to barrier loss. Our results suggest that a significant portion of genomic heterogeneity arises due to extrinsic biogeographic processes such as river capture interacting with intrinsic processes associated with genome architecture. Future phylogeographic studies would benefit from accounting for genomic processes, as different parts of the genome reveal contrasting, albeit complementary histories, all of which are relevant for disentangling the intricate geogenomic mechanisms of biotic diversification. [Amazonia; biogeography; demographic modeling; gene flow; gene tree; genome architecture; geogenomics; introgression; linked selection; neural network; phylogenomic; p","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"36-52"},"PeriodicalIF":6.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41149145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Richard Adams, Zoe Cain, Raquel Assis, Michael DeGiorgio
{"title":"Robust Phylogenetic Regression.","authors":"Richard Adams, Zoe Cain, Raquel Assis, Michael DeGiorgio","doi":"10.1093/sysbio/syad070","DOIUrl":"10.1093/sysbio/syad070","url":null,"abstract":"<p><p>Modern comparative biology owes much to phylogenetic regression. At its conception, this technique sparked a revolution that armed biologists with phylogenetic comparative methods (PCMs) for disentangling evolutionary correlations from those arising from hierarchical phylogenetic relationships. Over the past few decades, the phylogenetic regression framework has become a paradigm of modern comparative biology that has been widely embraced as a remedy for shared ancestry. However, recent evidence has shown doubt over the efficacy of phylogenetic regression, and PCMs more generally, with the suggestion that many of these methods fail to provide an adequate defense against unreplicated evolution-the primary justification for using them in the first place. Importantly, some of the most compelling examples of biological innovation in nature result from abrupt lineage-specific evolutionary shifts, which current regression models are largely ill equipped to deal with. Here we explore a solution to this problem by applying robust linear regression to comparative trait data. We formally introduce robust phylogenetic regression to the PCM toolkit with linear estimators that are less sensitive to model violations than the standard least-squares estimator, while still retaining high power to detect true trait associations. Our analyses also highlight an ingenuity of the original algorithm for phylogenetic regression based on independent contrasts, whereby robust estimators are particularly effective. Collectively, we find that robust estimators hold promise for improving tests of trait associations and offer a path forward in scenarios where classical approaches may fail. Our study joins recent arguments for increased vigilance against unreplicated evolution and a better understanding of evolutionary model performance in challenging-yet biologically important-settings.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"140-157"},"PeriodicalIF":6.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11129599/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138462764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}