Guy Katriel, Udi Mahanaymi, Shelly Brezner, Noor Kezel, Christoph Koutschan, Doron Zeilberger, Mike Steel, Sagi Snir
{"title":"Gene Transfer-Based Phylogenetics: Analytical Expressions and Additivity via Birth-Death Theory.","authors":"Guy Katriel, Udi Mahanaymi, Shelly Brezner, Noor Kezel, Christoph Koutschan, Doron Zeilberger, Mike Steel, Sagi Snir","doi":"10.1093/sysbio/syad060","DOIUrl":"10.1093/sysbio/syad060","url":null,"abstract":"<p><p>The genomic era has opened up vast opportunities in molecular systematics, one of which is deciphering the evolutionary history in fine detail. Under this mass of data, analyzing the point mutations of standard markers is often too crude and slow for fine-scale phylogenetics. Nevertheless, genome dynamics (GD) events provide alternative, often richer information. The synteny index (SI) between a pair of genomes combines gene order and gene content information, allowing the comparison of genomes of unequal gene content, together with order considerations of their common genes. Recently, genome dynamics has been modeled as a continuous-time Markov process, and gene distance in the genome as a birth-death-immigration process. Nevertheless, due to complexities arising in this setting, no precise and provably consistent estimators could be derived, resulting in heuristic solutions. Here, we extend this modeling approach by using techniques from birth-death theory to derive explicit expressions of the system's probabilistic dynamics in the form of rational functions of the model parameters. This, in turn, allows us to infer analytically accurate distances between organisms based on their SI. Subsequently, we establish additivity of this estimated evolutionary distance (a desirable property yielding phylogenetic consistency). Applying the new measure in simulation studies shows that it provides accurate results in realistic settings and even under model extensions such as gene gain/loss or over a tree structure. In the real-data realm, we applied the new formulation to unique data structure that we constructed-the ordered orthology DB-based on a new version of the EggNOG database, to construct a tree with more than 4.5K taxa. To the best of our knowledge, this is the largest gene-order-based tree constructed and it overcomes shortcomings found in previous approaches. Constructing a GD-based tree allows to confirm and contrast findings based on other phylogenetic approaches, as we show.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1403-1417"},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49682532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Burma Terrane Amber Fauna Shows Connections to Gondwana and Transported Gondwanan Lineages to the Northern Hemisphere (Araneae: Palpimanoidea).","authors":"Hannah M Wood, Jörg Wunderlich","doi":"10.1093/sysbio/syad047","DOIUrl":"10.1093/sysbio/syad047","url":null,"abstract":"<p><p>Burmese amber is a significant source of fossils that documents the mid-Cretaceous biota. This deposit was formed around 99 Ma on the Burma Terrane, which broke away from Gondwana and later collided with Asia, although the timing is disputed. Palpimanoidea is a dispersal-limited group that was a dominant element of the Mesozoic spider fauna, and has an extensive fossil record, particularly from Burmese amber. Using morphological and molecular data, evolutionary relationships of living and fossil Palpimanoidea are examined. Divergence dating with fossils as terminal tips shows timing of diversification is contemporaneous with continental breakup.Ancestral range estimations show widespread ancestral ranges that divide into lineages that inherit different Pangean fragments, consistent with vicariance. Our results suggest that the Burmese amber fauna has ties to Gondwana due to a historical connection in the Early Cretaceous, and that the Burma Terrane facilitated biotic exchange by transporting lineages from Gondwana into the Holarctic in the Cretaceous.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1233-1246"},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9920283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Speciation-by-Extinction.","authors":"Glenn F Seeholzer, Robb T Brumfield","doi":"10.1093/sysbio/syad049","DOIUrl":"10.1093/sysbio/syad049","url":null,"abstract":"<p><p>Extinction is a dominant force shaping patterns of biodiversity through time; however its role as a catalyst of speciation through its interaction with intraspecific variation has been overlooked. Here, we synthesize ideas alluded to by Darwin and others into the model of \"speciation-by-extinction\" in which speciation results from the extinction of intermediate populations within a single geographically variable species. We explore the properties and distinguishing features of speciation-by-extinction with respect to other established speciation models. We demonstrate its plausibility by showing that the experimental extinction of populations within variable species can result in speciation. The prerequisites for speciation-by-extinction, geographically structured intraspecific variation and local extinction, are ubiquitous in nature. We propose that speciation-by-extinction may be a prevalent, but underappreciated, speciation mechanism.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1433-1442"},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9994984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Learning from Phylogenies for Diversification Analyses.","authors":"Sophia Lambert, Jakub Voznica, Hélène Morlon","doi":"10.1093/sysbio/syad044","DOIUrl":"10.1093/sysbio/syad044","url":null,"abstract":"<p><p>Birth-death (BD) models are widely used in combination with species phylogenies to study past diversification dynamics. Current inference approaches typically rely on likelihood-based methods. These methods are not generalizable, as a new likelihood formula must be established each time a new model is proposed; for some models, such a formula is not even tractable. Deep learning can bring solutions in such situations, as deep neural networks can be trained to learn the relation between simulations and parameter values as a regression problem. In this paper, we adapt a recently developed deep learning method from pathogen phylodynamics to the case of diversification inference, and we extend its applicability to the case of the inference of state-dependent diversification models from phylogenies associated with trait data. We demonstrate the accuracy and time efficiency of the approach for the time-constant homogeneous BD model and the Binary-State Speciation and Extinction model. Finally, we illustrate the use of the proposed inference machinery by reanalyzing a phylogeny of primates and their associated ecological role as seed dispersers. Deep learning inference provides at least the same accuracy as likelihood-based inference while being faster by several orders of magnitude, offering a promising new inference approach for the deployment of future models in the field.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1262-1279"},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10020465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Brian J Sanderson, Diksha Gambhir, Guanqiao Feng, Nan Hu, Quentin C Cronk, Diana M Percy, Francisco Molina Freaner, Matthew G Johnson, Lawrence B Smart, Ken Keefover-Ring, Tongming Yin, Tao Ma, Stephen P DiFazio, Jianquan Liu, Matthew S Olson
{"title":"Phylogenomics reveals patterns of ancient hybridization and differential diversification that contribute to phylogenetic conflict in willows, poplars, and close relatives.","authors":"Brian J Sanderson, Diksha Gambhir, Guanqiao Feng, Nan Hu, Quentin C Cronk, Diana M Percy, Francisco Molina Freaner, Matthew G Johnson, Lawrence B Smart, Ken Keefover-Ring, Tongming Yin, Tao Ma, Stephen P DiFazio, Jianquan Liu, Matthew S Olson","doi":"10.1093/sysbio/syad042","DOIUrl":"10.1093/sysbio/syad042","url":null,"abstract":"<p><p>Despite the economic, ecological, and scientific importance of the genera Salix L. (willows) and Populus L. (poplars, cottonwoods, and aspens) Salicaceae, we know little about the sources of differences in species diversity between the genera and of the phylogenetic conflict that often confounds estimating phylogenetic trees. Salix subgenera and sections, in particular, have been difficult to classify, with one recent attempt termed a \"spectacular failure\" due to a speculated radiation of the subgenera Vetrix and Chamaetia. Here, we use targeted sequence capture to understand the evolutionary history of this portion of the Salicaceae plant family. Our phylogenetic hypothesis was based on 787 gene regions and identified extensive phylogenetic conflict among genes. Our analysis supported some previously described subgeneric relationships and confirmed the polyphyly of others. Using an fbranch analysis, we identified several cases of hybridization in deep branches of the phylogeny, which likely contributed to discordance among gene trees. In addition, we identified a rapid increase in diversification rate near the origination of the Vetrix-Chamaetia clade in Salix. This region of the tree coincided with several nodes that lacked strong statistical support, indicating a possible increase in incomplete lineage sorting due to rapid diversification. The extraordinary level of both recent and ancient hybridization in both Salix and Populus have played important roles in the diversification and diversity in these two genera.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1220-1232"},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9768582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"How Important Is Budding Speciation for Comparative Studies?","authors":"Daniel S Caetano, Tiago B Quental","doi":"10.1093/sysbio/syad050","DOIUrl":"10.1093/sysbio/syad050","url":null,"abstract":"<p><p>The acknowledgment of evolutionary dependence among species has fundamentally changed how we ask biological questions. Phylogenetic models became the standard approach for studies with 3 or more lineages, in particular those using extant species. Most phylogenetic comparative methods (PCMs) translate relatedness into covariance, meaning that evolutionary changes before lineages split should be interpreted together whereas after the split lineages are expected to change independently. This clever realization has shaped decades of research. Here, we discuss one element of the comparative method often ignored or assumed as unimportant: if nodes of a phylogeny represent the dissolution of the ancestral lineage into two new ones or if the ancestral lineage can survive speciation events (i.e., budding). Budding speciation is often reported in paleontological studies, due to the nature of the evidence for budding in the fossil record, but it is surprisingly absent in comparative methods. Here, we show that many PCMs assume that divergence happens as a symmetric split, even if these methods do not explicitly mention this assumption. We discuss the properties of trait evolution models for continuous and discrete traits and their adequacy under a scenario of budding speciation. We discuss the effects of budding speciation under a series of plausible evolutionary scenarios and show when and how these can influence our estimates. We also propose that long-lived lineages that have survived through a series of budding speciation events and given birth to multiple new lineages can produce evolutionary patterns that challenge our intuition about the most parsimonious history of trait changes in a clade. We hope our discussion can help bridge comparative approaches in paleontology and neontology as well as foster awareness about the assumptions we make when we use phylogenetic trees.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1443-1453"},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10004968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Skyline Fossilized Birth-Death Model is Robust to Violations of Sampling Assumptions in Total-Evidence Dating.","authors":"Chi Zhang, Fredrik Ronquist, Tanja Stadler","doi":"10.1093/sysbio/syad054","DOIUrl":"10.1093/sysbio/syad054","url":null,"abstract":"<p><p>Several total-evidence dating studies under the fossilized birth-death (FBD) model have produced very old age estimates, which are not supported by the fossil record. This phenomenon has been termed \"deep root attraction (DRA).\" For two specific data sets, involving divergence time estimation for the early radiations of ants, bees, and wasps (Hymenoptera) and of placental mammals (Eutheria), it has been shown that the DRA effect can be greatly reduced by accommodating the fact that extant species in these trees have been sampled to maximize diversity, so-called diversified sampling. Unfortunately, current methods to accommodate diversified sampling only consider the extreme case where it is possible to identify a cut-off time such that all splits occurring before this time are represented in the sampled tree but none of the younger splits. In reality, the sampling bias is rarely this extreme and may be difficult to model properly. Similar modeling challenges apply to the sampling of the fossil record. This raises the question of whether it is possible to find dating methods that are more robust to sampling biases. Here, we show that the skyline FBD (SFBD) process, where the diversification and fossil-sampling rates can vary over time in a piecewise fashion, provides age estimates that are more robust to inadequacies in the modeling of the sampling process and less sensitive to DRA effects. In the SFBD model we consider, rates in different time intervals are either considered to be independent and identically distributed or assumed to be autocorrelated following an Ornstein-Uhlenbeck (OU) process. Through simulations and reanalyses of Hymenoptera and Eutheria data, we show that both variants of the SFBD model unify age estimates under random and diversified sampling assumptions. The SFBD model can resolve DRA by absorbing the deviations from the sampling assumptions into the inferred dynamics of the diversification process over time. Although this means that the inferred diversification dynamics must be interpreted with caution, taking sampling biases into account, we conclude that the SFBD model represents the most robust approach currently available for addressing DRA in total-evidence dating.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1316-1336"},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10040045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Evan Twomey, Paulo Melo-Sampaio, Lisa M Schulte, Franky Bossuyt, Jason L Brown, Santiago Castroviejo-Fisher
{"title":"Multiple Routes to Color Convergence in a Radiation of Neotropical Poison Frogs.","authors":"Evan Twomey, Paulo Melo-Sampaio, Lisa M Schulte, Franky Bossuyt, Jason L Brown, Santiago Castroviejo-Fisher","doi":"10.1093/sysbio/syad051","DOIUrl":"10.1093/sysbio/syad051","url":null,"abstract":"<p><p>Convergent evolution is defined as the independent evolution of similar phenotypes in different lineages. Its existence underscores the importance of external selection pressures in evolutionary history, revealing how functionally similar adaptations can evolve in response to persistent ecological challenges through a diversity of evolutionary routes. However, many examples of convergence, particularly among closely related species, involve parallel changes in the same genes or developmental pathways, raising the possibility that homology at deeper mechanistic levels is an important facilitator of phenotypic convergence. Using the genus Ranitomeya, a young, color-diverse radiation of Neotropical poison frogs, we set out to 1) provide a phylogenetic framework for this group, 2) leverage this framework to determine if color phenotypes are convergent, and 3) to characterize the underlying coloration mechanisms to test whether color convergence occurred through the same or different physical mechanisms. We generated a phylogeny for Ranitomeya using ultraconserved elements and investigated the physical mechanisms underlying bright coloration, focusing on skin pigments. Using phylogenetic comparative methods, we identified several instances of color convergence, involving several gains and losses of carotenoid and pterin pigments. We also found a compelling example of nonparallel convergence, where, in one lineage, red coloration evolved through the red pterin pigment drosopterin, and in another lineage through red ketocarotenoids. Additionally, in another lineage, \"reddish\" coloration evolved predominantly through structural color mechanisms. Our study demonstrates that, even within a radiation of closely related species, convergent evolution can occur through both parallel and nonparallel mechanisms, challenging the assumption that similar phenotypes among close relatives evolve through the same mechanisms.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1247-1261"},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10924724/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10021110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Cauchy Process on Phylogenies: A Tractable Model for Pulsed Evolution.","authors":"Paul Bastide, Gilles Didier","doi":"10.1093/sysbio/syad053","DOIUrl":"10.1093/sysbio/syad053","url":null,"abstract":"<p><p>Phylogenetic comparative methods use random processes, such as the Brownian Motion, to model the evolution of continuous traits on phylogenetic trees. Growing evidence for non-gradual evolution motivated the development of complex models, often based on Lévy processes. However, their statistical inference is computationally intensive and currently relies on approximations, high-dimensional sampling, or numerical integration. We consider here the Cauchy Process (CP), a particular pure-jump Lévy process in which the trait increment along each branch follows a centered Cauchy distribution with a dispersion proportional to its length. In this work, we derive an exact algorithm to compute both the joint probability density of the tip trait values of a phylogeny under a CP and the ancestral trait values and branch increments posterior densities in quadratic time. A simulation study shows that the CP generates patterns in comparative data that are distinct from any Gaussian process, and that restricted maximum likelihood parameter estimates and root trait reconstruction are unbiased and accurate for trees with 200 tips or less. The CP has only two parameters but is rich enough to capture complex-pulsed evolution. It can reconstruct posterior ancestral trait distributions that are multimodal, reflecting the uncertainty associated with the inference of the evolutionary history of a trait from extant taxa only. Applied on empirical datasets taken from the Evolutionary Ecology and Virology literature, the CP suggests nuanced scenarios for the body size evolution of Greater Antilles Lizards and for the geographical spread of the West Nile Virus epidemics in North America, both consistent with previous studies using more complex models. The method is efficiently implemented in C with an R interface in package cauphy, which is open source and freely available online.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1296-1315"},"PeriodicalIF":6.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10096345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robustness of Felsenstein's Versus Transfer Bootstrap Supports With Respect to Taxon Sampling.","authors":"Paul Zaharias, Frédéric Lemoine, Olivier Gascuel","doi":"10.1093/sysbio/syad052","DOIUrl":"10.1093/sysbio/syad052","url":null,"abstract":"<p><p>The bootstrap method is based on resampling sequence alignments and re-estimating trees. Felsenstein's bootstrap proportions (FBP) are the most common approach to assess the reliability and robustness of sequence-based phylogenies. However, when increasing taxon sampling (i.e., the number of sequences) to hundreds or thousands of taxa, FBP tend to return low support for deep branches. The transfer bootstrap expectation (TBE) has been recently suggested as an alternative to FBP. TBE is measured using a continuous transfer index in [0,1] for each bootstrap tree, instead of the binary {0,1} index used in FBP to measure the presence/absence of the branch of interest. TBE has been shown to yield higher and more informative supports while inducing a very low number of falsely supported branches. Nonetheless, it has been argued that TBE must be used with care due to sampling issues, especially in datasets with a high number of closely related taxa. In this study, we conduct multiple experiments by varying taxon sampling and comparing FBP and TBE support values on different phylogenetic depths, using empirical datasets. Our results show that the main critique of TBE stands in extreme cases with shallow branches and highly unbalanced sampling among clades, but that TBE is still robust in most cases, while FBP is inescapably negatively impacted by high taxon sampling. We suggest guidelines and good practices in TBE (and FBP) computing and interpretation.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"1280-1295"},"PeriodicalIF":6.1,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10939309/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41110434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}