Systematic Biology最新文献

筛选
英文 中文
Inferring the Evolutionary Model of Community-Structuring Traits with Convolutional Kitchen Sinks. 用卷积厨房水槽推断群落结构特征的进化模型
IF 6.1 1区 生物学
Systematic Biology Pub Date : 2024-09-05 DOI: 10.1093/sysbio/syae026
Avery Kruger, Vaishaal Shankar, T Jonathan Davies
{"title":"Inferring the Evolutionary Model of Community-Structuring Traits with Convolutional Kitchen Sinks.","authors":"Avery Kruger, Vaishaal Shankar, T Jonathan Davies","doi":"10.1093/sysbio/syae026","DOIUrl":"10.1093/sysbio/syae026","url":null,"abstract":"<p><p>When communities are assembled through processes such as filtering or limiting similarity acting on phylogenetically conserved traits, the evolutionary signature of those traits may be reflected in patterns of community membership. We show how the model of trait evolution underlying community-structuring traits can be inferred from community membership data using both a variation of a traditional eco-phylogenetic metric-the mean pairwise phylogenetic distance (MPD) between taxa-and a recent machine learning tool, Convolutional Kitchen Sinks (CKS). Both methods perform well across a range of phylogenetically informative evolutionary models, but CKS outperforms MPD as tree size increases. We demonstrate CKS by inferring the evolutionary history of freeze tolerance in angiosperms. Our analysis is consistent with a late burst model, suggesting freeze tolerance evolved recently. We suggest that multiple data types that are ordered on phylogenies, such as trait values, species interactions, or community presence/absence, are good candidates for CKS modeling because the generative models produce structured differences between neighboring points that CKS is well-suited for. We introduce the R package kitchen to perform CKS for generic application of the technique.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"546-561"},"PeriodicalIF":6.1,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377182/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141065607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Sequential Direct and Indirect Effects of Mountain Uplift, Climatic Niche, and Floral Trait Evolution on Diversification Dynamics in an Andean Plant Clade. 山地隆起、气候生态位和花性状进化对安第斯植物支系多样化动态的直接和间接影响。
IF 6.1 1区 生物学
Systematic Biology Pub Date : 2024-09-05 DOI: 10.1093/sysbio/syae011
Agnes S Dellinger, Laura Lagomarsino, Fabián Michelangeli, Stefan Dullinger, Stacey D Smith
{"title":"The Sequential Direct and Indirect Effects of Mountain Uplift, Climatic Niche, and Floral Trait Evolution on Diversification Dynamics in an Andean Plant Clade.","authors":"Agnes S Dellinger, Laura Lagomarsino, Fabián Michelangeli, Stefan Dullinger, Stacey D Smith","doi":"10.1093/sysbio/syae011","DOIUrl":"10.1093/sysbio/syae011","url":null,"abstract":"<p><p>Why and how organismal lineages radiate is commonly studied through either assessing abiotic factors (biogeography, geomorphological processes, and climate) or biotic factors (traits and interactions). Despite increasing awareness that both abiotic and biotic processes may have important joint effects on diversification dynamics, few attempts have been made to quantify the relative importance and timing of these factors, and their potentially interlinked direct and indirect effects, on lineage diversification. We here combine assessments of historical biogeography, geomorphology, climatic niche, vegetative, and floral trait evolution to test whether these factors jointly, or in isolation, explain diversification dynamics of a Neotropical plant clade (Merianieae, Melastomataceae). After estimating ancestral areas and the changes in niche and trait disparity over time, we employ Phylogenetic Path Analyses as a synthesis tool to test eleven hypotheses on the individual direct and indirect effects of these factors on diversification rates. We find strongest support for interlinked effects of colonization of the uplifting Andes during the mid-Miocene and rapid abiotic climatic niche evolution in explaining a burst in diversification rate in Merianieae. Within Andean habitats, later increases in floral disparity allowed for the exploitation of wider pollination niches (i.e., shifts from bee to vertebrate pollinators), but did not affect diversification rates. Our approach of including both vegetative and floral trait evolution, rare in assessments of plant diversification in general, highlights that the evolution of woody habit and larger flowers preceded the colonization of the Andes, but was likely critical in enabling the rapid radiation in montane environments. Overall, and in concert with the idea that ecological opportunity is a key element of evolutionary radiations, our results suggest that a combination of rapid niche evolution and trait shifts was critical for the exploitation of newly available niche space in the Andes in the mid-Miocene. Further, our results emphasize the importance of incorporating both abiotic and biotic factors into the same analytical framework if we aim to quantify the relative and interlinked effects of these processes on diversification.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"594-612"},"PeriodicalIF":6.1,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377192/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140330144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clockor2: Inferring Global and Local Strict Molecular Clocks Using Root-to-Tip Regression. Clockor2:利用根尖回归推断全局和局部严格分子钟。
IF 6.1 1区 生物学
Systematic Biology Pub Date : 2024-09-05 DOI: 10.1093/sysbio/syae003
Leo A Featherstone, Andrew Rambaut, Sebastian Duchene, Wytamma Wirth
{"title":"Clockor2: Inferring Global and Local Strict Molecular Clocks Using Root-to-Tip Regression.","authors":"Leo A Featherstone, Andrew Rambaut, Sebastian Duchene, Wytamma Wirth","doi":"10.1093/sysbio/syae003","DOIUrl":"10.1093/sysbio/syae003","url":null,"abstract":"<p><p>Molecular sequence data from rapidly evolving organisms are often sampled at different points in time. Sampling times can then be used for molecular clock calibration. The root-to-tip (RTT) regression is an essential tool to assess the degree to which the data behave in a clock-like fashion. Here, we introduce Clockor2, a client-side web application for conducting RTT regression. Clockor2 allows users to quickly fit local and global molecular clocks, thus handling the increasing complexity of genomic datasets that sample beyond the assumption of homogeneous host populations. Clockor2 is efficient, handling trees of up to the order of 104 tips, with significant speed increases compared with other RTT regression applications. Although clockor2 is written as a web application, all data processing happens on the client-side, meaning that data never leave the user's computer. Clockor2 is freely available at https://clockor2.github.io/.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"623-628"},"PeriodicalIF":6.1,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377183/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139898272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distinguishing Cophylogenetic Signal from Phylogenetic Congruence Clarifies the Interplay Between Evolutionary History and Species Interactions. 将同源遗传信号与系统发育一致性区分开来,可以澄清进化历史与物种相互作用之间的相互作用。
IF 6.1 1区 生物学
Systematic Biology Pub Date : 2024-09-05 DOI: 10.1093/sysbio/syae013
Benoît Perez-Lamarque, Hélène Morlon
{"title":"Distinguishing Cophylogenetic Signal from Phylogenetic Congruence Clarifies the Interplay Between Evolutionary History and Species Interactions.","authors":"Benoît Perez-Lamarque, Hélène Morlon","doi":"10.1093/sysbio/syae013","DOIUrl":"10.1093/sysbio/syae013","url":null,"abstract":"<p><p>Interspecific interactions, including host-symbiont associations, can profoundly affect the evolution of the interacting species. Given the phylogenies of host and symbiont clades and knowledge of which host species interact with which symbiont, two questions are often asked: \"Do closely related hosts interact with closely related symbionts?\" and \"Do host and symbiont phylogenies mirror one another?.\" These questions are intertwined and can even collapse under specific situations, such that they are often confused one with the other. However, in most situations, a positive answer to the first question, hereafter referred to as \"cophylogenetic signal,\" does not imply a close match between the host and symbiont phylogenies. It suggests only that past evolutionary history has contributed to shaping present-day interactions, which can arise, for example, through present-day trait matching, or from a single ancient vicariance event that increases the probability that closely related species overlap geographically. A positive answer to the second, referred to as \"phylogenetic congruence,\" is more restrictive as it suggests a close match between the two phylogenies, which may happen, for example, if symbiont diversification tracks host diversification or if the diversifications of the two clades were subject to the same succession of vicariance events. Here we apply a set of methods (ParaFit, PACo, and eMPRess), whose significance is often interpreted as evidence for phylogenetic congruence, to simulations under 3 biologically realistic scenarios of trait matching, a single ancient vicariance event, and phylogenetic tracking with frequent cospeciation events. The latter is the only scenario that generates phylogenetic congruence, whereas the first 2 generate a cophylogenetic signal in the absence of phylogenetic congruence. We find that tests of global-fit methods (ParaFit and PACo) are significant under the 3 scenarios, whereas tests of event-based methods (eMPRess) are only significant under the scenario of phylogenetic tracking. Therefore, significant results from global-fit methods should be interpreted in terms of cophylogenetic signal and not phylogenetic congruence; such significant results can arise under scenarios when hosts and symbionts had independent evolutionary histories. Conversely, significant results from event-based methods suggest a strong form of dependency between hosts and symbionts evolutionary histories. Clarifying the patterns detected by different cophylogenetic methods is key to understanding how interspecific interactions shape and are shaped by evolution.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"613-622"},"PeriodicalIF":6.1,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140111462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical Heuristic Species Delimitation under the Multispecies Coalescent Model with Migration 多物种聚合模型下的分层启发式物种划分与迁移
IF 6.5 1区 生物学
Systematic Biology Pub Date : 2024-08-20 DOI: 10.1093/sysbio/syae050
Daniel Kornai, Xiyun Jiao, Jiayi Ji, Tomáš Flouri, Ziheng Yang
{"title":"Hierarchical Heuristic Species Delimitation under the Multispecies Coalescent Model with Migration","authors":"Daniel Kornai, Xiyun Jiao, Jiayi Ji, Tomáš Flouri, Ziheng Yang","doi":"10.1093/sysbio/syae050","DOIUrl":"https://doi.org/10.1093/sysbio/syae050","url":null,"abstract":"The multispecies coalescent (MSC) model accommodates genealogical fluctuations across the genome and provides a natural framework for comparative analysis of genomic sequence data from closely related species to infer the history of species divergence and gene flow. Given a set of populations, hypotheses of species delimitation (and species phylogeny) may be formulated as instances of MSC models (e.g., MSC for one species versus MSC for two species) and compared using Bayesian model selection. This approach, implemented in the program bpp, has been found to be prone to over-splitting. Alternatively heuristic criteria based on population parameters (such as popula- tion split times, population sizes, and migration rates) estimated from genomic data may be used to delimit species. Here we develop hierarchical merge and split algorithms for heuristic species delimitation based on the genealogical divergence index (𝑔𝑑𝑖) and implement them in a python pipeline called hhsd. We characterize the behavior of the 𝑔𝑑𝑖 under a few simple scenarios of gene flow. We apply the new approaches to a dataset simulated under a model of isolation by distance as well as three empirical datasets. Our tests suggest that the new approaches produced sensible results and were less prone to over-splitting. We discuss possible strategies for accommodating paraphyletic species in the hierarchical algorithm, as well as the challenges of species delimitation based on heuristic criteria.","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":"4 1","pages":""},"PeriodicalIF":6.5,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142045640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MAST: Phylogenetic Inference with Mixtures Across Sites and Trees. MAST:利用跨位点和树的混合物进行系统发育推断。
IF 6.1 1区 生物学
Systematic Biology Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syae008
Thomas K F Wong, Caitlin Cherryh, Allen G Rodrigo, Matthew W Hahn, Bui Quang Minh, Robert Lanfear
{"title":"MAST: Phylogenetic Inference with Mixtures Across Sites and Trees.","authors":"Thomas K F Wong, Caitlin Cherryh, Allen G Rodrigo, Matthew W Hahn, Bui Quang Minh, Robert Lanfear","doi":"10.1093/sysbio/syae008","DOIUrl":"10.1093/sysbio/syae008","url":null,"abstract":"<p><p>Hundreds or thousands of loci are now routinely used in modern phylogenomic studies. Concatenation approaches to tree inference assume that there is a single topology for the entire dataset, but different loci may have different evolutionary histories due to incomplete lineage sorting (ILS), introgression, and/or horizontal gene transfer; even single loci may not be treelike due to recombination. To overcome this shortcoming, we introduce an implementation of a multi-tree mixture model that we call mixtures across sites and trees (MAST). This model extends a prior implementation by Boussau et al. (2009) by allowing users to estimate the weight of each of a set of pre-specified bifurcating trees in a single alignment. The MAST model allows each tree to have its own weight, topology, branch lengths, substitution model, nucleotide or amino acid frequencies, and model of rate heterogeneity across sites. We implemented the MAST model in a maximum-likelihood framework in the popular phylogenetic software, IQ-TREE. Simulations show that we can accurately recover the true model parameters, including branch lengths and tree weights for a given set of tree topologies, under a wide range of biologically realistic scenarios. We also show that we can use standard statistical inference approaches to reject a single-tree model when data are simulated under multiple trees (and vice versa). We applied the MAST model to multiple primate datasets and found that it can recover the signal of ILS in the Great Apes, as well as the asymmetry in minor trees caused by introgression among several macaque species. When applied to a dataset of 4 Platyrrhine species for which standard concatenated maximum likelihood (ML) and gene tree approaches disagree, we observe that MAST gives the highest weight (i.e., the largest proportion of sites) to the tree also supported by gene tree approaches. These results suggest that the MAST model is able to analyze a concatenated alignment using ML while avoiding some of the biases that come with assuming there is only a single tree. We discuss how the MAST model can be extended in the future.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"375-391"},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11282360/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139991260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Effect of Copy Number Hemiplasy on Gene Family Evolution. 拷贝数半重复对基因家族进化的影响
IF 6.1 1区 生物学
Systematic Biology Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syae007
Qiuyi Li, Yao-Ban Chan, Nicolas Galtier, Celine Scornavacca
{"title":"The Effect of Copy Number Hemiplasy on Gene Family Evolution.","authors":"Qiuyi Li, Yao-Ban Chan, Nicolas Galtier, Celine Scornavacca","doi":"10.1093/sysbio/syae007","DOIUrl":"10.1093/sysbio/syae007","url":null,"abstract":"<p><p>The evolution of gene families is complex, involving gene-level evolutionary events such as gene duplication, horizontal gene transfer, and gene loss, and other processes such as incomplete lineage sorting (ILS). Because of this, topological differences often exist between gene trees and species trees. A number of models have been recently developed to explain these discrepancies, the most realistic of which attempts to consider both gene-level events and ILS. When unified in a single model, the interaction between ILS and gene-level events can cause polymorphism in gene copy number, which we refer to as copy number hemiplasy (CNH). In this paper, we extend the Wright-Fisher process to include duplications and losses over several species, and show that the probability of CNH for this process can be significant. We study how well two unified models-multilocus multispecies coalescent (MLMSC), which models CNH, and duplication, loss, and coalescence (DLCoal), which does not-approximate the Wright-Fisher process with duplication and loss. We then study the effect of CNH on gene family evolution by comparing MLMSC and DLCoal. We generate comparable gene trees under both models, showing significant differences in various summary statistics; most importantly, CNH reduces the number of gene copies greatly. If this is not taken into account, the traditional method of estimating duplication rates (by counting the number of gene copies) becomes inaccurate. The simulated gene trees are also used for species tree inference with the summary methods ASTRAL and ASTRAL-Pro, demonstrating that their accuracy, based on CNH-unaware simulations calibrated on real data, may have been overestimated.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"355-374"},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139707930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Considering Decoupled Phenotypic Diversification Between Ontogenetic Phases in Macroevolution: An Example Using Triggerfishes (Balistidae). 考虑宏观进化中本体发育阶段之间的脱钩表型多样化:以触发鱼(Balistidae)为例。
IF 6.1 1区 生物学
Systematic Biology Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syae014
Alex Dornburg, Katerina L Zapfe, Rachel Williams, Michael E Alfaro, Richard Morris, Haruka Adachi, Joseph Flores, Francesco Santini, Thomas J Near, Bruno Frédérich
{"title":"Considering Decoupled Phenotypic Diversification Between Ontogenetic Phases in Macroevolution: An Example Using Triggerfishes (Balistidae).","authors":"Alex Dornburg, Katerina L Zapfe, Rachel Williams, Michael E Alfaro, Richard Morris, Haruka Adachi, Joseph Flores, Francesco Santini, Thomas J Near, Bruno Frédérich","doi":"10.1093/sysbio/syae014","DOIUrl":"10.1093/sysbio/syae014","url":null,"abstract":"<p><p>Across the Tree of Life, most studies of phenotypic disparity and diversification have been restricted to adult organisms. However, many lineages have distinct ontogenetic phases that differ from their adult forms in morphology and ecology. Focusing disproportionately on the evolution of adult forms unnecessarily hinders our understanding of the pressures shaping evolution over time. Non-adult disparity patterns are particularly important to consider for coastal ray-finned fishes, which can have juvenile phases with distinct phenotypes. These juvenile forms are often associated with sheltered nursery environments, with phenotypic shifts between adults and juvenile stages that are readily apparent in locomotor morphology. Whether this ontogenetic variation in locomotor morphology reflects a decoupling of diversification dynamics between life stages remains unknown. Here we investigate the evolutionary dynamics of locomotor morphology between adult and juvenile triggerfishes. We integrate a time-calibrated phylogenetic framework with geometric morphometric approaches and measurement data of fin aspect ratio and incidence, and reveal a mismatch between morphospace occupancy, the evolution of morphological disparity, and the tempo of trait evolution between life stages. Collectively, our results illuminate how the heterogeneity of morpho-functional adaptations can decouple the mode and tempo of morphological diversification between ontogenetic stages.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"434-454"},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140137307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving the gold standard in NCBI GenBank and related databases: DNA sequences from type specimens and type strains. 模式标本和类型菌株的DNA序列——如何增加它们的数量并改进它们在NCBI GenBank和相关数据库中的注释。
IF 6.1 1区 生物学
Systematic Biology Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syad068
Susanne S Renner, Mark D Scherz, Conrad L Schoch, Marc Gottschling, Miguel Vences
{"title":"Improving the gold standard in NCBI GenBank and related databases: DNA sequences from type specimens and type strains.","authors":"Susanne S Renner, Mark D Scherz, Conrad L Schoch, Marc Gottschling, Miguel Vences","doi":"10.1093/sysbio/syad068","DOIUrl":"10.1093/sysbio/syad068","url":null,"abstract":"<p><p>Scientific names permit humans and search engines to access knowledge about the biodiversity that surrounds us, and names linked to DNA sequences are playing an ever-greater role in search-and-match identification procedures. Here, we analyze how users and curators of the National Center for Biotechnology Information (NCBI) are flagging and curating sequences derived from nomenclatural type material, which is the only way to improve the quality of DNA-based identification in the long run. For prokaryotes, 18,281 genome assemblies from type strains have been curated by NCBI staff and improve the quality of prokaryote naming. For Fungi, type-derived sequences representing over 21,000 species are now essential for fungus naming and identification. For the remaining eukaryotes, however, the numbers of sequences identifiable as type-derived are minuscule, representing only 739 species of arthropods, 1542 vertebrates, and 125 embryophytes. An increase in the production and curation of such sequences will come from (i) sequencing of types or topotypic specimens in museum collections, (ii) the March 2023 rule changes at the International Nucleotide Sequence Database Collaboration requiring more metadata for specimens, and (iii) efforts by data submitters to facilitate curation, including informing NCBI curators about a specimen's type status. We illustrate different type-data submission journeys and provide best-practice examples from a range of organisms. Expanding the number of type-derived sequences in DNA databases, especially of eukaryotes, is crucial for capturing, documenting, and protecting biodiversity.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"486-494"},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11502950/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92156794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Phylogenetic Analysis on Multi-Core Compute Architectures: Implementation and Evaluation of BEAGLE in RevBayes With MPI. 多核计算架构上的贝叶斯系统发育分析:使用 MPI 实现和评估 RevBayes 中的 BEAGLE。
IF 6.1 1区 生物学
Systematic Biology Pub Date : 2024-07-27 DOI: 10.1093/sysbio/syae005
Killian Smith, Daniel Ayres, René Neumaier, Gert Wörheide, Sebastian Höhna
{"title":"Bayesian Phylogenetic Analysis on Multi-Core Compute Architectures: Implementation and Evaluation of BEAGLE in RevBayes With MPI.","authors":"Killian Smith, Daniel Ayres, René Neumaier, Gert Wörheide, Sebastian Höhna","doi":"10.1093/sysbio/syae005","DOIUrl":"10.1093/sysbio/syae005","url":null,"abstract":"<p><p>Phylogenies are central to many research areas in biology and commonly estimated using likelihood-based methods. Unfortunately, any likelihood-based method, including Bayesian inference, can be restrictively slow for large datasets-with many taxa and/or many sites in the sequence alignment-or complex substitutions models. The primary limiting factor when using large datasets and/or complex models in probabilistic phylogenetic analyses is the likelihood calculation, which dominates the total computation time. To address this bottleneck, we incorporated the high-performance phylogenetic library BEAGLE into RevBayes, which enables multi-threading on multi-core CPUs and GPUs, as well as hardware specific vectorized instructions for faster likelihood calculations. Our new implementation of RevBayes+BEAGLE retains the flexibility and dynamic nature that users expect from vanilla RevBayes. In addition, we implemented native parallelization within RevBayes without an external library using the message passing interface (MPI); RevBayes+MPI. We evaluated our new implementation of RevBayes+BEAGLE using multi-threading on CPUs and 2 different powerful GPUs (NVidia Titan V and NVIDIA A100) against our native implementation of RevBayes+MPI. We found good improvements in speedup when multiple cores were used, with up to 20-fold speedup when using multiple CPU cores and over 90-fold speedup when using multiple GPU cores. The improvement depended on the data type used, DNA or amino acids, and the size of the alignment, but less on the size of the tree. We additionally investigated the cost of rescaling partial likelihoods to avoid numerical underflow and showed that unnecessarily frequent and inefficient rescaling can increase runtimes up to 4-fold. Finally, we presented and compared a new approach to store partial likelihoods on branches instead of nodes that can speed up computations up to 1.7 times but comes at twice the memory requirements.</p>","PeriodicalId":22120,"journal":{"name":"Systematic Biology","volume":" ","pages":"455-469"},"PeriodicalIF":6.1,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139571417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信