蚁群低覆盖全基因组测序的系统基因组推断管道比较

IF 4.7 1区 农林科学 Q1 ENTOMOLOGY
Junxia Zhang, Long Lin, Yannan Mu, Alan Brelsford, Jessica Purcell
{"title":"蚁群低覆盖全基因组测序的系统基因组推断管道比较","authors":"Junxia Zhang,&nbsp;Long Lin,&nbsp;Yannan Mu,&nbsp;Alan Brelsford,&nbsp;Jessica Purcell","doi":"10.1111/syen.12670","DOIUrl":null,"url":null,"abstract":"<p>A rapid proliferation in the availability of whole genome sequences (WGS), often with relatively low read depth, offers an unprecedented opportunity for phylogenomic advances using publicly available data, but there are several key challenges in applying these data. Using low-coverage WGS data for the ant species of <i>Formica</i>, we conducted detailed comparisons on two different analytical pipelines (reference-based vs. de novo genome assembly), four types of datasets (5-kbp-window, ultra-conserved element [UCE], single-copy ortholog [BUSCO] and mitogenome), and a series of analytical procedures (e.g. concatenation vs. coalescent analyses) to identify which are robust to typical WGS data. The results show that at a shallow scale of phylogenetic relationships of closely related species 5-kbp-windows from the reference-based pipeline and UCEs from the de novo assemblies are more successful than the BUSCOs in recovering informative markers for phylogenetic inference. Compared with concatenation analyses, coalescent analyses often resulted in disparate deeper relationships in the phylogeny. This study also uncovers evident mito-nuclear discordance and demonstrates genome-wide gene conflicts in phylogenetic signals, both pointing to possible incomplete lineage sorting and/or hybridization during the early, rapid radiation of <i>Formica</i> ants. Divergence dating analyses show that different types of data and analytical methods could result in inconsistent time estimates, highlighting the potential need for multiple approaches to better understand species divergence. The strengths and weaknesses of different analytical pipelines and strategies are discussed. Findings from this study provide valuable insights for large-scale phylogenomic projects using WGS data.</p>","PeriodicalId":22126,"journal":{"name":"Systematic Entomology","volume":"50 3","pages":"611-629"},"PeriodicalIF":4.7000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comparison of phylogenomic inference pipelines for low-coverage whole-genome sequencing in Formica ants\",\"authors\":\"Junxia Zhang,&nbsp;Long Lin,&nbsp;Yannan Mu,&nbsp;Alan Brelsford,&nbsp;Jessica Purcell\",\"doi\":\"10.1111/syen.12670\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>A rapid proliferation in the availability of whole genome sequences (WGS), often with relatively low read depth, offers an unprecedented opportunity for phylogenomic advances using publicly available data, but there are several key challenges in applying these data. Using low-coverage WGS data for the ant species of <i>Formica</i>, we conducted detailed comparisons on two different analytical pipelines (reference-based vs. de novo genome assembly), four types of datasets (5-kbp-window, ultra-conserved element [UCE], single-copy ortholog [BUSCO] and mitogenome), and a series of analytical procedures (e.g. concatenation vs. coalescent analyses) to identify which are robust to typical WGS data. The results show that at a shallow scale of phylogenetic relationships of closely related species 5-kbp-windows from the reference-based pipeline and UCEs from the de novo assemblies are more successful than the BUSCOs in recovering informative markers for phylogenetic inference. Compared with concatenation analyses, coalescent analyses often resulted in disparate deeper relationships in the phylogeny. This study also uncovers evident mito-nuclear discordance and demonstrates genome-wide gene conflicts in phylogenetic signals, both pointing to possible incomplete lineage sorting and/or hybridization during the early, rapid radiation of <i>Formica</i> ants. Divergence dating analyses show that different types of data and analytical methods could result in inconsistent time estimates, highlighting the potential need for multiple approaches to better understand species divergence. The strengths and weaknesses of different analytical pipelines and strategies are discussed. Findings from this study provide valuable insights for large-scale phylogenomic projects using WGS data.</p>\",\"PeriodicalId\":22126,\"journal\":{\"name\":\"Systematic Entomology\",\"volume\":\"50 3\",\"pages\":\"611-629\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Systematic Entomology\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/syen.12670\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENTOMOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Systematic Entomology","FirstCategoryId":"97","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/syen.12670","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENTOMOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

全基因组序列(WGS)的可用性迅速增加,通常具有相对较低的读取深度,为利用公开可用的数据进行系统基因组学研究提供了前所未有的机会,但在应用这些数据时存在几个关键挑战。利用Formica蚁种的低覆盖率WGS数据,我们对两种不同的分析管道(基于参考的基因组组装与从头组装)、四种类型的数据集(5kbp -window、超保守元件(UCE)、单拷贝同源物(BUSCO)和有丝分裂基因组)以及一系列分析方法(例如串联分析与聚结分析)进行了详细的比较,以确定哪些对典型的WGS数据具有鲁棒性。结果表明,在近缘物种系统发育关系的浅层尺度上,基于参考管道的5-kbp窗口和来自从头组装的UCEs在恢复系统发育推断的信息标记方面比busco更成功。与串联分析相比,聚结分析往往导致系统发育中不同的更深层次的关系。该研究还发现了明显的有丝分裂核不一致,并在系统发育信号中证明了全基因组基因冲突,这两者都指向了在Formica蚂蚁早期快速辐射期间可能不完整的谱系分类和/或杂交。差异定年分析表明,不同类型的数据和分析方法可能导致时间估计不一致,这突出了对多种方法的潜在需求,以更好地了解物种差异。讨论了不同分析管道和策略的优缺点。本研究的发现为利用WGS数据进行大规模系统基因组项目提供了有价值的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A comparison of phylogenomic inference pipelines for low-coverage whole-genome sequencing in Formica ants

A rapid proliferation in the availability of whole genome sequences (WGS), often with relatively low read depth, offers an unprecedented opportunity for phylogenomic advances using publicly available data, but there are several key challenges in applying these data. Using low-coverage WGS data for the ant species of Formica, we conducted detailed comparisons on two different analytical pipelines (reference-based vs. de novo genome assembly), four types of datasets (5-kbp-window, ultra-conserved element [UCE], single-copy ortholog [BUSCO] and mitogenome), and a series of analytical procedures (e.g. concatenation vs. coalescent analyses) to identify which are robust to typical WGS data. The results show that at a shallow scale of phylogenetic relationships of closely related species 5-kbp-windows from the reference-based pipeline and UCEs from the de novo assemblies are more successful than the BUSCOs in recovering informative markers for phylogenetic inference. Compared with concatenation analyses, coalescent analyses often resulted in disparate deeper relationships in the phylogeny. This study also uncovers evident mito-nuclear discordance and demonstrates genome-wide gene conflicts in phylogenetic signals, both pointing to possible incomplete lineage sorting and/or hybridization during the early, rapid radiation of Formica ants. Divergence dating analyses show that different types of data and analytical methods could result in inconsistent time estimates, highlighting the potential need for multiple approaches to better understand species divergence. The strengths and weaknesses of different analytical pipelines and strategies are discussed. Findings from this study provide valuable insights for large-scale phylogenomic projects using WGS data.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Systematic Entomology
Systematic Entomology 生物-进化生物学
CiteScore
10.50
自引率
8.30%
发文量
49
审稿时长
>12 weeks
期刊介绍: Systematic Entomology publishes original papers on insect systematics, phylogenetics and integrative taxonomy, with a preference for general interest papers of broad biological, evolutionary or zoogeographical relevance.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信