A phylogenomic study of Steganinae fruit flies (Diptera: Drosophilidae): strong gene tree heterogeneity and evidence for monophyly.

IF 3.4 Q1 Agricultural and Biological Sciences
Guilherme Rezende Dias, Eduardo Guimarães Dupim, Thyago Vanderlinde, Beatriz Mello, Antonio Bernardo Carvalho
{"title":"A phylogenomic study of Steganinae fruit flies (Diptera: Drosophilidae): strong gene tree heterogeneity and evidence for monophyly.","authors":"Guilherme Rezende Dias,&nbsp;Eduardo Guimarães Dupim,&nbsp;Thyago Vanderlinde,&nbsp;Beatriz Mello,&nbsp;Antonio Bernardo Carvalho","doi":"10.1186/s12862-020-01703-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The Drosophilidae family is traditionally divided into two subfamilies: Drosophilinae and Steganinae. This division is based on morphological characters, and the two subfamilies have been treated as monophyletic in most of the literature, but some molecular phylogenies have suggested Steganinae to be paraphyletic. To test the paraphyletic-Steganinae hypothesis, here, we used genomic sequences of eight Drosophilidae (three Steganinae and five Drosophilinae) and two Ephydridae (outgroup) species and inferred the phylogeny for the group based on a dataset of 1,028 orthologous genes present in all species (> 1,000,000 bp). This dataset includes three genera that broke the monophyly of the subfamilies in previous works. To investigate possible biases introduced by small sample sizes and automatic gene annotation, we used the same methods to infer species trees from a set of 10 manually annotated genes that are commonly used in phylogenetics.</p><p><strong>Results: </strong>Most of the 1,028 gene trees depicted Steganinae as paraphyletic with distinct topologies, but the most common topology depicted it as monophyletic (43.7% of the gene trees). Despite the high levels of gene tree heterogeneity observed, species tree inference in ASTRAL, in PhyloNet, and with the concatenation approach strongly supported the monophyly of both subfamilies for the 1,028-gene dataset. However, when using the concatenation approach to infer a species tree from the smaller set of 10 genes, we recovered Steganinae as a paraphyletic group. The pattern of gene tree heterogeneity was asymmetrical and thus could not be explained solely by incomplete lineage sorting (ILS).</p><p><strong>Conclusions: </strong>Steganinae was clearly a monophyletic group in the dataset that we analyzed. In addition to ILS, gene tree discordance was possibly the result of introgression, suggesting complex branching processes during the early evolution of Drosophilidae with short speciation intervals and gene flow. Our study highlights the importance of genomic data in elucidating contentious phylogenetic relationships and suggests that phylogenetic inference for drosophilids based on small molecular datasets should be performed cautiously. Finally, we suggest an approach for the correction and cleaning of BUSCO-derived genomic datasets that will be useful to other researchers planning to use this tool for phylogenomic studies.</p>","PeriodicalId":9111,"journal":{"name":"BMC Evolutionary Biology","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2020-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s12862-020-01703-7","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Evolutionary Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s12862-020-01703-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 6

Abstract

Background: The Drosophilidae family is traditionally divided into two subfamilies: Drosophilinae and Steganinae. This division is based on morphological characters, and the two subfamilies have been treated as monophyletic in most of the literature, but some molecular phylogenies have suggested Steganinae to be paraphyletic. To test the paraphyletic-Steganinae hypothesis, here, we used genomic sequences of eight Drosophilidae (three Steganinae and five Drosophilinae) and two Ephydridae (outgroup) species and inferred the phylogeny for the group based on a dataset of 1,028 orthologous genes present in all species (> 1,000,000 bp). This dataset includes three genera that broke the monophyly of the subfamilies in previous works. To investigate possible biases introduced by small sample sizes and automatic gene annotation, we used the same methods to infer species trees from a set of 10 manually annotated genes that are commonly used in phylogenetics.

Results: Most of the 1,028 gene trees depicted Steganinae as paraphyletic with distinct topologies, but the most common topology depicted it as monophyletic (43.7% of the gene trees). Despite the high levels of gene tree heterogeneity observed, species tree inference in ASTRAL, in PhyloNet, and with the concatenation approach strongly supported the monophyly of both subfamilies for the 1,028-gene dataset. However, when using the concatenation approach to infer a species tree from the smaller set of 10 genes, we recovered Steganinae as a paraphyletic group. The pattern of gene tree heterogeneity was asymmetrical and thus could not be explained solely by incomplete lineage sorting (ILS).

Conclusions: Steganinae was clearly a monophyletic group in the dataset that we analyzed. In addition to ILS, gene tree discordance was possibly the result of introgression, suggesting complex branching processes during the early evolution of Drosophilidae with short speciation intervals and gene flow. Our study highlights the importance of genomic data in elucidating contentious phylogenetic relationships and suggests that phylogenetic inference for drosophilids based on small molecular datasets should be performed cautiously. Finally, we suggest an approach for the correction and cleaning of BUSCO-derived genomic datasets that will be useful to other researchers planning to use this tool for phylogenomic studies.

Abstract Image

Abstract Image

Abstract Image

Steganinae果蝇(双翅目:果蝇科)的系统基因组研究:强大的基因树异质性和单系性证据。
背景:果蝇科传统上分为两个亚科:果蝇科和隐蝇科。这种划分是基于形态特征,在大多数文献中,这两个亚科被认为是单系的,但一些分子系统发育表明Steganinae是副系的。为了验证这一假说,我们使用了8个果蝇科(3个剑蝇科和5个果蝇科)和2个剑蝇科(外群)物种的基因组序列,并基于所有物种(> 1,000,000 bp)中存在的1,028个同源基因数据集推断了该群体的系统发育。该数据集包括三个属,打破了以往工作中亚科的单一性。为了研究小样本量和自动基因注释可能带来的偏差,我们使用相同的方法从一组10个系统发育中常用的人工注释基因中推断物种树。结果:在1028个基因树中,大多数将Steganinae描述为具有不同拓扑结构的副葡萄系,但最常见的拓扑结构将其描述为单系(43.7%)。尽管观察到高水平的基因树异质性,但ASTRAL、PhyloNet和连接方法中的物种树推断强烈支持1,028个基因数据集中两个亚科的单系性。然而,当使用串联方法从较小的10个基因集推断物种树时,我们恢复了Steganinae作为一个副类群。基因树异质性的模式是不对称的,因此不能完全用不完全谱系分类(ILS)来解释。结论:在我们分析的数据集中,Steganinae显然是一个单系类群。除ILS外,基因树不一致可能是基因渗入的结果,表明果蝇在早期进化过程中存在复杂的分支过程,物种形成间隔短,基因流动频繁。我们的研究强调了基因组数据在阐明有争议的系统发育关系中的重要性,并表明基于小分子数据集的果蝇系统发育推断应谨慎进行。最后,我们提出了一种校正和清理busco衍生基因组数据集的方法,这将对其他计划使用该工具进行系统基因组学研究的研究人员有用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Evolutionary Biology
BMC Evolutionary Biology 生物-进化生物学
CiteScore
5.80
自引率
0.00%
发文量
0
审稿时长
6 months
期刊介绍: BMC Evolutionary Biology is an open access, peer-reviewed journal that considers articles on all aspects of molecular and non-molecular evolution of all organisms, as well as phylogenetics and palaeontology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信