Hijacking a rapid and scalable metagenomic method reveals subgenome dynamics and evolution in polyploid plants

IF 2.7 3区 生物学 Q2 PLANT SCIENCES
Gillian Reynolds, Brendan Mumey, Veronika Strnadova-Neeley, Jennifer Lachowiec
{"title":"Hijacking a rapid and scalable metagenomic method reveals subgenome dynamics and evolution in polyploid plants","authors":"Gillian Reynolds,&nbsp;Brendan Mumey,&nbsp;Veronika Strnadova-Neeley,&nbsp;Jennifer Lachowiec","doi":"10.1002/aps3.11581","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Premise</h3>\n \n <p>The genomes of polyploid plants archive the evolutionary events leading to their present forms. However, plant polyploid genomes present numerous hurdles to the genome comparison algorithms for classification of polyploid types and exploring genome dynamics.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Here, the problem of intra- and inter-genome comparison for examining polyploid genomes is reframed as a metagenomic problem, enabling the use of the rapid and scalable MinHashing approach. To determine how types of polyploidy are described by this metagenomic approach, plant genomes were examined from across the polyploid spectrum for both <i>k</i>-mer composition and frequency with a range of <i>k</i>-mer sizes. In this approach, no subgenome-specific <i>k</i>-mers are identified; rather, whole-chromosome <i>k</i>-mer subspaces were utilized.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Given chromosome-scale genome assemblies with sufficient subgenome-specific repetitive element content, literature-verified subgenomic and genomic evolutionary relationships were revealed, including distinguishing auto- from allopolyploidy and putative progenitor genome assignment. The sequences responsible were the rapidly evolving landscape of transposable elements. An investigation into the MinHashing parameters revealed that the downsampled <i>k</i>-mer space (genomic signatures) produced excellent approximations of sequence similarity. Furthermore, the clustering approach used for comparison of the genomic signatures is scrutinized to ensure applicability of the metagenomics-based method.</p>\n </section>\n \n <section>\n \n <h3> Discussion</h3>\n \n <p>The easily implementable and highly computationally efficient MinHashing-based sequence comparison strategy enables comparative subgenomics and genomics for large and complex polyploid plant genomes. Such comparisons provide evidence for polyploidy-type subgenomic assignments. In cases where subgenome-specific repeat signal may not be adequate given a chromosomes' global <i>k</i>-mer profile, alternative methods that are more specific but more computationally complex outperform this approach.</p>\n </section>\n </div>","PeriodicalId":8022,"journal":{"name":"Applications in Plant Sciences","volume":null,"pages":null},"PeriodicalIF":2.7000,"publicationDate":"2024-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aps3.11581","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applications in Plant Sciences","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/aps3.11581","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Premise

The genomes of polyploid plants archive the evolutionary events leading to their present forms. However, plant polyploid genomes present numerous hurdles to the genome comparison algorithms for classification of polyploid types and exploring genome dynamics.

Methods

Here, the problem of intra- and inter-genome comparison for examining polyploid genomes is reframed as a metagenomic problem, enabling the use of the rapid and scalable MinHashing approach. To determine how types of polyploidy are described by this metagenomic approach, plant genomes were examined from across the polyploid spectrum for both k-mer composition and frequency with a range of k-mer sizes. In this approach, no subgenome-specific k-mers are identified; rather, whole-chromosome k-mer subspaces were utilized.

Results

Given chromosome-scale genome assemblies with sufficient subgenome-specific repetitive element content, literature-verified subgenomic and genomic evolutionary relationships were revealed, including distinguishing auto- from allopolyploidy and putative progenitor genome assignment. The sequences responsible were the rapidly evolving landscape of transposable elements. An investigation into the MinHashing parameters revealed that the downsampled k-mer space (genomic signatures) produced excellent approximations of sequence similarity. Furthermore, the clustering approach used for comparison of the genomic signatures is scrutinized to ensure applicability of the metagenomics-based method.

Discussion

The easily implementable and highly computationally efficient MinHashing-based sequence comparison strategy enables comparative subgenomics and genomics for large and complex polyploid plant genomes. Such comparisons provide evidence for polyploidy-type subgenomic assignments. In cases where subgenome-specific repeat signal may not be adequate given a chromosomes' global k-mer profile, alternative methods that are more specific but more computationally complex outperform this approach.

Abstract Image

劫持快速、可扩展的元基因组方法揭示多倍体植物的亚基因组动态与进化
前提多倍体植物的基因组记录了导致其目前形态的进化事件。然而,植物多倍体基因组给分类多倍体类型和探索基因组动态的基因组比较算法带来了许多障碍。方法在这里,研究多倍体基因组的基因组内和基因组间比较问题被重新定义为元基因组问题,从而可以使用快速、可扩展的 MinHashing 方法。为了确定这种元基因组学方法是如何描述多倍体类型的,我们利用一系列 k-聚合大小对整个多倍体谱系的植物基因组进行了 k-聚合组成和频率的研究。结果鉴于染色体规模的基因组集合具有足够的亚基因组特异性重复元件含量,经文献验证的亚基因组和基因组进化关系得以揭示,包括区分自多倍体和异源多倍体以及推测的祖先基因组分配。这些序列是快速进化的转座元件。对 MinHashing 参数的研究表明,下采样 k-mer 空间(基因组特征)产生了极好的序列相似性近似值。此外,为确保基于元基因组学的方法的适用性,还对用于比较基因组特征的聚类方法进行了仔细研究。 讨论基于 MinHashing 的序列比较策略易于实施,计算效率高,可用于大型复杂多倍体植物基因组的亚基因组学和基因组学比较。这种比较为多倍体类型的亚基因组分配提供了证据。在染色体的全局 k-mer 轮廓下,亚基因组特异性重复信号可能不够充分的情况下,特异性更强但计算更复杂的替代方法优于这种方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.30
自引率
0.00%
发文量
50
审稿时长
12 weeks
期刊介绍: Applications in Plant Sciences (APPS) is a monthly, peer-reviewed, open access journal promoting the rapid dissemination of newly developed, innovative tools and protocols in all areas of the plant sciences, including genetics, structure, function, development, evolution, systematics, and ecology. Given the rapid progress today in technology and its application in the plant sciences, the goal of APPS is to foster communication within the plant science community to advance scientific research. APPS is a publication of the Botanical Society of America, originating in 2009 as the American Journal of Botany''s online-only section, AJB Primer Notes & Protocols in the Plant Sciences. APPS publishes the following types of articles: (1) Protocol Notes describe new methods and technological advancements; (2) Genomic Resources Articles characterize the development and demonstrate the usefulness of newly developed genomic resources, including transcriptomes; (3) Software Notes detail new software applications; (4) Application Articles illustrate the application of a new protocol, method, or software application within the context of a larger study; (5) Review Articles evaluate available techniques, methods, or protocols; (6) Primer Notes report novel genetic markers with evidence of wide applicability.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信