Graphical pangenomics-enabled characterization of structural variant impact on gene expression in Brassica napus.

IF 4.4 1区 农林科学 Q1 AGRONOMY
Gözde Yildiz, Silvia F Zanini, Sven Weber, Venkataramana Kopalli, Tobias Kox, Amine Abbadi, Rod J Snowdon, Agnieszka A Golicz
{"title":"Graphical pangenomics-enabled characterization of structural variant impact on gene expression in Brassica napus.","authors":"Gözde Yildiz, Silvia F Zanini, Sven Weber, Venkataramana Kopalli, Tobias Kox, Amine Abbadi, Rod J Snowdon, Agnieszka A Golicz","doi":"10.1007/s00122-025-04867-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Key message: </strong>Pangenome graphs enable population-scale genotyping and improve expression analysis, revealing that structural variations (SVs), particularly transposable elements (TEs), significantly contribute to gene expression variation in winter oilseed rape. Structural variations (SVs) impact important traits, from yield to flowering behaviour and stress responses. Pangenome graphs capture population-level diversity, including SVs, within a single data structure and provide a robust framework for downstream applications. They have the potential to serve as unbiased references for SV genotyping, pan-transcriptomic analyses, and association studies, offering significant advantages over single reference genomes. However, their full potential for expression quantitative trait locus (eQTL) analysis is yet to be explored. We combined long and short-read whole genome sequencing data with expression profiling of Brassica napus (oilseed rape) to assess the impact of SVs on gene expression regulation and explored the utility of pangenome graphs for eQTL analysis. Over 90,000 SVs were discovered from 57 long-read datasets. Pangenome graph as reference was evaluated and used for SV genotyping with short reads and transcript expression quantification. Using SVs genotyped from the graph and 100 expression datasets, we identified 267 gene proximal (cis) SV-eQTLs. Over 70% of eQTL-SVs had similarity to transposable elements (TEs), especially Helitrons. The highest proportion of cis-eQTL-SVs were found in promoter regions. About a third of transcripts whose expression was associated with SVs, had no associated SNPs, suggesting that including SVs allows capturing of relationship which would be missed in SNP-only analyses. This study demonstrated that pangenome graphs provide a unifying framework for eQTL analysis by allowing population-scale SV genotyping and gene expression quantification. We also showed that SVs make an appreciable contribution to gene expression variation in winter oilseed rape.</p>","PeriodicalId":22955,"journal":{"name":"Theoretical and Applied Genetics","volume":"138 4","pages":"91"},"PeriodicalIF":4.4000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11968540/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical and Applied Genetics","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s00122-025-04867-2","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0

Abstract

Key message: Pangenome graphs enable population-scale genotyping and improve expression analysis, revealing that structural variations (SVs), particularly transposable elements (TEs), significantly contribute to gene expression variation in winter oilseed rape. Structural variations (SVs) impact important traits, from yield to flowering behaviour and stress responses. Pangenome graphs capture population-level diversity, including SVs, within a single data structure and provide a robust framework for downstream applications. They have the potential to serve as unbiased references for SV genotyping, pan-transcriptomic analyses, and association studies, offering significant advantages over single reference genomes. However, their full potential for expression quantitative trait locus (eQTL) analysis is yet to be explored. We combined long and short-read whole genome sequencing data with expression profiling of Brassica napus (oilseed rape) to assess the impact of SVs on gene expression regulation and explored the utility of pangenome graphs for eQTL analysis. Over 90,000 SVs were discovered from 57 long-read datasets. Pangenome graph as reference was evaluated and used for SV genotyping with short reads and transcript expression quantification. Using SVs genotyped from the graph and 100 expression datasets, we identified 267 gene proximal (cis) SV-eQTLs. Over 70% of eQTL-SVs had similarity to transposable elements (TEs), especially Helitrons. The highest proportion of cis-eQTL-SVs were found in promoter regions. About a third of transcripts whose expression was associated with SVs, had no associated SNPs, suggesting that including SVs allows capturing of relationship which would be missed in SNP-only analyses. This study demonstrated that pangenome graphs provide a unifying framework for eQTL analysis by allowing population-scale SV genotyping and gene expression quantification. We also showed that SVs make an appreciable contribution to gene expression variation in winter oilseed rape.

结构变异对甘蓝型油菜基因表达影响的图形泛基因组学表征。
泛基因组图谱揭示了结构变异(SVs),特别是转座因子(te)对冬季油菜基因表达的影响,从而使群体尺度的基因分型和表达分析更加完善。结构变异(SVs)影响重要性状,从产量到开花行为和胁迫反应。泛基因组图在单一数据结构中捕获种群水平的多样性,包括sv,并为下游应用程序提供强大的框架。它们有可能作为SV基因分型、泛转录组分析和关联研究的无偏参考,与单一参考基因组相比具有显著优势。然而,它们在表达数量性状位点(eQTL)分析方面的潜力尚未充分挖掘。我们将长、短读全基因组测序数据与甘蓝型油菜(Brassica napus)的表达谱相结合,评估了SVs对基因表达调控的影响,并探索了泛基因组图谱在eQTL分析中的应用。从57个长读数据集中发现了超过90,000个sv。评估作为参考的泛基因组图,并使用短reads和转录物表达定量进行SV基因分型。利用图中的sv基因分型和100个表达数据集,我们鉴定出267个基因近端(cis) sv - eqtl。超过70%的eQTL-SVs与转座因子(te)具有相似性,尤其是helitron。在启动子区域发现的顺式- eqtl - sv比例最高。大约三分之一的转录本的表达与SVs相关,没有相关的snp,这表明包括SVs可以捕获在单核苷酸多态性分析中可能错过的关系。该研究表明,泛基因组图谱通过允许群体尺度的SV基因分型和基因表达量化,为eQTL分析提供了统一的框架。我们还发现,sv对冬季油菜的基因表达变异有显著的贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.60
自引率
7.40%
发文量
241
审稿时长
2.3 months
期刊介绍: Theoretical and Applied Genetics publishes original research and review articles in all key areas of modern plant genetics, plant genomics and plant biotechnology. All work needs to have a clear genetic component and significant impact on plant breeding. Theoretical considerations are only accepted in combination with new experimental data and/or if they indicate a relevant application in plant genetics or breeding. Emphasizing the practical, the journal focuses on research into leading crop plants and articles presenting innovative approaches.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信