大麦多亲本RIL群体转录组数据的基因组预测能力评估。

IF 4.2 1区 农林科学 Q1 AGRONOMY
Christopher Arlt, Delphine van Inghelandt, Jinquan Li, Benjamin Stich
{"title":"大麦多亲本RIL群体转录组数据的基因组预测能力评估。","authors":"Christopher Arlt, Delphine van Inghelandt, Jinquan Li, Benjamin Stich","doi":"10.1007/s00122-025-05029-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Key message: </strong>Low-cost and high-throughput RNA sequencing data for barley RILs achieved GP performance comparable to or better than traditional SNP array datasets when combined with parental whole-genome sequencing SNP data. The field of genomic selection (GS) is advancing rapidly on many fronts including the utilization of multi-omics datasets with the goal of increasing prediction ability and becoming an integral part of an increasing number of breeding programs ensuring future food security. In this study, we used RNA sequencing (RNA-Seq) data to perform genomic prediction (GP) on three related barley RIL populations. We investigated the potential of increasing prediction ability by combining genomic and transcriptomic datasets, adding whole-genome sequencing (WGS) SNP data, functional annotation-based filtering, and empirical quality filtering. Our RNA-Seq data were generated cost-efficiently using small-footprint plant cultivation, high-throughput RNA extraction, and Library preparation miniaturization. We also examined sequencing depth reduction as an additional cost-saving measure. We used fivefold cross-validation to evaluate the prediction ability of the gene expression dataset, the RNA-Seq SNP dataset, and the consensus SNP dataset between the RNA-Seq and parental WGS data, resulting in prediction abilities between 0.73 and 0.78. The consensus SNP dataset performed best, with five out of eight traits performing significantly better compared to a 50K SNP array, which served as a benchmark. The advantage of the consensus SNP dataset was most prominent in the inter-population predictions, in which the training and validation sets originated from different RIL sub-populations. We were therefore able to not only show that RNA-Seq data alone are able to predict various complex traits in barley using RILs, but also that the performance can be further increased with WGS data for which the public availability will steadily increase.</p>","PeriodicalId":22955,"journal":{"name":"Theoretical and Applied Genetics","volume":"138 10","pages":"247"},"PeriodicalIF":4.2000,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12423136/pdf/","citationCount":"0","resultStr":"{\"title\":\"Assessment of genomic prediction capabilities of transcriptome data in a barley multi-parent RIL population.\",\"authors\":\"Christopher Arlt, Delphine van Inghelandt, Jinquan Li, Benjamin Stich\",\"doi\":\"10.1007/s00122-025-05029-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Key message: </strong>Low-cost and high-throughput RNA sequencing data for barley RILs achieved GP performance comparable to or better than traditional SNP array datasets when combined with parental whole-genome sequencing SNP data. The field of genomic selection (GS) is advancing rapidly on many fronts including the utilization of multi-omics datasets with the goal of increasing prediction ability and becoming an integral part of an increasing number of breeding programs ensuring future food security. In this study, we used RNA sequencing (RNA-Seq) data to perform genomic prediction (GP) on three related barley RIL populations. We investigated the potential of increasing prediction ability by combining genomic and transcriptomic datasets, adding whole-genome sequencing (WGS) SNP data, functional annotation-based filtering, and empirical quality filtering. Our RNA-Seq data were generated cost-efficiently using small-footprint plant cultivation, high-throughput RNA extraction, and Library preparation miniaturization. We also examined sequencing depth reduction as an additional cost-saving measure. We used fivefold cross-validation to evaluate the prediction ability of the gene expression dataset, the RNA-Seq SNP dataset, and the consensus SNP dataset between the RNA-Seq and parental WGS data, resulting in prediction abilities between 0.73 and 0.78. The consensus SNP dataset performed best, with five out of eight traits performing significantly better compared to a 50K SNP array, which served as a benchmark. The advantage of the consensus SNP dataset was most prominent in the inter-population predictions, in which the training and validation sets originated from different RIL sub-populations. We were therefore able to not only show that RNA-Seq data alone are able to predict various complex traits in barley using RILs, but also that the performance can be further increased with WGS data for which the public availability will steadily increase.</p>\",\"PeriodicalId\":22955,\"journal\":{\"name\":\"Theoretical and Applied Genetics\",\"volume\":\"138 10\",\"pages\":\"247\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12423136/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Theoretical and Applied Genetics\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1007/s00122-025-05029-0\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRONOMY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical and Applied Genetics","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s00122-025-05029-0","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0

摘要

关键信息:低成本和高通量的大麦RILs RNA测序数据与亲本全基因组测序SNP数据相结合,实现了与传统SNP阵列数据集相当或更好的GP性能。基因组选择(GS)领域在许多方面发展迅速,包括多组学数据集的利用,其目标是提高预测能力,并成为越来越多的育种计划的组成部分,确保未来的粮食安全。在这项研究中,我们使用RNA测序(RNA- seq)数据对三个相关的大麦RIL群体进行基因组预测(GP)。我们通过结合基因组和转录组数据集、添加全基因组测序(WGS) SNP数据、基于功能注释的过滤和经验质量过滤来研究提高预测能力的潜力。我们的RNA- seq数据是通过小足迹植物培养、高通量RNA提取和文库制备小型化而经济高效地生成的。我们还研究了减少测序深度作为额外的成本节约措施。我们使用五重交叉验证来评估基因表达数据集、RNA-Seq SNP数据集以及RNA-Seq和亲本WGS数据之间的共识SNP数据集的预测能力,结果预测能力在0.73到0.78之间。共识SNP数据集表现最好,与作为基准的50K SNP阵列相比,8个特征中有5个表现明显更好。共识SNP数据集的优势在种群间预测中最为突出,其中训练集和验证集来自不同的RIL亚种群。因此,我们不仅能够证明RNA-Seq数据本身能够利用rls预测大麦的各种复杂性状,而且还可以进一步提高WGS数据的性能,WGS数据的公众可用性将稳步增加。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Assessment of genomic prediction capabilities of transcriptome data in a barley multi-parent RIL population.

Key message: Low-cost and high-throughput RNA sequencing data for barley RILs achieved GP performance comparable to or better than traditional SNP array datasets when combined with parental whole-genome sequencing SNP data. The field of genomic selection (GS) is advancing rapidly on many fronts including the utilization of multi-omics datasets with the goal of increasing prediction ability and becoming an integral part of an increasing number of breeding programs ensuring future food security. In this study, we used RNA sequencing (RNA-Seq) data to perform genomic prediction (GP) on three related barley RIL populations. We investigated the potential of increasing prediction ability by combining genomic and transcriptomic datasets, adding whole-genome sequencing (WGS) SNP data, functional annotation-based filtering, and empirical quality filtering. Our RNA-Seq data were generated cost-efficiently using small-footprint plant cultivation, high-throughput RNA extraction, and Library preparation miniaturization. We also examined sequencing depth reduction as an additional cost-saving measure. We used fivefold cross-validation to evaluate the prediction ability of the gene expression dataset, the RNA-Seq SNP dataset, and the consensus SNP dataset between the RNA-Seq and parental WGS data, resulting in prediction abilities between 0.73 and 0.78. The consensus SNP dataset performed best, with five out of eight traits performing significantly better compared to a 50K SNP array, which served as a benchmark. The advantage of the consensus SNP dataset was most prominent in the inter-population predictions, in which the training and validation sets originated from different RIL sub-populations. We were therefore able to not only show that RNA-Seq data alone are able to predict various complex traits in barley using RILs, but also that the performance can be further increased with WGS data for which the public availability will steadily increase.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
9.60
自引率
7.40%
发文量
241
审稿时长
2.3 months
期刊介绍: Theoretical and Applied Genetics publishes original research and review articles in all key areas of modern plant genetics, plant genomics and plant biotechnology. All work needs to have a clear genetic component and significant impact on plant breeding. Theoretical considerations are only accepted in combination with new experimental data and/or if they indicate a relevant application in plant genetics or breeding. Emphasizing the practical, the journal focuses on research into leading crop plants and articles presenting innovative approaches.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信