Comparing strategies for genomic predictions in interspecific biparental populations: a case study with the Rubus genus

IF 1.6 3区 农林科学 Q2 AGRONOMY
Allison Vieira da Silva, Melina Prado, Gabriela Romêro Campos, Karina Lima Reis Borges, Rafael Massahiro Yassue, Gustavo Husein, Marcel Bellato Sposito, Lilian Amorim, José Crossa, Roberto Fritsche-Neto
{"title":"Comparing strategies for genomic predictions in interspecific biparental populations: a case study with the Rubus genus","authors":"Allison Vieira da Silva, Melina Prado, Gabriela Romêro Campos, Karina Lima Reis Borges, Rafael Massahiro Yassue, Gustavo Husein, Marcel Bellato Sposito, Lilian Amorim, José Crossa, Roberto Fritsche-Neto","doi":"10.1007/s10681-024-03406-2","DOIUrl":null,"url":null,"abstract":"<p>Genomic selection (GS) is becoming increasingly widespread and applied due to the promising results obtained, cost savings in generating single nucleotide polymorphism (SNP) markers, and the development of statistical models that allow to improve the analysis robustness and accuracy. The composition and size of the training population have a major influence on GS, which poses challenges for interspecific biparental populations. Another factor is the use of different reference genomes from other species to perform SNP calling, which could make it possible to explore variability in interspecific crosses comprehensively. Late leaf rust is a disease caused by the pathogen <i>Acculeastrum americanum,</i> and there are reports on genetic resistance in <i>Rubus occidentalis,</i> which leads to the need for interspecific hybridizations, aiming to combine the fruit quality of <i>R. idaeus</i> with the resistance of <i>R. occidentalis</i>. The present study was carried out with a population of 94 interspecific raspberry hybrids. We evaluated the effect of different reference genomes on the SNP markers discovery, as well as training population optimization strategies on the accuracy of genomic predictions, namely the CV-α, leaving-one-family-out (LOFO), pairwise families, and stratified k-fold. The average predictive accuracies ranged from − 0.33 to 0.44 and We demonstrated higher prediction accuracy and more precise estimates when we combined stratified sampling to compose the training set (CV-α and k-fold stratified CV) and the panel of Unique markers. These results corroborate that genomic prediction aligned with SNP calling and training population optimization strategies can significantly increase genetic gains in interspecific biparental crosses.</p>","PeriodicalId":11803,"journal":{"name":"Euphytica","volume":"6 1","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Euphytica","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s10681-024-03406-2","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0

Abstract

Genomic selection (GS) is becoming increasingly widespread and applied due to the promising results obtained, cost savings in generating single nucleotide polymorphism (SNP) markers, and the development of statistical models that allow to improve the analysis robustness and accuracy. The composition and size of the training population have a major influence on GS, which poses challenges for interspecific biparental populations. Another factor is the use of different reference genomes from other species to perform SNP calling, which could make it possible to explore variability in interspecific crosses comprehensively. Late leaf rust is a disease caused by the pathogen Acculeastrum americanum, and there are reports on genetic resistance in Rubus occidentalis, which leads to the need for interspecific hybridizations, aiming to combine the fruit quality of R. idaeus with the resistance of R. occidentalis. The present study was carried out with a population of 94 interspecific raspberry hybrids. We evaluated the effect of different reference genomes on the SNP markers discovery, as well as training population optimization strategies on the accuracy of genomic predictions, namely the CV-α, leaving-one-family-out (LOFO), pairwise families, and stratified k-fold. The average predictive accuracies ranged from − 0.33 to 0.44 and We demonstrated higher prediction accuracy and more precise estimates when we combined stratified sampling to compose the training set (CV-α and k-fold stratified CV) and the panel of Unique markers. These results corroborate that genomic prediction aligned with SNP calling and training population optimization strategies can significantly increase genetic gains in interspecific biparental crosses.

Abstract Image

种间双亲种群基因组预测策略比较:茜草属案例研究
基因组选择(GS)的应用越来越广泛,这是因为它取得了可喜的成果,节省了生成单核苷酸多态性(SNP)标记的成本,而且统计模型的开发提高了分析的稳健性和准确性。训练群体的组成和规模对 GS 有重大影响,这对种间双亲群体构成了挑战。另一个因素是使用其他物种的不同参考基因组来进行 SNP 调用,这样可以全面探索种间杂交的变异性。晚叶锈病是由病原体 Acculeastrum americanum 引起的一种病害,有报道称西红宝石具有遗传抗性,因此需要进行种间杂交,目的是将西红宝石的果实品质与西红宝石的抗性结合起来。本研究使用了 94 个树莓种间杂交种。我们评估了不同参考基因组对 SNP 标记发现的影响,以及训练群体优化策略对基因组预测准确性的影响,即 CV-α、Leaving one-family-out (LOFO)、pairwise families 和 stratified k-fold。当我们结合分层抽样来组成训练集(CV-α 和 k-fold 分层 CV)和 Unique 标记小组时,我们证明了更高的预测准确率和更精确的估计值。这些结果证实,基因组预测与 SNP 调用和训练群体优化策略相结合,可以显著提高种间双亲杂交的遗传收益。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Euphytica
Euphytica 农林科学-农艺学
CiteScore
3.80
自引率
5.30%
发文量
157
审稿时长
4.5 months
期刊介绍: Euphytica is an international journal on theoretical and applied aspects of plant breeding. It publishes critical reviews and papers on the results of original research related to plant breeding. The integration of modern and traditional plant breeding is a growing field of research using transgenic crop plants and/or marker assisted breeding in combination with traditional breeding tools. The content should cover the interests of researchers directly or indirectly involved in plant breeding, at universities, breeding institutes, seed industries, plant biotech companies and industries using plant raw materials, and promote stability, adaptability and sustainability in agriculture and agro-industries.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信