Allison Vieira da Silva, Melina Prado, Gabriela Romêro Campos, Karina Lima Reis Borges, Rafael Massahiro Yassue, Gustavo Husein, Marcel Bellato Sposito, Lilian Amorim, José Crossa, Roberto Fritsche-Neto
{"title":"种间双亲种群基因组预测策略比较:茜草属案例研究","authors":"Allison Vieira da Silva, Melina Prado, Gabriela Romêro Campos, Karina Lima Reis Borges, Rafael Massahiro Yassue, Gustavo Husein, Marcel Bellato Sposito, Lilian Amorim, José Crossa, Roberto Fritsche-Neto","doi":"10.1007/s10681-024-03406-2","DOIUrl":null,"url":null,"abstract":"<p>Genomic selection (GS) is becoming increasingly widespread and applied due to the promising results obtained, cost savings in generating single nucleotide polymorphism (SNP) markers, and the development of statistical models that allow to improve the analysis robustness and accuracy. The composition and size of the training population have a major influence on GS, which poses challenges for interspecific biparental populations. Another factor is the use of different reference genomes from other species to perform SNP calling, which could make it possible to explore variability in interspecific crosses comprehensively. Late leaf rust is a disease caused by the pathogen <i>Acculeastrum americanum,</i> and there are reports on genetic resistance in <i>Rubus occidentalis,</i> which leads to the need for interspecific hybridizations, aiming to combine the fruit quality of <i>R. idaeus</i> with the resistance of <i>R. occidentalis</i>. The present study was carried out with a population of 94 interspecific raspberry hybrids. We evaluated the effect of different reference genomes on the SNP markers discovery, as well as training population optimization strategies on the accuracy of genomic predictions, namely the CV-α, leaving-one-family-out (LOFO), pairwise families, and stratified k-fold. The average predictive accuracies ranged from − 0.33 to 0.44 and We demonstrated higher prediction accuracy and more precise estimates when we combined stratified sampling to compose the training set (CV-α and k-fold stratified CV) and the panel of Unique markers. These results corroborate that genomic prediction aligned with SNP calling and training population optimization strategies can significantly increase genetic gains in interspecific biparental crosses.</p>","PeriodicalId":11803,"journal":{"name":"Euphytica","volume":null,"pages":null},"PeriodicalIF":1.6000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparing strategies for genomic predictions in interspecific biparental populations: a case study with the Rubus genus\",\"authors\":\"Allison Vieira da Silva, Melina Prado, Gabriela Romêro Campos, Karina Lima Reis Borges, Rafael Massahiro Yassue, Gustavo Husein, Marcel Bellato Sposito, Lilian Amorim, José Crossa, Roberto Fritsche-Neto\",\"doi\":\"10.1007/s10681-024-03406-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Genomic selection (GS) is becoming increasingly widespread and applied due to the promising results obtained, cost savings in generating single nucleotide polymorphism (SNP) markers, and the development of statistical models that allow to improve the analysis robustness and accuracy. The composition and size of the training population have a major influence on GS, which poses challenges for interspecific biparental populations. Another factor is the use of different reference genomes from other species to perform SNP calling, which could make it possible to explore variability in interspecific crosses comprehensively. Late leaf rust is a disease caused by the pathogen <i>Acculeastrum americanum,</i> and there are reports on genetic resistance in <i>Rubus occidentalis,</i> which leads to the need for interspecific hybridizations, aiming to combine the fruit quality of <i>R. idaeus</i> with the resistance of <i>R. occidentalis</i>. The present study was carried out with a population of 94 interspecific raspberry hybrids. We evaluated the effect of different reference genomes on the SNP markers discovery, as well as training population optimization strategies on the accuracy of genomic predictions, namely the CV-α, leaving-one-family-out (LOFO), pairwise families, and stratified k-fold. The average predictive accuracies ranged from − 0.33 to 0.44 and We demonstrated higher prediction accuracy and more precise estimates when we combined stratified sampling to compose the training set (CV-α and k-fold stratified CV) and the panel of Unique markers. These results corroborate that genomic prediction aligned with SNP calling and training population optimization strategies can significantly increase genetic gains in interspecific biparental crosses.</p>\",\"PeriodicalId\":11803,\"journal\":{\"name\":\"Euphytica\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2024-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Euphytica\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1007/s10681-024-03406-2\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AGRONOMY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Euphytica","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s10681-024-03406-2","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
摘要
基因组选择(GS)的应用越来越广泛,这是因为它取得了可喜的成果,节省了生成单核苷酸多态性(SNP)标记的成本,而且统计模型的开发提高了分析的稳健性和准确性。训练群体的组成和规模对 GS 有重大影响,这对种间双亲群体构成了挑战。另一个因素是使用其他物种的不同参考基因组来进行 SNP 调用,这样可以全面探索种间杂交的变异性。晚叶锈病是由病原体 Acculeastrum americanum 引起的一种病害,有报道称西红宝石具有遗传抗性,因此需要进行种间杂交,目的是将西红宝石的果实品质与西红宝石的抗性结合起来。本研究使用了 94 个树莓种间杂交种。我们评估了不同参考基因组对 SNP 标记发现的影响,以及训练群体优化策略对基因组预测准确性的影响,即 CV-α、Leaving one-family-out (LOFO)、pairwise families 和 stratified k-fold。当我们结合分层抽样来组成训练集(CV-α 和 k-fold 分层 CV)和 Unique 标记小组时,我们证明了更高的预测准确率和更精确的估计值。这些结果证实,基因组预测与 SNP 调用和训练群体优化策略相结合,可以显著提高种间双亲杂交的遗传收益。
Comparing strategies for genomic predictions in interspecific biparental populations: a case study with the Rubus genus
Genomic selection (GS) is becoming increasingly widespread and applied due to the promising results obtained, cost savings in generating single nucleotide polymorphism (SNP) markers, and the development of statistical models that allow to improve the analysis robustness and accuracy. The composition and size of the training population have a major influence on GS, which poses challenges for interspecific biparental populations. Another factor is the use of different reference genomes from other species to perform SNP calling, which could make it possible to explore variability in interspecific crosses comprehensively. Late leaf rust is a disease caused by the pathogen Acculeastrum americanum, and there are reports on genetic resistance in Rubus occidentalis, which leads to the need for interspecific hybridizations, aiming to combine the fruit quality of R. idaeus with the resistance of R. occidentalis. The present study was carried out with a population of 94 interspecific raspberry hybrids. We evaluated the effect of different reference genomes on the SNP markers discovery, as well as training population optimization strategies on the accuracy of genomic predictions, namely the CV-α, leaving-one-family-out (LOFO), pairwise families, and stratified k-fold. The average predictive accuracies ranged from − 0.33 to 0.44 and We demonstrated higher prediction accuracy and more precise estimates when we combined stratified sampling to compose the training set (CV-α and k-fold stratified CV) and the panel of Unique markers. These results corroborate that genomic prediction aligned with SNP calling and training population optimization strategies can significantly increase genetic gains in interspecific biparental crosses.
期刊介绍:
Euphytica is an international journal on theoretical and applied aspects of plant breeding. It publishes critical reviews and papers on the results of original research related to plant breeding.
The integration of modern and traditional plant breeding is a growing field of research using transgenic crop plants and/or marker assisted breeding in combination with traditional breeding tools. The content should cover the interests of researchers directly or indirectly involved in plant breeding, at universities, breeding institutes, seed industries, plant biotech companies and industries using plant raw materials, and promote stability, adaptability and sustainability in agriculture and agro-industries.