SNP 标记密度和训练群体大小对紫花苜蓿(Medicago sativa L.)基因组选择预测准确性的影响。

IF 3.9 2区 生物学 Q1 GENETICS & HEREDITY
Plant Genome Pub Date : 2024-03-01 Epub Date: 2024-01-23 DOI:10.1002/tpg2.20431
Hu Wang, Yuguang Bai, Bill Biligetu
{"title":"SNP 标记密度和训练群体大小对紫花苜蓿(Medicago sativa L.)基因组选择预测准确性的影响。","authors":"Hu Wang, Yuguang Bai, Bill Biligetu","doi":"10.1002/tpg2.20431","DOIUrl":null,"url":null,"abstract":"<p><p>Effects of individual single-nucleotide polymorphism (SNP) markers and the size of \"training\" and \"test\" populations affect prediction accuracy in genomic selection (GS). This study evaluated 11 subsets of 4932 SNPs using six genetic additive methods to understand marker density in GS prediction in alfalfa (Medicago sativa L.). In the GS methods, the effect of \"training\" to \"test\" population size was also evaluated. Fourteen alfalfa populations sampled from long-term grazing sites were genotyped using genotyping by sequencing for the identification of SNPs. These populations were also phenotyped for six agromorphological and three nutritive traits from 2018 to 2020. The accuracy of GS prediction improved across six GS methods when the ratio of \"training\" to \"test\" population size increased. However, the prediction accuracy of the six GS methods reduced to a range of -0.27 to 0.11 when random, uninformative SNPs were used. In this study, five Bayesian methods and ridge-regression best linear unbiased prediction (rrBLUP) method had similar GS accuracies for \"training\" sets, but rrBLUP tended to outperform Bayesian methods in independent \"test\" sets when SNP subsets with high mean-squared-estimated-marker effect were used. These findings can enhance the application of GS in alfalfa genetic improvement.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":null,"pages":null},"PeriodicalIF":3.9000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Effects of SNP marker density and training population size on prediction accuracy in alfalfa (Medicago sativa L.) genomic selection.\",\"authors\":\"Hu Wang, Yuguang Bai, Bill Biligetu\",\"doi\":\"10.1002/tpg2.20431\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Effects of individual single-nucleotide polymorphism (SNP) markers and the size of \\\"training\\\" and \\\"test\\\" populations affect prediction accuracy in genomic selection (GS). This study evaluated 11 subsets of 4932 SNPs using six genetic additive methods to understand marker density in GS prediction in alfalfa (Medicago sativa L.). In the GS methods, the effect of \\\"training\\\" to \\\"test\\\" population size was also evaluated. Fourteen alfalfa populations sampled from long-term grazing sites were genotyped using genotyping by sequencing for the identification of SNPs. These populations were also phenotyped for six agromorphological and three nutritive traits from 2018 to 2020. The accuracy of GS prediction improved across six GS methods when the ratio of \\\"training\\\" to \\\"test\\\" population size increased. However, the prediction accuracy of the six GS methods reduced to a range of -0.27 to 0.11 when random, uninformative SNPs were used. In this study, five Bayesian methods and ridge-regression best linear unbiased prediction (rrBLUP) method had similar GS accuracies for \\\"training\\\" sets, but rrBLUP tended to outperform Bayesian methods in independent \\\"test\\\" sets when SNP subsets with high mean-squared-estimated-marker effect were used. These findings can enhance the application of GS in alfalfa genetic improvement.</p>\",\"PeriodicalId\":49002,\"journal\":{\"name\":\"Plant Genome\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Plant Genome\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1002/tpg2.20431\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/23 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Genome","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/tpg2.20431","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/23 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

单个单核苷酸多态性(SNP)标记的影响以及 "训练 "和 "测试 "群体的大小会影响基因组选择(GS)的预测准确性。本研究使用六种遗传加和方法评估了 4932 个 SNP 的 11 个子集,以了解标记密度在紫花苜蓿(Medicago sativa L.)基因组选择预测中的作用。在 GS 方法中,还评估了 "训练 "到 "测试 "群体大小的影响。利用基因分型测序技术对从长期放牧地采样的 14 个紫花苜蓿种群进行了基因分型,以鉴定 SNPs。从 2018 年到 2020 年,还对这些种群的六个农业形态和三个营养性状进行了表型分析。当 "训练 "与 "测试 "种群规模的比例增加时,六种GS方法的GS预测准确率都有所提高。然而,当使用随机、无信息的SNP时,六种GS方法的预测准确率降至-0.27至0.11之间。在本研究中,五种贝叶斯方法和脊回归最佳线性无偏预测(rrBLUP)方法在 "训练 "集上的 GS 准确率相似,但当使用具有高均方估计标记效应的 SNP 子集时,rrBLUP 在独立的 "测试 "集上的表现往往优于贝叶斯方法。这些发现可以提高 GS 在苜蓿遗传改良中的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Effects of SNP marker density and training population size on prediction accuracy in alfalfa (Medicago sativa L.) genomic selection.

Effects of individual single-nucleotide polymorphism (SNP) markers and the size of "training" and "test" populations affect prediction accuracy in genomic selection (GS). This study evaluated 11 subsets of 4932 SNPs using six genetic additive methods to understand marker density in GS prediction in alfalfa (Medicago sativa L.). In the GS methods, the effect of "training" to "test" population size was also evaluated. Fourteen alfalfa populations sampled from long-term grazing sites were genotyped using genotyping by sequencing for the identification of SNPs. These populations were also phenotyped for six agromorphological and three nutritive traits from 2018 to 2020. The accuracy of GS prediction improved across six GS methods when the ratio of "training" to "test" population size increased. However, the prediction accuracy of the six GS methods reduced to a range of -0.27 to 0.11 when random, uninformative SNPs were used. In this study, five Bayesian methods and ridge-regression best linear unbiased prediction (rrBLUP) method had similar GS accuracies for "training" sets, but rrBLUP tended to outperform Bayesian methods in independent "test" sets when SNP subsets with high mean-squared-estimated-marker effect were used. These findings can enhance the application of GS in alfalfa genetic improvement.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Plant Genome
Plant Genome PLANT SCIENCES-GENETICS & HEREDITY
CiteScore
6.00
自引率
4.80%
发文量
93
审稿时长
>12 weeks
期刊介绍: The Plant Genome publishes original research investigating all aspects of plant genomics. Technical breakthroughs reporting improvements in the efficiency and speed of acquiring and interpreting plant genomics data are welcome. The editorial board gives preference to novel reports that use innovative genomic applications that advance our understanding of plant biology that may have applications to crop improvement. The journal also publishes invited review articles and perspectives that offer insight and commentary on recent advances in genomics and their potential for agronomic improvement.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信