Accurate genomic prediction for grain yield and grain moisture content of maize hybrids using multi-environment data.

IF 9.3 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Jingxin Wang, Liwei Liu, Kunhui He, Takele Weldu Gebrewahid, Shang Gao, Qingzhen Tian, Zhanyi Li, Yiqun Song, Yiliang Guo, Yanwei Li, Qinxin Cui, Luyan Zhang, Jiankang Wang, Changling Huang, Liang Li, Tingting Guo, Huihui Li
{"title":"Accurate genomic prediction for grain yield and grain moisture content of maize hybrids using multi-environment data.","authors":"Jingxin Wang, Liwei Liu, Kunhui He, Takele Weldu Gebrewahid, Shang Gao, Qingzhen Tian, Zhanyi Li, Yiqun Song, Yiliang Guo, Yanwei Li, Qinxin Cui, Luyan Zhang, Jiankang Wang, Changling Huang, Liang Li, Tingting Guo, Huihui Li","doi":"10.1111/jipb.13857","DOIUrl":null,"url":null,"abstract":"<p><p>Incorporating genotype-by-environment (GE) interaction effects into genomic prediction (GP) models with multi-environment climate data can improve selection accuracy to accelerate crop breeding but has received little research attention. Here, we conducted a cross-region GP study of grain moisture content (GMC) and grain yield (GY) in maize hybrids in two major Chinese growing regions using data for 19 climatic factors across 34 environments in 2020 and 2021. Predictions were conducted in 2,126 hybrids generated from 475 maize inbred lines, using 9,355 single nucleotide polymorphism markers for genotyping. Models based on genomic best linear unbiased prediction (GBLUP) incorporating GE interaction effects of 19 climatic factors associated with day length, transpiration, temperature, and radiation (GBLUP-GE<sub>19CF</sub>) trained on whole data set outperformed the traditional GBLUP or BayesB models in predicting GMC or GY by 10-fold cross-validation, achieving prediction accuracies of 0.731 and 0.331, respectively. To refine the climate data, we examined 84 statistical features associated with these climatic factors and identified nine factors most correlated with GMC or GY. Principal component analysis of climate data yielded nine principal components responsible for 97% of the variability in the data. Incorporating these nine factors or principal components into the GBLUP-GE framework with a similarity matrix of environments (GBLUP-GE<sub>9CF</sub> and GBLUP-GE<sub>PCA</sub>) provided similar prediction accuracies but could reduce the computational burden. In addition, increasing the number of test set environments in the training set from 8 to 14 increased the prediction accuracy of GBLUP-GE<sub>19CF</sub> trained with monthly average climate data for 2020-2021. Examining prediction accuracy based on concordance, the proportion of overlapping hybrids between the top 50% of predicted and observed values for GMC and GY, indicated that concordance exceeded 50% for the GBLUP-GE<sub>19CF</sub> model, confirming the reliability of our predictions. This study can provide practical guidance for optimizing GPs for maize breeding programs in multi-environment selection.</p>","PeriodicalId":195,"journal":{"name":"Journal of Integrative Plant Biology","volume":" ","pages":""},"PeriodicalIF":9.3000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Integrative Plant Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1111/jipb.13857","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Incorporating genotype-by-environment (GE) interaction effects into genomic prediction (GP) models with multi-environment climate data can improve selection accuracy to accelerate crop breeding but has received little research attention. Here, we conducted a cross-region GP study of grain moisture content (GMC) and grain yield (GY) in maize hybrids in two major Chinese growing regions using data for 19 climatic factors across 34 environments in 2020 and 2021. Predictions were conducted in 2,126 hybrids generated from 475 maize inbred lines, using 9,355 single nucleotide polymorphism markers for genotyping. Models based on genomic best linear unbiased prediction (GBLUP) incorporating GE interaction effects of 19 climatic factors associated with day length, transpiration, temperature, and radiation (GBLUP-GE19CF) trained on whole data set outperformed the traditional GBLUP or BayesB models in predicting GMC or GY by 10-fold cross-validation, achieving prediction accuracies of 0.731 and 0.331, respectively. To refine the climate data, we examined 84 statistical features associated with these climatic factors and identified nine factors most correlated with GMC or GY. Principal component analysis of climate data yielded nine principal components responsible for 97% of the variability in the data. Incorporating these nine factors or principal components into the GBLUP-GE framework with a similarity matrix of environments (GBLUP-GE9CF and GBLUP-GEPCA) provided similar prediction accuracies but could reduce the computational burden. In addition, increasing the number of test set environments in the training set from 8 to 14 increased the prediction accuracy of GBLUP-GE19CF trained with monthly average climate data for 2020-2021. Examining prediction accuracy based on concordance, the proportion of overlapping hybrids between the top 50% of predicted and observed values for GMC and GY, indicated that concordance exceeded 50% for the GBLUP-GE19CF model, confirming the reliability of our predictions. This study can provide practical guidance for optimizing GPs for maize breeding programs in multi-environment selection.

利用多环境数据对玉米杂交种籽粒产量和籽粒含水量进行精确的基因组预测。
在多环境气候数据的基因组预测模型中引入基因型-环境互作效应可以提高作物的选择精度,从而加快作物育种,但目前的研究较少。本研究利用2020年和2021年34个环境下19个气候因子的数据,对中国两个主要产区的玉米杂交种的籽粒含水量(GMC)和籽粒产量(GY)进行了跨区域GP研究。利用9355个单核苷酸多态性标记进行基因分型,对475个玉米自交系的2126个杂交种进行预测。结合日长、蒸腾量、温度和辐射等19个气候因子的GE交互效应的基因组最佳线性无偏预测(GBLUP- ge19cf)模型经过10倍交叉验证,预测GMC和GY的准确率分别为0.731和0.331,优于传统的GBLUP或BayesB模型。为了完善气候数据,我们研究了与这些气候因子相关的84个统计特征,并确定了与GMC或GY最相关的9个因素。对气候数据的主成分分析产生了9个主成分,这些主成分负责数据中97%的变异性。将这9个因素或主成分结合到具有相似环境矩阵(GBLUP-GE9CF和GBLUP-GEPCA)的GBLUP-GE框架中,可以提供相似的预测精度,但可以减少计算负担。此外,将训练集中的测试集环境数量从8个增加到14个,提高了使用2020-2021年月平均气候数据训练的GBLUP-GE19CF的预测精度。检验基于一致性的预测准确性,GMC和GY的预测值与观测值前50%的重叠杂交比例表明,GBLUP-GE19CF模型的一致性超过50%,证实了我们预测的可靠性。该研究可为玉米多环境选择中的GPs优化提供实践指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Integrative Plant Biology
Journal of Integrative Plant Biology 生物-生化与分子生物学
CiteScore
18.00
自引率
5.30%
发文量
220
审稿时长
3 months
期刊介绍: Journal of Integrative Plant Biology is a leading academic journal reporting on the latest discoveries in plant biology.Enjoy the latest news and developments in the field, understand new and improved methods and research tools, and explore basic biological questions through reproducible experimental design, using genetic, biochemical, cell and molecular biological methods, and statistical analyses.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信