Modelling and Using Spatial Effects in Nationwide Historical Data Improve Genomic Prediction of Rice Heading Date in Japan.

IF 4.8 1区 农林科学 Q1 AGRONOMY
Rice Pub Date : 2025-04-11 DOI:10.1186/s12284-025-00778-4
Shoji Taniguchi, Takeshi Hayashi, Hiroshi Nakagawa, Kei Matsushita, Hiromi Kajiya-Kanegae, Jun-Ichi Yonemaru, Akitoshi Goto
{"title":"Modelling and Using Spatial Effects in Nationwide Historical Data Improve Genomic Prediction of Rice Heading Date in Japan.","authors":"Shoji Taniguchi, Takeshi Hayashi, Hiroshi Nakagawa, Kei Matsushita, Hiromi Kajiya-Kanegae, Jun-Ichi Yonemaru, Akitoshi Goto","doi":"10.1186/s12284-025-00778-4","DOIUrl":null,"url":null,"abstract":"<p><p>Genomic prediction is a promising strategy for enhancing crop breeding efficiency. Historical data of breeding and cultivation tests from geographically wide regions presumably contain rich information for training genomic prediction models. Therefore, it is essential to explore methodologies to effectively handle such data. To improve the prediction accuracy of models using historical data, we incorporated a spatial model to account for spatial structures among field stations, in addition to conventional genomic prediction models. Targeting the rice heading date from historical data across Japan, we first constructed conventional genomic prediction models using genomic and/or meteorological elements as predictors. Next, we obtain the residual terms. Assuming that the residual terms were partly explained by the spatial effects assigned to each field station, a spatial model was applied to the residual terms and the spatial effects were calculated. Our genomic prediction models performed best when the genome, meteorological elements, and genome-meteorology interactions were included (model 3), and they performed second best when the genome and meteorological elements were included (model 2). For these genomic prediction models, residual terms were spatially biased and corrected for spatial effects. For the best model (model 3), the root mean squared errors (RMSE) of genomic prediction combined with spatial effects were approximately 3.6 days under tenfold cross-validation and approximately 5.1 days under leave-one-line-out cross-validation. The inclusion of the spatial effects improved the RMSEs by approximately 15% and 9% for the former and latter, respectively. Lines with highly improved predictions of the spatial effects were developed, mainly in the northern Tohoku region. The spatial effects were heterogeneous and regional patterns were detected. These findings imply that spatial effects are important not only for improving prediction performance but also for dissecting the model itself to identify the factors contributing to model improvement.</p>","PeriodicalId":21408,"journal":{"name":"Rice","volume":"18 1","pages":"27"},"PeriodicalIF":4.8000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11992326/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Rice","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1186/s12284-025-00778-4","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0

Abstract

Genomic prediction is a promising strategy for enhancing crop breeding efficiency. Historical data of breeding and cultivation tests from geographically wide regions presumably contain rich information for training genomic prediction models. Therefore, it is essential to explore methodologies to effectively handle such data. To improve the prediction accuracy of models using historical data, we incorporated a spatial model to account for spatial structures among field stations, in addition to conventional genomic prediction models. Targeting the rice heading date from historical data across Japan, we first constructed conventional genomic prediction models using genomic and/or meteorological elements as predictors. Next, we obtain the residual terms. Assuming that the residual terms were partly explained by the spatial effects assigned to each field station, a spatial model was applied to the residual terms and the spatial effects were calculated. Our genomic prediction models performed best when the genome, meteorological elements, and genome-meteorology interactions were included (model 3), and they performed second best when the genome and meteorological elements were included (model 2). For these genomic prediction models, residual terms were spatially biased and corrected for spatial effects. For the best model (model 3), the root mean squared errors (RMSE) of genomic prediction combined with spatial effects were approximately 3.6 days under tenfold cross-validation and approximately 5.1 days under leave-one-line-out cross-validation. The inclusion of the spatial effects improved the RMSEs by approximately 15% and 9% for the former and latter, respectively. Lines with highly improved predictions of the spatial effects were developed, mainly in the northern Tohoku region. The spatial effects were heterogeneous and regional patterns were detected. These findings imply that spatial effects are important not only for improving prediction performance but also for dissecting the model itself to identify the factors contributing to model improvement.

利用全国历史数据建模和空间效应改进日本水稻抽穗期基因组预测。
基因组预测是提高作物育种效率的一种有前景的策略。来自地理上广泛区域的育种和栽培试验的历史数据可能包含丰富的信息来训练基因组预测模型。因此,探索有效处理此类数据的方法至关重要。为了提高使用历史数据的模型的预测精度,我们在传统的基因组预测模型的基础上加入了一个空间模型来考虑野外站点之间的空间结构。针对日本各地历史数据中的水稻抽穗日期,我们首先构建了以基因组和/或气象要素为预测因子的常规基因组预测模型。接下来,我们得到残差项。假设各场站的空间效应可以部分解释残差项,对残差项建立空间模型,计算空间效应。我们的基因组预测模型在包含基因组、气象要素和基因组-气象相互作用时表现最好(模型3),在包含基因组和气象要素时表现第二好(模型2)。对于这些基因组预测模型,残差项在空间上有偏倚,并根据空间效应进行校正。对于最佳模型(模型3),结合空间效应的基因组预测的均方根误差(RMSE)在10倍交叉验证下约为3.6天,在留一行交叉验证下约为5.1天。纳入空间效应后,前者和后者的均方根误差分别提高了约15%和9%。主要在东北北部地区开发了具有高度改进的空间效应预测的线路。空间效应具有异质性,且存在区域格局。这些发现表明,空间效应不仅对提高预测性能很重要,而且对分析模型本身以确定有助于模型改进的因素也很重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Rice
Rice AGRONOMY-
CiteScore
10.10
自引率
3.60%
发文量
60
审稿时长
>12 weeks
期刊介绍: Rice aims to fill a glaring void in basic and applied plant science journal publishing. This journal is the world''s only high-quality serial publication for reporting current advances in rice genetics, structural and functional genomics, comparative genomics, molecular biology and physiology, molecular breeding and comparative biology. Rice welcomes review articles and original papers in all of the aforementioned areas and serves as the primary source of newly published information for researchers and students in rice and related research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信