{"title":"利用全国历史数据建模和空间效应改进日本水稻抽穗期基因组预测。","authors":"Shoji Taniguchi, Takeshi Hayashi, Hiroshi Nakagawa, Kei Matsushita, Hiromi Kajiya-Kanegae, Jun-Ichi Yonemaru, Akitoshi Goto","doi":"10.1186/s12284-025-00778-4","DOIUrl":null,"url":null,"abstract":"<p><p>Genomic prediction is a promising strategy for enhancing crop breeding efficiency. Historical data of breeding and cultivation tests from geographically wide regions presumably contain rich information for training genomic prediction models. Therefore, it is essential to explore methodologies to effectively handle such data. To improve the prediction accuracy of models using historical data, we incorporated a spatial model to account for spatial structures among field stations, in addition to conventional genomic prediction models. Targeting the rice heading date from historical data across Japan, we first constructed conventional genomic prediction models using genomic and/or meteorological elements as predictors. Next, we obtain the residual terms. Assuming that the residual terms were partly explained by the spatial effects assigned to each field station, a spatial model was applied to the residual terms and the spatial effects were calculated. Our genomic prediction models performed best when the genome, meteorological elements, and genome-meteorology interactions were included (model 3), and they performed second best when the genome and meteorological elements were included (model 2). For these genomic prediction models, residual terms were spatially biased and corrected for spatial effects. For the best model (model 3), the root mean squared errors (RMSE) of genomic prediction combined with spatial effects were approximately 3.6 days under tenfold cross-validation and approximately 5.1 days under leave-one-line-out cross-validation. The inclusion of the spatial effects improved the RMSEs by approximately 15% and 9% for the former and latter, respectively. Lines with highly improved predictions of the spatial effects were developed, mainly in the northern Tohoku region. The spatial effects were heterogeneous and regional patterns were detected. These findings imply that spatial effects are important not only for improving prediction performance but also for dissecting the model itself to identify the factors contributing to model improvement.</p>","PeriodicalId":21408,"journal":{"name":"Rice","volume":"18 1","pages":"27"},"PeriodicalIF":4.8000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11992326/pdf/","citationCount":"0","resultStr":"{\"title\":\"Modelling and Using Spatial Effects in Nationwide Historical Data Improve Genomic Prediction of Rice Heading Date in Japan.\",\"authors\":\"Shoji Taniguchi, Takeshi Hayashi, Hiroshi Nakagawa, Kei Matsushita, Hiromi Kajiya-Kanegae, Jun-Ichi Yonemaru, Akitoshi Goto\",\"doi\":\"10.1186/s12284-025-00778-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Genomic prediction is a promising strategy for enhancing crop breeding efficiency. Historical data of breeding and cultivation tests from geographically wide regions presumably contain rich information for training genomic prediction models. Therefore, it is essential to explore methodologies to effectively handle such data. To improve the prediction accuracy of models using historical data, we incorporated a spatial model to account for spatial structures among field stations, in addition to conventional genomic prediction models. Targeting the rice heading date from historical data across Japan, we first constructed conventional genomic prediction models using genomic and/or meteorological elements as predictors. Next, we obtain the residual terms. Assuming that the residual terms were partly explained by the spatial effects assigned to each field station, a spatial model was applied to the residual terms and the spatial effects were calculated. Our genomic prediction models performed best when the genome, meteorological elements, and genome-meteorology interactions were included (model 3), and they performed second best when the genome and meteorological elements were included (model 2). For these genomic prediction models, residual terms were spatially biased and corrected for spatial effects. For the best model (model 3), the root mean squared errors (RMSE) of genomic prediction combined with spatial effects were approximately 3.6 days under tenfold cross-validation and approximately 5.1 days under leave-one-line-out cross-validation. The inclusion of the spatial effects improved the RMSEs by approximately 15% and 9% for the former and latter, respectively. Lines with highly improved predictions of the spatial effects were developed, mainly in the northern Tohoku region. The spatial effects were heterogeneous and regional patterns were detected. These findings imply that spatial effects are important not only for improving prediction performance but also for dissecting the model itself to identify the factors contributing to model improvement.</p>\",\"PeriodicalId\":21408,\"journal\":{\"name\":\"Rice\",\"volume\":\"18 1\",\"pages\":\"27\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11992326/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Rice\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1186/s12284-025-00778-4\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRONOMY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Rice","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1186/s12284-025-00778-4","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
Modelling and Using Spatial Effects in Nationwide Historical Data Improve Genomic Prediction of Rice Heading Date in Japan.
Genomic prediction is a promising strategy for enhancing crop breeding efficiency. Historical data of breeding and cultivation tests from geographically wide regions presumably contain rich information for training genomic prediction models. Therefore, it is essential to explore methodologies to effectively handle such data. To improve the prediction accuracy of models using historical data, we incorporated a spatial model to account for spatial structures among field stations, in addition to conventional genomic prediction models. Targeting the rice heading date from historical data across Japan, we first constructed conventional genomic prediction models using genomic and/or meteorological elements as predictors. Next, we obtain the residual terms. Assuming that the residual terms were partly explained by the spatial effects assigned to each field station, a spatial model was applied to the residual terms and the spatial effects were calculated. Our genomic prediction models performed best when the genome, meteorological elements, and genome-meteorology interactions were included (model 3), and they performed second best when the genome and meteorological elements were included (model 2). For these genomic prediction models, residual terms were spatially biased and corrected for spatial effects. For the best model (model 3), the root mean squared errors (RMSE) of genomic prediction combined with spatial effects were approximately 3.6 days under tenfold cross-validation and approximately 5.1 days under leave-one-line-out cross-validation. The inclusion of the spatial effects improved the RMSEs by approximately 15% and 9% for the former and latter, respectively. Lines with highly improved predictions of the spatial effects were developed, mainly in the northern Tohoku region. The spatial effects were heterogeneous and regional patterns were detected. These findings imply that spatial effects are important not only for improving prediction performance but also for dissecting the model itself to identify the factors contributing to model improvement.
期刊介绍:
Rice aims to fill a glaring void in basic and applied plant science journal publishing. This journal is the world''s only high-quality serial publication for reporting current advances in rice genetics, structural and functional genomics, comparative genomics, molecular biology and physiology, molecular breeding and comparative biology. Rice welcomes review articles and original papers in all of the aforementioned areas and serves as the primary source of newly published information for researchers and students in rice and related research.