Forecasting crop yield through a data-driven framework of remote sensing and biophysical knowledge: A case study for wheat and maize in the Guanzhong Plain, China

IF 5.5 1区 农林科学 Q1 AGRONOMY
European Journal of Agronomy Pub Date : 2026-04-01 Epub Date: 2026-02-12 DOI:10.1016/j.eja.2026.128038
Zhikai Cheng, Xiaobo Gu, Yuanling Zhang, Tongtong Zhao, Shikun Sun, Yadan Du, Huanjie Cai
{"title":"Forecasting crop yield through a data-driven framework of remote sensing and biophysical knowledge: A case study for wheat and maize in the Guanzhong Plain, China","authors":"Zhikai Cheng,&nbsp;Xiaobo Gu,&nbsp;Yuanling Zhang,&nbsp;Tongtong Zhao,&nbsp;Shikun Sun,&nbsp;Yadan Du,&nbsp;Huanjie Cai","doi":"10.1016/j.eja.2026.128038","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate early yield forecasts are essential for maximizing benefits and ensuring food security in the Guanzhong Plain, China. Process-based crop models are often constrained by uncertain input data, which limits their ability to forecast yield at the regional grid level (e.g., 1 km × 1 km). Statistical models, ignore the biophysical mechanisms underlying crop growth and development, and their performance is limited by the quantity and quality of available training data. Therefore, there is an urgent need for a more comprehensive and robust grid-level wheat and maize yield forecasting approach for the Guanzhong Plain. In this study, an interpretable data-driven framework was developed to forecast wheat and maize yields by combining remote sensing (solar-induced chlorophyll fluorescence, SIF; spectral indices, SIs) and biophysical knowledge (APSIM outputs and extreme climatic events) data. A Bayesian integration model (BIM) was trained on high-quality synthetic datasets (obtained by the synthetic minority oversampling technique for regression, SMOTER) to achieve accurate harvest-time yield forecasts at specific time windows. The results showed that the integration of multi-source data reduced the yield prediction error, with the overall normalized root mean square error (NRMSE) decreasing by 0.6 %–39.0 % compared to the single-source models. The data-driven model trained on the SMOTER -based synthetic dataset achieved the highest yield forecasting accuracy (wheat: NRMSE = 16.2 %; maize: NRMSE = 20.7 %). The SIF made the largest contribution to yield forecasts and showed strong interactions and synergies with other feature variables (e.g., aboveground biomass, drought, and low temperature stress), further enhancing model performance. Overall, the proposed data-driven framework demonstrates a promising way for improving grid-level yield forecasting and provides useful insights for the sustainable development of agricultural systems.</div></div>","PeriodicalId":51045,"journal":{"name":"European Journal of Agronomy","volume":"175 ","pages":"Article 128038"},"PeriodicalIF":5.5000,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Agronomy","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1161030126000572","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/2/12 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate early yield forecasts are essential for maximizing benefits and ensuring food security in the Guanzhong Plain, China. Process-based crop models are often constrained by uncertain input data, which limits their ability to forecast yield at the regional grid level (e.g., 1 km × 1 km). Statistical models, ignore the biophysical mechanisms underlying crop growth and development, and their performance is limited by the quantity and quality of available training data. Therefore, there is an urgent need for a more comprehensive and robust grid-level wheat and maize yield forecasting approach for the Guanzhong Plain. In this study, an interpretable data-driven framework was developed to forecast wheat and maize yields by combining remote sensing (solar-induced chlorophyll fluorescence, SIF; spectral indices, SIs) and biophysical knowledge (APSIM outputs and extreme climatic events) data. A Bayesian integration model (BIM) was trained on high-quality synthetic datasets (obtained by the synthetic minority oversampling technique for regression, SMOTER) to achieve accurate harvest-time yield forecasts at specific time windows. The results showed that the integration of multi-source data reduced the yield prediction error, with the overall normalized root mean square error (NRMSE) decreasing by 0.6 %–39.0 % compared to the single-source models. The data-driven model trained on the SMOTER -based synthetic dataset achieved the highest yield forecasting accuracy (wheat: NRMSE = 16.2 %; maize: NRMSE = 20.7 %). The SIF made the largest contribution to yield forecasts and showed strong interactions and synergies with other feature variables (e.g., aboveground biomass, drought, and low temperature stress), further enhancing model performance. Overall, the proposed data-driven framework demonstrates a promising way for improving grid-level yield forecasting and provides useful insights for the sustainable development of agricultural systems.
利用数据驱动的遥感和生物物理知识框架预测作物产量:以中国关中平原小麦和玉米为例
准确的早期产量预测对于实现效益最大化和确保关中平原的粮食安全至关重要。基于过程的作物模型经常受到不确定输入数据的约束,这限制了它们在区域网格级预测产量的能力(例如,1 km × 1 km)。统计模型忽略了作物生长发育的生物物理机制,其性能受到可用训练数据的数量和质量的限制。因此,迫切需要一种更全面、更可靠的关中平原小麦和玉米网格级产量预测方法。在这项研究中,开发了一个可解释的数据驱动框架,通过结合遥感(太阳诱导叶绿素荧光,SIF;光谱指数,si)和生物物理知识(APSIM输出和极端气候事件)数据来预测小麦和玉米产量。贝叶斯集成模型(BIM)在高质量的合成数据集(通过合成少数派过采样技术进行回归,SMOTER)上进行训练,以在特定的时间窗口实现准确的收获时间产量预测。结果表明,与单源模型相比,多源数据集成降低了产量预测误差,总体归一化均方根误差(NRMSE)降低0.6 % ~ 39.0 %。在SMOTER合成数据集上训练的数据驱动模型的产量预测精度最高(小麦:NRMSE = 16.2 %;玉米:NRMSE = 20.7 %)。SIF对产量预测的贡献最大,并与其他特征变量(如地上生物量、干旱和低温胁迫)表现出强烈的相互作用和协同作用,进一步提高了模型的性能。总体而言,所提出的数据驱动框架展示了一种改善电网级产量预测的有希望的方法,并为农业系统的可持续发展提供了有用的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
European Journal of Agronomy
European Journal of Agronomy 农林科学-农艺学
CiteScore
8.30
自引率
7.70%
发文量
187
审稿时长
4.5 months
期刊介绍: The European Journal of Agronomy, the official journal of the European Society for Agronomy, publishes original research papers reporting experimental and theoretical contributions to field-based agronomy and crop science. The journal will consider research at the field level for agricultural, horticultural and tree crops, that uses comprehensive and explanatory approaches. The EJA covers the following topics: crop physiology crop production and management including irrigation, fertilization and soil management agroclimatology and modelling plant-soil relationships crop quality and post-harvest physiology farming and cropping systems agroecosystems and the environment crop-weed interactions and management organic farming horticultural crops papers from the European Society for Agronomy bi-annual meetings In determining the suitability of submitted articles for publication, particular scrutiny is placed on the degree of novelty and significance of the research and the extent to which it adds to existing knowledge in agronomy.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书