基于多源遥感数据的冬小麦产量估算：双支路TCN-Transformer模型及生育期特征转换机制分析

IF 8.9 1区农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY

Computers and Electronics in Agriculture Pub Date : 2025-09-19 DOI:10.1016/j.compag.2025.111014

Lei Zhang , Changchun Li , Guangsheng Zhang , Xifang Wu , Longfei Zhou , Lulu Chen , Yinghua Jiao , Guodong Liu , Wenyan Hei

{"title":"基于多源遥感数据的冬小麦产量估算：双支路TCN-Transformer模型及生育期特征转换机制分析","authors":"Lei Zhang , Changchun Li , Guangsheng Zhang , Xifang Wu , Longfei Zhou , Lulu Chen , Yinghua Jiao , Guodong Liu , Wenyan Hei","doi":"10.1016/j.compag.2025.111014","DOIUrl":null,"url":null,"abstract":"<div><div>Timely and accurate acquisition of winter wheat yield information is crucial for ensuring food security and formulating agricultural policies. Although deep learning methods have become increasingly prominent in crop yield estimation, they often face challenges in simultaneously capturing both fine-grained local patterns and long-term temporal dependencies in time series data. By utilizing EVI, LAI, and fraction of photosynthetically active radiation (FPAR) from MODIS, along with temperature (TEM) and precipitation (PRE) data from ERA5-Land, we propose a novel dual-branch hybrid model named TCN–Transformer (TCT), which synergistically integrates temporal convolutional network (TCN) and transformer architectures to concurrently capture both localized temporal patterns and long-term dependencies. Bayesian optimization was employed for automated hyperparameter tuning, enabling accurate estimation of winter wheat yield under diverse agricultural management conditions. The experimental results demonstrate that optimal performance is achieved by the proposed TCT model in terms of estimating the county-level winter wheat yields across North China on the test set (R2 = 0.80, RMSE = 645.75 kg/ha). It significantly outperforms the individual temporal models (the TCN, LSTM, and transformer) and other comparative models, including traditional machine learning methods (Ridge, RF, LightGBM, and XGBoost) and an advanced hybrid model (CNN-BiLSTM). Specifically, compared with the individual models, the TCT improved R2 by 0.03 to 0.1 and reduced the RMSE by 29.33 to 156.07 kg/ha. It also outperforms CNN-BiLSTM (R2 = 0.78, RMSE = 668.23 kg/ha), achieving lower errors and more robust bias control. To elucidate the decision-making mechanism of the model, the Shapley additive explanations (SHAP) method was employed to analyze the feature importance values across the study region and the temporal feature weights at 8-day intervals. The results reveal that the EVI is the most representative feature, with the model accurately identifying critical growth stages from T20 (February 26) to T28 (May 1), corresponding to the greening to milk phases, respectively. The feature contribution dynamics were further visualized, revealing a transition from FPAR dominance during early greening (T20–T22) to EVI dominance during jointing (T23–T25), EVI‒PRE interactions during heading-milk (T26–T29), and finally LAI‒PRE dominance at maturity (T30–T32). Furthermore, the one-year leave-one-out cross-validation confirms the robustness of the TCT model, the simulation of yield spatial distribution for unseen years is consistent with the official yield data. Additionally, the proposed interpretability framework not only performed excellently in this study but also demonstrated strong generalizability and flexibility, indicating its broad application potential in other crop types and agricultural domains.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"239 ","pages":"Article 111014"},"PeriodicalIF":8.9000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Winter wheat yield estimation based on multisource remote sensing data: A dual-branch TCN-Transformer model and analysis of growth-stage feature transition mechanisms\",\"authors\":\"Lei Zhang , Changchun Li , Guangsheng Zhang , Xifang Wu , Longfei Zhou , Lulu Chen , Yinghua Jiao , Guodong Liu , Wenyan Hei\",\"doi\":\"10.1016/j.compag.2025.111014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Timely and accurate acquisition of winter wheat yield information is crucial for ensuring food security and formulating agricultural policies. Although deep learning methods have become increasingly prominent in crop yield estimation, they often face challenges in simultaneously capturing both fine-grained local patterns and long-term temporal dependencies in time series data. By utilizing EVI, LAI, and fraction of photosynthetically active radiation (FPAR) from MODIS, along with temperature (TEM) and precipitation (PRE) data from ERA5-Land, we propose a novel dual-branch hybrid model named TCN–Transformer (TCT), which synergistically integrates temporal convolutional network (TCN) and transformer architectures to concurrently capture both localized temporal patterns and long-term dependencies. Bayesian optimization was employed for automated hyperparameter tuning, enabling accurate estimation of winter wheat yield under diverse agricultural management conditions. The experimental results demonstrate that optimal performance is achieved by the proposed TCT model in terms of estimating the county-level winter wheat yields across North China on the test set (R2 = 0.80, RMSE = 645.75 kg/ha). It significantly outperforms the individual temporal models (the TCN, LSTM, and transformer) and other comparative models, including traditional machine learning methods (Ridge, RF, LightGBM, and XGBoost) and an advanced hybrid model (CNN-BiLSTM). Specifically, compared with the individual models, the TCT improved R2 by 0.03 to 0.1 and reduced the RMSE by 29.33 to 156.07 kg/ha. It also outperforms CNN-BiLSTM (R2 = 0.78, RMSE = 668.23 kg/ha), achieving lower errors and more robust bias control. To elucidate the decision-making mechanism of the model, the Shapley additive explanations (SHAP) method was employed to analyze the feature importance values across the study region and the temporal feature weights at 8-day intervals. The results reveal that the EVI is the most representative feature, with the model accurately identifying critical growth stages from T20 (February 26) to T28 (May 1), corresponding to the greening to milk phases, respectively. The feature contribution dynamics were further visualized, revealing a transition from FPAR dominance during early greening (T20–T22) to EVI dominance during jointing (T23–T25), EVI‒PRE interactions during heading-milk (T26–T29), and finally LAI‒PRE dominance at maturity (T30–T32). Furthermore, the one-year leave-one-out cross-validation confirms the robustness of the TCT model, the simulation of yield spatial distribution for unseen years is consistent with the official yield data. Additionally, the proposed interpretability framework not only performed excellently in this study but also demonstrated strong generalizability and flexibility, indicating its broad application potential in other crop types and agricultural domains.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"239 \",\"pages\":\"Article 111014\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925011202\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925011202","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

及时、准确地获取冬小麦产量信息对确保粮食安全和制定农业政策至关重要。尽管深度学习方法在作物产量估计中越来越突出，但它们在同时捕获时间序列数据中的细粒度局部模式和长期时间依赖性方面经常面临挑战。通过利用MODIS的EVI、LAI和光合有效辐射（FPAR）分数，以及ERA5-Land的温度（TEM）和降水（PRE）数据，我们提出了一种新的双分支混合模型TCN - transformer (TCT)，该模型协同集成了时间卷积网络（TCN）和变压器架构，以同时捕获局部时间模式和长期依赖关系。采用贝叶斯优化进行自动超参数整定，实现了对不同农业经营条件下冬小麦产量的准确估计。试验结果表明，TCT模型在华北地区县域冬小麦产量估算中具有最优性能（R2 = 0.80, RMSE = 645.75 kg/ha）。它明显优于单个时间模型（TCN， LSTM和transformer）和其他比较模型，包括传统的机器学习方法（Ridge， RF， LightGBM和XGBoost）和先进的混合模型（CNN-BiLSTM）。具体而言，与单个模型相比，TCT将R2提高了0.03至0.1，将RMSE降低了29.33至156.07 kg/ha。它也优于CNN-BiLSTM (R2 = 0.78, RMSE = 668.23 kg/ha)，实现了更低的误差和更稳健的偏差控制。为了阐明模型的决策机制，采用Shapley加性解释（SHAP）方法分析研究区域的特征重要性值和时间间隔为8 d的特征权重。结果表明，EVI是最具代表性的特征，该模型准确地识别了T20（2月26日）至T28（5月1日）的关键生长阶段，分别对应于青乳期。进一步可视化特征贡献的动态，揭示了从早绿期（T20-T22）的FPAR优势到拔节期（T23-T25）的EVI优势，抽头-乳汁期（T26-T29）的EVI - pre相互作用，最后到成熟期（T30-T32）的lei - pre优势的转变。一年期留一交叉验证验证了TCT模型的稳健性，对未见年份产量空间分布的模拟与官方产量数据一致。此外，本文提出的可解释性框架不仅在本研究中表现优异，而且具有很强的通用性和灵活性，在其他作物类型和农业领域具有广泛的应用潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Winter wheat yield estimation based on multisource remote sensing data: A dual-branch TCN-Transformer model and analysis of growth-stage feature transition mechanisms

Timely and accurate acquisition of winter wheat yield information is crucial for ensuring food security and formulating agricultural policies. Although deep learning methods have become increasingly prominent in crop yield estimation, they often face challenges in simultaneously capturing both fine-grained local patterns and long-term temporal dependencies in time series data. By utilizing EVI, LAI, and fraction of photosynthetically active radiation (FPAR) from MODIS, along with temperature (TEM) and precipitation (PRE) data from ERA5-Land, we propose a novel dual-branch hybrid model named TCN–Transformer (TCT), which synergistically integrates temporal convolutional network (TCN) and transformer architectures to concurrently capture both localized temporal patterns and long-term dependencies. Bayesian optimization was employed for automated hyperparameter tuning, enabling accurate estimation of winter wheat yield under diverse agricultural management conditions. The experimental results demonstrate that optimal performance is achieved by the proposed TCT model in terms of estimating the county-level winter wheat yields across North China on the test set (R2 = 0.80, RMSE = 645.75 kg/ha). It significantly outperforms the individual temporal models (the TCN, LSTM, and transformer) and other comparative models, including traditional machine learning methods (Ridge, RF, LightGBM, and XGBoost) and an advanced hybrid model (CNN-BiLSTM). Specifically, compared with the individual models, the TCT improved R2 by 0.03 to 0.1 and reduced the RMSE by 29.33 to 156.07 kg/ha. It also outperforms CNN-BiLSTM (R2 = 0.78, RMSE = 668.23 kg/ha), achieving lower errors and more robust bias control. To elucidate the decision-making mechanism of the model, the Shapley additive explanations (SHAP) method was employed to analyze the feature importance values across the study region and the temporal feature weights at 8-day intervals. The results reveal that the EVI is the most representative feature, with the model accurately identifying critical growth stages from T20 (February 26) to T28 (May 1), corresponding to the greening to milk phases, respectively. The feature contribution dynamics were further visualized, revealing a transition from FPAR dominance during early greening (T20–T22) to EVI dominance during jointing (T23–T25), EVI‒PRE interactions during heading-milk (T26–T29), and finally LAI‒PRE dominance at maturity (T30–T32). Furthermore, the one-year leave-one-out cross-validation confirms the robustness of the TCT model, the simulation of yield spatial distribution for unseen years is consistent with the official yield data. Additionally, the proposed interpretability framework not only performed excellently in this study but also demonstrated strong generalizability and flexibility, indicating its broad application potential in other crop types and agricultural domains.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computers and Electronics in Agriculture 工程技术-计算机：跨学科应用

CiteScore

15.30

自引率

14.50%

发文量

800

审稿时长

62 days

期刊介绍： Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.