Deborah V. Gaso , Laura Elena Cue La Rosa , Laila A. Puntel , Juan I. Rattalino Edreira , Allard de Wit , Lammert Kooistra
{"title":"Predicting spatial and temporal variability in soybean yield using deep learning and open source data","authors":"Deborah V. Gaso , Laura Elena Cue La Rosa , Laila A. Puntel , Juan I. Rattalino Edreira , Allard de Wit , Lammert Kooistra","doi":"10.1016/j.eja.2024.127498","DOIUrl":null,"url":null,"abstract":"<div><div>Spatial crop yield prediction provides valuable insights for supporting sustainable and precise crop management decisions. This study assessed the capabilities of advanced Deep Learning (DL) architectures in predicting within-field soybean yields using spectral bands from Sentinel-2 (RS-Inputs), weather (W-Inputs), and topographic attributes (TA-Inputs). DL architectures included 1-D convolutional neural network (1D-CNN), long short-term memory (LSTM) and transformer-based (TRFM). We used an extensive dataset ( ∼ 700 K) of yield observations, collected with a combine harvester, from 310 fields across three growing seasons (2020, 2021 and 2022) in Uruguay. The DL architectures were assessed under two testing strategies: across-fields and across-years. We compared results from DL architectures against a baseline that uses a process-based method with data assimilation. Our results revealed that DL architectures outperformed the baseline in testing across-fields only (RRMSE 35 % vs 40 %). The DL architectures encountered more challenges with temporal extrapolation (RRMSE 51 % in across-years). There were no substantial differences in performance among the DL architectures. The TA-Inputs enhanced accuracy in 1D-CNN (reduced RRMSE by ∼ 13 %), while W-Inputs led to a small improvement in 1D-CNN and LSTM (reduced RRMSE by ∼ 2 %) when tested across-years. All combinations of DL architectures and input settings encountered challenges in predicting the tails of the yield distribution (mean bias ∼ 1000 kg ha<sup>−1</sup>). We discussed the current limitations of DL architectures in capturing crop yield complexity using openly available spatial data and provided further directions for improving the reliability and interpretability of data-driven models by integrating process-based approaches. Beyond this, the performance of data-driven methods alone is expected to improve with the increasing availability of collected and stored data. Incorporating historical yield maps and in-season crop management data into open-source datasets will facilitate continuous training and enhancement for tailored models.</div></div>","PeriodicalId":51045,"journal":{"name":"European Journal of Agronomy","volume":"164 ","pages":"Article 127498"},"PeriodicalIF":4.5000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Agronomy","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1161030124004192","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
Abstract
Spatial crop yield prediction provides valuable insights for supporting sustainable and precise crop management decisions. This study assessed the capabilities of advanced Deep Learning (DL) architectures in predicting within-field soybean yields using spectral bands from Sentinel-2 (RS-Inputs), weather (W-Inputs), and topographic attributes (TA-Inputs). DL architectures included 1-D convolutional neural network (1D-CNN), long short-term memory (LSTM) and transformer-based (TRFM). We used an extensive dataset ( ∼ 700 K) of yield observations, collected with a combine harvester, from 310 fields across three growing seasons (2020, 2021 and 2022) in Uruguay. The DL architectures were assessed under two testing strategies: across-fields and across-years. We compared results from DL architectures against a baseline that uses a process-based method with data assimilation. Our results revealed that DL architectures outperformed the baseline in testing across-fields only (RRMSE 35 % vs 40 %). The DL architectures encountered more challenges with temporal extrapolation (RRMSE 51 % in across-years). There were no substantial differences in performance among the DL architectures. The TA-Inputs enhanced accuracy in 1D-CNN (reduced RRMSE by ∼ 13 %), while W-Inputs led to a small improvement in 1D-CNN and LSTM (reduced RRMSE by ∼ 2 %) when tested across-years. All combinations of DL architectures and input settings encountered challenges in predicting the tails of the yield distribution (mean bias ∼ 1000 kg ha−1). We discussed the current limitations of DL architectures in capturing crop yield complexity using openly available spatial data and provided further directions for improving the reliability and interpretability of data-driven models by integrating process-based approaches. Beyond this, the performance of data-driven methods alone is expected to improve with the increasing availability of collected and stored data. Incorporating historical yield maps and in-season crop management data into open-source datasets will facilitate continuous training and enhancement for tailored models.
期刊介绍:
The European Journal of Agronomy, the official journal of the European Society for Agronomy, publishes original research papers reporting experimental and theoretical contributions to field-based agronomy and crop science. The journal will consider research at the field level for agricultural, horticultural and tree crops, that uses comprehensive and explanatory approaches. The EJA covers the following topics:
crop physiology
crop production and management including irrigation, fertilization and soil management
agroclimatology and modelling
plant-soil relationships
crop quality and post-harvest physiology
farming and cropping systems
agroecosystems and the environment
crop-weed interactions and management
organic farming
horticultural crops
papers from the European Society for Agronomy bi-annual meetings
In determining the suitability of submitted articles for publication, particular scrutiny is placed on the degree of novelty and significance of the research and the extent to which it adds to existing knowledge in agronomy.