Estimating soil mineral nitrogen from data-sparse field experiments using crop model-guided machine learning approach

Rishabh Gupta, Satya Krishna Pothapragada, Weihuang Xu, Prateek Kumar Goel, Miguel Barrera, Mira Saldanha, Joel Harley, Kelly T. Morgan, Alina Zare, Lincoln Zotarelli
{"title":"Estimating soil mineral nitrogen from data-sparse field experiments using crop model-guided machine learning approach","authors":"Rishabh Gupta, Satya Krishna Pothapragada, Weihuang Xu, Prateek Kumar Goel, Miguel Barrera, Mira Saldanha, Joel Harley, Kelly T. Morgan, Alina Zare, Lincoln Zotarelli","doi":"10.1101/2024.09.03.610387","DOIUrl":null,"url":null,"abstract":"Sandy soils are susceptible to excessive nitrogen (N) leaching under intensive crop production which is linked with the soil's low nutrient holding capacity and high-water infiltration rate. Estimating soil mineral nitrogen (SMN) at the daily time-step is crucial in providing fertilizer recommendations balancing plant nitrogen use efficiency (NUE) and N losses to the environment. Crop models [e.g., Decision Support System for Agrotechnology Transfer (DSSAT)] can simulate the trend of SMN in varied fertilizer rates and timing of application but are unable to replicate its magnitude due to the inability to capture high-water table conditions in a sub-irrigated soil. As an alternative to such physics-based model, time-series deep learning (DL) models based on a long short-term memory (LSTM) are promising in understanding nonlinearity among complex variables. Yet, purely data-driven DL models for crops are difficult to obtain due to the insufficient amount of data available and the excessive costs with producing more data. To address this challenge, a hybrid model (hybrid-LSTM) was developed by leveraging both the DSSAT andLSTM models to estimate daily SMN primarily using daily weather, applied fertilizer rates- timings, and the SMN sparse observations. This study used the observations from field trials conducted between 2010-2014 in Hastings, FL. The first step was to calibrate the DSSAT-SUBSTOR-Potato model to produce reliable SMN of the topsoil for treatments with varied N applied fertilizer rates split among the pre-planting, emergence, and tuber-initiation stages of the potato crop. Thereafter, the hybrid-LSTM model was trained on the calibrated DSSAT simulated SMN time-series and fine-tuned its predictions using the observed SMN to improve DSSAT simulated SMN. The hybrid-LSTM model was then tested on both calibrated and uncalibrated DSSAT SMN simulations where it outperformed the DSSAT model (range of improvement ranged ~18-30% on comparing the normalized root mean squared error) in providing reliable estimates of SMN across most of the farms and years. This novel hybrid modeling approach could guide stakeholders and farmers to build sustainable N management with improved crop NUE and yield and help in minimizing environmental losses.","PeriodicalId":501557,"journal":{"name":"bioRxiv - Physiology","volume":"99 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Physiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.03.610387","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Sandy soils are susceptible to excessive nitrogen (N) leaching under intensive crop production which is linked with the soil's low nutrient holding capacity and high-water infiltration rate. Estimating soil mineral nitrogen (SMN) at the daily time-step is crucial in providing fertilizer recommendations balancing plant nitrogen use efficiency (NUE) and N losses to the environment. Crop models [e.g., Decision Support System for Agrotechnology Transfer (DSSAT)] can simulate the trend of SMN in varied fertilizer rates and timing of application but are unable to replicate its magnitude due to the inability to capture high-water table conditions in a sub-irrigated soil. As an alternative to such physics-based model, time-series deep learning (DL) models based on a long short-term memory (LSTM) are promising in understanding nonlinearity among complex variables. Yet, purely data-driven DL models for crops are difficult to obtain due to the insufficient amount of data available and the excessive costs with producing more data. To address this challenge, a hybrid model (hybrid-LSTM) was developed by leveraging both the DSSAT andLSTM models to estimate daily SMN primarily using daily weather, applied fertilizer rates- timings, and the SMN sparse observations. This study used the observations from field trials conducted between 2010-2014 in Hastings, FL. The first step was to calibrate the DSSAT-SUBSTOR-Potato model to produce reliable SMN of the topsoil for treatments with varied N applied fertilizer rates split among the pre-planting, emergence, and tuber-initiation stages of the potato crop. Thereafter, the hybrid-LSTM model was trained on the calibrated DSSAT simulated SMN time-series and fine-tuned its predictions using the observed SMN to improve DSSAT simulated SMN. The hybrid-LSTM model was then tested on both calibrated and uncalibrated DSSAT SMN simulations where it outperformed the DSSAT model (range of improvement ranged ~18-30% on comparing the normalized root mean squared error) in providing reliable estimates of SMN across most of the farms and years. This novel hybrid modeling approach could guide stakeholders and farmers to build sustainable N management with improved crop NUE and yield and help in minimizing environmental losses.
利用作物模型指导的机器学习方法,从数据稀少的田间试验中估算土壤矿物氮
在密集型作物生产过程中,沙质土壤很容易发生过量的氮渗漏,这与土壤的低养分保持能力和高水分渗透率有关。按日时估算土壤矿物氮(SMN)对于提供平衡植物氮利用效率(NUE)和环境氮损失的肥料建议至关重要。作物模型(如农业技术转让决策支持系统 (DSSAT))可以模拟不同施肥量和施肥时间下的土壤矿物氮趋势,但由于无法捕捉次灌溉土壤中的高地下水位条件,因此无法复制其大小。作为这种基于物理的模型的替代方案,基于长短期记忆(LSTM)的时间序列深度学习(DL)模型在理解复杂变量之间的非线性方面大有可为。然而,由于可用数据量不足以及产生更多数据的成本过高,很难获得纯数据驱动的农作物 DL 模型。为了应对这一挑战,我们开发了一种混合模型(混合-LSTM),利用 DSSAT 和 LSTM 模型,主要通过每日天气、施肥量-时间和 SMN 稀疏观测数据来估算每日 SMN。本研究使用了 2010-2014 年期间在佛罗里达州黑斯廷斯进行的田间试验的观测数据。第一步是对 DSSAT-SUBSTOR-Potato 模型进行校准,以便在马铃薯作物的播种前、出苗和块茎始发阶段对施用不同氮肥率的处理产生可靠的表土 SMN。此后,混合 LSTM 模型在校准的 DSSAT 模拟 SMN 时间序列上进行训练,并利用观测到的 SMN 对其预测进行微调,以改进 DSSAT 模拟的 SMN。混合 LSTM 模型随后在经过校准和未经校准的 DSSAT SMN 模拟上进行了测试,结果表明该模型的性能优于 DSSAT 模型(比较归一化均方根误差,改进范围约为 18%-30%),可为大多数农场和年份提供可靠的 SMN 估计值。这种新颖的混合建模方法可指导利益相关者和农民建立可持续的氮管理,提高作物氮利用效率和产量,并有助于最大限度地减少环境损失。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信