Knowledge-guided machine learning captures key mechanistic pathways for better predicting spatio-temporal patterns of growing season N2O emissions in the U.S. Midwest
Lexuan Ye, Licheng Liu, Yufeng Yang, Ziyi Li, Wang Zhou, Bin Peng, Shaoming Xu, Vipin Kumar, Wendy H. Yang, Jinyun Tang, Zhenong Jin, Kaiyu Guan
{"title":"Knowledge-guided machine learning captures key mechanistic pathways for better predicting spatio-temporal patterns of growing season N2O emissions in the U.S. Midwest","authors":"Lexuan Ye, Licheng Liu, Yufeng Yang, Ziyi Li, Wang Zhou, Bin Peng, Shaoming Xu, Vipin Kumar, Wendy H. Yang, Jinyun Tang, Zhenong Jin, Kaiyu Guan","doi":"10.1016/j.agrformet.2025.110750","DOIUrl":null,"url":null,"abstract":"Accurately predicting agricultural N<sub>2</sub>O emission hot moments has long been a key focus of N<sub>2</sub>O models, which is challenging given the complex mechanisms involved and the high spatio-temporal heterogeneity of controlling factors. In this study, we improve a knowledge-guided machine learning model for agricultural N<sub>2</sub>O flux prediction (KGML-ag-N<sub>2</sub>O) by (i) incorporating a physical module to explicitly represent fertilization, (ii) pre-training KGML-ag-N<sub>2</sub>O with synthetic data from a process-based (PB) model <em>ecosys</em> under diverse fertilization strategies and environmental conditions, and (iii) fine-tuning KGML-ag-N<sub>2</sub>O with field observations of daily N<sub>2</sub>O flux and key controlling factors from 29 sites across the U.S. Midwest. We then assess whether it can outperform the PB model and the pure machine learning model over different spatio-temporal scales. Through integrating knowledge from both the PB model and field observations, KGML-ag-N<sub>2</sub>O shows the best performance in capturing site-level daily N<sub>2</sub>O emission hot moments (<em>r</em> = 0.63, RMSE = 3.765 mgN m<sup>-2</sup> d<sup>-1</sup>), mainly due to the strengthened triggering effect of soil water content increase on N<sub>2</sub>O emissions in KGML-ag-N<sub>2</sub>O. The improved causality representations between key controlling factors and N<sub>2</sub>O emissions further lead to KGML-ag-N<sub>2</sub>O outperformance in capturing N<sub>2</sub>O emission patterns over larger spatial (e.g., regional) and temporal (e.g., inter-monthly) scales than both conventional approaches. By validating and interpreting the improvements in KGML-ag-N<sub>2</sub>O performance, our study illustrates its potential to quantify agricultural N<sub>2</sub>O emissions in the U.S. Midwest, and provides important insights into identifying and reducing PB model structure uncertainties using KGML techniques.","PeriodicalId":50839,"journal":{"name":"Agricultural and Forest Meteorology","volume":"33 1","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Agricultural and Forest Meteorology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1016/j.agrformet.2025.110750","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately predicting agricultural N2O emission hot moments has long been a key focus of N2O models, which is challenging given the complex mechanisms involved and the high spatio-temporal heterogeneity of controlling factors. In this study, we improve a knowledge-guided machine learning model for agricultural N2O flux prediction (KGML-ag-N2O) by (i) incorporating a physical module to explicitly represent fertilization, (ii) pre-training KGML-ag-N2O with synthetic data from a process-based (PB) model ecosys under diverse fertilization strategies and environmental conditions, and (iii) fine-tuning KGML-ag-N2O with field observations of daily N2O flux and key controlling factors from 29 sites across the U.S. Midwest. We then assess whether it can outperform the PB model and the pure machine learning model over different spatio-temporal scales. Through integrating knowledge from both the PB model and field observations, KGML-ag-N2O shows the best performance in capturing site-level daily N2O emission hot moments (r = 0.63, RMSE = 3.765 mgN m-2 d-1), mainly due to the strengthened triggering effect of soil water content increase on N2O emissions in KGML-ag-N2O. The improved causality representations between key controlling factors and N2O emissions further lead to KGML-ag-N2O outperformance in capturing N2O emission patterns over larger spatial (e.g., regional) and temporal (e.g., inter-monthly) scales than both conventional approaches. By validating and interpreting the improvements in KGML-ag-N2O performance, our study illustrates its potential to quantify agricultural N2O emissions in the U.S. Midwest, and provides important insights into identifying and reducing PB model structure uncertainties using KGML techniques.
期刊介绍:
Agricultural and Forest Meteorology is an international journal for the publication of original articles and reviews on the inter-relationship between meteorology, agriculture, forestry, and natural ecosystems. Emphasis is on basic and applied scientific research relevant to practical problems in the field of plant and soil sciences, ecology and biogeochemistry as affected by weather as well as climate variability and change. Theoretical models should be tested against experimental data. Articles must appeal to an international audience. Special issues devoted to single topics are also published.
Typical topics include canopy micrometeorology (e.g. canopy radiation transfer, turbulence near the ground, evapotranspiration, energy balance, fluxes of trace gases), micrometeorological instrumentation (e.g., sensors for trace gases, flux measurement instruments, radiation measurement techniques), aerobiology (e.g. the dispersion of pollen, spores, insects and pesticides), biometeorology (e.g. the effect of weather and climate on plant distribution, crop yield, water-use efficiency, and plant phenology), forest-fire/weather interactions, and feedbacks from vegetation to weather and the climate system.