H. W. Xie, D. Sujono, T. Ryder, E. B. Sudderth, S. D. Allison
{"title":"基于归一化流的土壤生物地球化学模型变分推断和数据同化框架","authors":"H. W. Xie, D. Sujono, T. Ryder, E. B. Sudderth, S. D. Allison","doi":"10.1029/2024MS004547","DOIUrl":null,"url":null,"abstract":"<p>Soil biogeochemical models (SBMs) represent soil variables and their responses to global change. Data assimilation approaches help determine whether SBMs accurately represent soil processes consistent with soil pool and flux measurements. Bayesian inference is commonly used in data assimilation procedures that estimate posterior parameter distributions with Markov chain Monte Carlo (MCMC) methods. The ability to account for data and parameter uncertainty is a strength of MCMC inference, but the computational inefficiency of MCMC methods remains a barrier to their wider application, especially with large data sets. Given the limitations of MCMC approaches, we developed an alternative variational inference framework that uses a method called <i>normalizing flows</i> from the field of machine learning. Normalizing flows rely on deep learning to map probability distributions and approximate SBMs that have been discretized into state space models. As a test of our method, we fit approximated SBMs to synthetic data sourced from known data-generating processes to identify discrepancies between the inference results and true parameter values. Our approach compares favorably with established MCMC methods and could be a viable alternative for SBM data assimilation that reduces computational time and resource needs. However, our method has some limitations, including challenges assimilating data with irregular measurement intervals, underestimation of posterior parameter uncertainty, and limited goodness-of-fit metrics for comparison to MCMC inference methods. Many of these limitations could be overcome with additional algorithm development based on the approaches we report here.</p>","PeriodicalId":14881,"journal":{"name":"Journal of Advances in Modeling Earth Systems","volume":"17 8","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2024MS004547","citationCount":"0","resultStr":"{\"title\":\"A Framework for Variational Inference and Data Assimilation of Soil Biogeochemical Models Using Normalizing Flows\",\"authors\":\"H. W. Xie, D. Sujono, T. Ryder, E. B. Sudderth, S. D. Allison\",\"doi\":\"10.1029/2024MS004547\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Soil biogeochemical models (SBMs) represent soil variables and their responses to global change. Data assimilation approaches help determine whether SBMs accurately represent soil processes consistent with soil pool and flux measurements. Bayesian inference is commonly used in data assimilation procedures that estimate posterior parameter distributions with Markov chain Monte Carlo (MCMC) methods. The ability to account for data and parameter uncertainty is a strength of MCMC inference, but the computational inefficiency of MCMC methods remains a barrier to their wider application, especially with large data sets. Given the limitations of MCMC approaches, we developed an alternative variational inference framework that uses a method called <i>normalizing flows</i> from the field of machine learning. Normalizing flows rely on deep learning to map probability distributions and approximate SBMs that have been discretized into state space models. As a test of our method, we fit approximated SBMs to synthetic data sourced from known data-generating processes to identify discrepancies between the inference results and true parameter values. Our approach compares favorably with established MCMC methods and could be a viable alternative for SBM data assimilation that reduces computational time and resource needs. However, our method has some limitations, including challenges assimilating data with irregular measurement intervals, underestimation of posterior parameter uncertainty, and limited goodness-of-fit metrics for comparison to MCMC inference methods. Many of these limitations could be overcome with additional algorithm development based on the approaches we report here.</p>\",\"PeriodicalId\":14881,\"journal\":{\"name\":\"Journal of Advances in Modeling Earth Systems\",\"volume\":\"17 8\",\"pages\":\"\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2024MS004547\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Advances in Modeling Earth Systems\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2024MS004547\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"METEOROLOGY & ATMOSPHERIC SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advances in Modeling Earth Systems","FirstCategoryId":"89","ListUrlMain":"https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2024MS004547","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
A Framework for Variational Inference and Data Assimilation of Soil Biogeochemical Models Using Normalizing Flows
Soil biogeochemical models (SBMs) represent soil variables and their responses to global change. Data assimilation approaches help determine whether SBMs accurately represent soil processes consistent with soil pool and flux measurements. Bayesian inference is commonly used in data assimilation procedures that estimate posterior parameter distributions with Markov chain Monte Carlo (MCMC) methods. The ability to account for data and parameter uncertainty is a strength of MCMC inference, but the computational inefficiency of MCMC methods remains a barrier to their wider application, especially with large data sets. Given the limitations of MCMC approaches, we developed an alternative variational inference framework that uses a method called normalizing flows from the field of machine learning. Normalizing flows rely on deep learning to map probability distributions and approximate SBMs that have been discretized into state space models. As a test of our method, we fit approximated SBMs to synthetic data sourced from known data-generating processes to identify discrepancies between the inference results and true parameter values. Our approach compares favorably with established MCMC methods and could be a viable alternative for SBM data assimilation that reduces computational time and resource needs. However, our method has some limitations, including challenges assimilating data with irregular measurement intervals, underestimation of posterior parameter uncertainty, and limited goodness-of-fit metrics for comparison to MCMC inference methods. Many of these limitations could be overcome with additional algorithm development based on the approaches we report here.
期刊介绍:
The Journal of Advances in Modeling Earth Systems (JAMES) is committed to advancing the science of Earth systems modeling by offering high-quality scientific research through online availability and open access licensing. JAMES invites authors and readers from the international Earth systems modeling community.
Open access. Articles are available free of charge for everyone with Internet access to view and download.
Formal peer review.
Supplemental material, such as code samples, images, and visualizations, is published at no additional charge.
No additional charge for color figures.
Modest page charges to cover production costs.
Articles published in high-quality full text PDF, HTML, and XML.
Internal and external reference linking, DOI registration, and forward linking via CrossRef.