Modeling Upscaled Mass Discharge From Complex DNAPL Source Zones Using a Bayesian Neural Network: Prediction Accuracy, Uncertainty Quantification and Source Zone Feature Importance

IF 4.6 1区 地球科学 Q2 ENVIRONMENTAL SCIENCES
Xueyuan Kang, Amalia Kokkinaki, Xiaoqing Shi, Jonghyun Lee, Zhilin Guo, Lingling Ni, Jichun Wu
{"title":"Modeling Upscaled Mass Discharge From Complex DNAPL Source Zones Using a Bayesian Neural Network: Prediction Accuracy, Uncertainty Quantification and Source Zone Feature Importance","authors":"Xueyuan Kang, Amalia Kokkinaki, Xiaoqing Shi, Jonghyun Lee, Zhilin Guo, Lingling Ni, Jichun Wu","doi":"10.1029/2023wr036864","DOIUrl":null,"url":null,"abstract":"The mass discharge emanating from dense non-aqueous phase liquid (DNAPL) source zones (SZs) is often used as a key metric for risk assessment. To predict the temporal evolution of mass discharge, upscaled models have been developed to approximate the relationship between the depletion of SZ and the mass discharge. A significant challenge stems from the choice of the SZ parameterization, so that a limited number of domain-averaged SZ metrics can suffice as an input and accurately predict the complex mass-discharge behavior. Moreover, existing deterministic upscaled models cannot quantify prediction uncertainty stemming from modeling parameterization. To address these challenges, we propose a method based on a Bayesian Neural Network (BNN) which learns the non-linear relationship between SZ metrics and mass discharge from multiphase-modeling training data. The proposed BNN-based upscaled model allows uncertainty quantification since it treats trainable parameters as distributions, and does not require a manual parameterization of the SZ a-priori. Instead, the BNN model chooses three physically meaningful SZ quantities related to mass discharge as input features. Then, we use the expected gradients method to identify the feature importance for mass-discharge prediction. We evaluated the proposed model on laboratory-scale DNAPL dissolution experiments. The results show that the BNN model accurately reproduces the multistage mass-discharge profiles with fewer parameters than existing upscaled models. Feature importance analysis shows that all chosen features are important and sufficient to reproduce complex mass discharge. This model provides accurate mass-discharge predictions and uncertainty estimation, therefore holds a great potential for probabilistic risk assessments and decision-making.","PeriodicalId":23799,"journal":{"name":"Water Resources Research","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1029/2023wr036864","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

The mass discharge emanating from dense non-aqueous phase liquid (DNAPL) source zones (SZs) is often used as a key metric for risk assessment. To predict the temporal evolution of mass discharge, upscaled models have been developed to approximate the relationship between the depletion of SZ and the mass discharge. A significant challenge stems from the choice of the SZ parameterization, so that a limited number of domain-averaged SZ metrics can suffice as an input and accurately predict the complex mass-discharge behavior. Moreover, existing deterministic upscaled models cannot quantify prediction uncertainty stemming from modeling parameterization. To address these challenges, we propose a method based on a Bayesian Neural Network (BNN) which learns the non-linear relationship between SZ metrics and mass discharge from multiphase-modeling training data. The proposed BNN-based upscaled model allows uncertainty quantification since it treats trainable parameters as distributions, and does not require a manual parameterization of the SZ a-priori. Instead, the BNN model chooses three physically meaningful SZ quantities related to mass discharge as input features. Then, we use the expected gradients method to identify the feature importance for mass-discharge prediction. We evaluated the proposed model on laboratory-scale DNAPL dissolution experiments. The results show that the BNN model accurately reproduces the multistage mass-discharge profiles with fewer parameters than existing upscaled models. Feature importance analysis shows that all chosen features are important and sufficient to reproduce complex mass discharge. This model provides accurate mass-discharge predictions and uncertainty estimation, therefore holds a great potential for probabilistic risk assessments and decision-making.
使用贝叶斯神经网络模拟复杂 DNAPL 源区的上标质量排放:预测精度、不确定性量化和源区特征重要性
高浓度非水相液体(DNAPL)源区(SZ)的大量排放通常被用作风险评估的关键指标。为了预测大规模排放的时间演变,人们开发了放大模型来近似计算 SZ 的耗竭与大规模排放之间的关系。SZ 参数化的选择是一个重大挑战,只有数量有限的域平均 SZ 指标才足以作为输入并准确预测复杂的质量排放行为。此外,现有的确定性放大模型无法量化建模参数化带来的预测不确定性。为应对这些挑战,我们提出了一种基于贝叶斯神经网络(BNN)的方法,该方法可从多阶段建模训练数据中学习 SZ 指标与质量放电之间的非线性关系。所提出的基于贝叶斯神经网络的放大模型可以量化不确定性,因为它将可训练的参数视为分布,不需要事先对 SZ 进行手动参数化。取而代之的是,BNN 模型选择与质量排放相关的三个具有物理意义的 SZ 量作为输入特征。然后,我们使用期望梯度法来确定特征对质量排放预测的重要性。我们在实验室规模的 DNAPL 溶解实验中对所提出的模型进行了评估。结果表明,与现有的放大模型相比,BNN 模型以更少的参数准确地再现了多级质量排放曲线。特征重要性分析表明,所选的所有特征都很重要,足以再现复杂的质量排放。该模型提供了精确的质量排放预测和不确定性估计,因此在概率风险评估和决策方面具有巨大潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Water Resources Research
Water Resources Research 环境科学-湖沼学
CiteScore
8.80
自引率
13.00%
发文量
599
审稿时长
3.5 months
期刊介绍: Water Resources Research (WRR) is an interdisciplinary journal that focuses on hydrology and water resources. It publishes original research in the natural and social sciences of water. It emphasizes the role of water in the Earth system, including physical, chemical, biological, and ecological processes in water resources research and management, including social, policy, and public health implications. It encompasses observational, experimental, theoretical, analytical, numerical, and data-driven approaches that advance the science of water and its management. Submissions are evaluated for their novelty, accuracy, significance, and broader implications of the findings.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信