Modeling Upscaled Mass Discharge From Complex DNAPL Source Zones Using a Bayesian Neural Network: Prediction Accuracy, Uncertainty Quantification and Source Zone Feature Importance
{"title":"Modeling Upscaled Mass Discharge From Complex DNAPL Source Zones Using a Bayesian Neural Network: Prediction Accuracy, Uncertainty Quantification and Source Zone Feature Importance","authors":"Xueyuan Kang, Amalia Kokkinaki, Xiaoqing Shi, Jonghyun Lee, Zhilin Guo, Lingling Ni, Jichun Wu","doi":"10.1029/2023wr036864","DOIUrl":null,"url":null,"abstract":"The mass discharge emanating from dense non-aqueous phase liquid (DNAPL) source zones (SZs) is often used as a key metric for risk assessment. To predict the temporal evolution of mass discharge, upscaled models have been developed to approximate the relationship between the depletion of SZ and the mass discharge. A significant challenge stems from the choice of the SZ parameterization, so that a limited number of domain-averaged SZ metrics can suffice as an input and accurately predict the complex mass-discharge behavior. Moreover, existing deterministic upscaled models cannot quantify prediction uncertainty stemming from modeling parameterization. To address these challenges, we propose a method based on a Bayesian Neural Network (BNN) which learns the non-linear relationship between SZ metrics and mass discharge from multiphase-modeling training data. The proposed BNN-based upscaled model allows uncertainty quantification since it treats trainable parameters as distributions, and does not require a manual parameterization of the SZ a-priori. Instead, the BNN model chooses three physically meaningful SZ quantities related to mass discharge as input features. Then, we use the expected gradients method to identify the feature importance for mass-discharge prediction. We evaluated the proposed model on laboratory-scale DNAPL dissolution experiments. The results show that the BNN model accurately reproduces the multistage mass-discharge profiles with fewer parameters than existing upscaled models. Feature importance analysis shows that all chosen features are important and sufficient to reproduce complex mass discharge. This model provides accurate mass-discharge predictions and uncertainty estimation, therefore holds a great potential for probabilistic risk assessments and decision-making.","PeriodicalId":23799,"journal":{"name":"Water Resources Research","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1029/2023wr036864","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The mass discharge emanating from dense non-aqueous phase liquid (DNAPL) source zones (SZs) is often used as a key metric for risk assessment. To predict the temporal evolution of mass discharge, upscaled models have been developed to approximate the relationship between the depletion of SZ and the mass discharge. A significant challenge stems from the choice of the SZ parameterization, so that a limited number of domain-averaged SZ metrics can suffice as an input and accurately predict the complex mass-discharge behavior. Moreover, existing deterministic upscaled models cannot quantify prediction uncertainty stemming from modeling parameterization. To address these challenges, we propose a method based on a Bayesian Neural Network (BNN) which learns the non-linear relationship between SZ metrics and mass discharge from multiphase-modeling training data. The proposed BNN-based upscaled model allows uncertainty quantification since it treats trainable parameters as distributions, and does not require a manual parameterization of the SZ a-priori. Instead, the BNN model chooses three physically meaningful SZ quantities related to mass discharge as input features. Then, we use the expected gradients method to identify the feature importance for mass-discharge prediction. We evaluated the proposed model on laboratory-scale DNAPL dissolution experiments. The results show that the BNN model accurately reproduces the multistage mass-discharge profiles with fewer parameters than existing upscaled models. Feature importance analysis shows that all chosen features are important and sufficient to reproduce complex mass discharge. This model provides accurate mass-discharge predictions and uncertainty estimation, therefore holds a great potential for probabilistic risk assessments and decision-making.
期刊介绍:
Water Resources Research (WRR) is an interdisciplinary journal that focuses on hydrology and water resources. It publishes original research in the natural and social sciences of water. It emphasizes the role of water in the Earth system, including physical, chemical, biological, and ecological processes in water resources research and management, including social, policy, and public health implications. It encompasses observational, experimental, theoretical, analytical, numerical, and data-driven approaches that advance the science of water and its management. Submissions are evaluated for their novelty, accuracy, significance, and broader implications of the findings.