Elissar Al Aawar, Sofien Resifi, Hatem Jebari, Ibrahim Hoteit
{"title":"Bayesian Source Identification With Dual Hierarchical Neural Networks for Urban Air Pollution","authors":"Elissar Al Aawar, Sofien Resifi, Hatem Jebari, Ibrahim Hoteit","doi":"10.1029/2024MS004790","DOIUrl":null,"url":null,"abstract":"<p>Identifying urban air pollution sources is essential for public health and environmental sustainability. In this study, we propose a novel hierarchical method for urban air pollution source identification, leveraging deep learning (DL) within an efficient Bayesian inference framework. We rely on observations in the form of two-dimensional (2D) pollutant concentration distributions, and adopt the Wasserstein <span></span><math>\n <semantics>\n <mrow>\n <mfenced>\n <msub>\n <mi>W</mi>\n <mn>2</mn>\n </msub>\n </mfenced>\n </mrow>\n <annotation> $\\left({W}_{2}\\right)$</annotation>\n </semantics></math> distance to model the likelihood probability distribution. The hierarchical nature of the framework stems from the integration of two neural networks (NNs). The first one acts as an emulator that replicates the physical dispersion model to predict future pollution observations recursively over a defined timeframe. These predictions are then used as inputs for the second NN that approximates the <span></span><math>\n <semantics>\n <mrow>\n <msub>\n <mi>W</mi>\n <mn>2</mn>\n </msub>\n </mrow>\n <annotation> ${W}_{2}$</annotation>\n </semantics></math> distance between predicted and observed pollutant concentration distributions to rapidly compute the likelihood probability. The approach adopts a multi-model strategy to mitigate the accumulation of errors, particularly those arising from the recursive prediction steps across multiple time intervals, ensuring the reliability of predictions over extended periods. The proposed framework is implemented on graphics processing units (GPUs), enabling scalable computations for real-world applications and rapid decision making. Through extensive numerical experiments, we demonstrate the suggested method's effectiveness in accurately estimating pollution source parameters, including location, emission rate, and duration, using synthetic observational data. Sensitivity analyses further explore the impact of observational horizons and sampling on solution convergence and accuracy. Numerical results demonstrate robust performances and computational efficiency compared to the conventional approach, particularly in scenarios with limited computational resources and observations availability.</p>","PeriodicalId":14881,"journal":{"name":"Journal of Advances in Modeling Earth Systems","volume":"17 4","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2024MS004790","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advances in Modeling Earth Systems","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1029/2024MS004790","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Identifying urban air pollution sources is essential for public health and environmental sustainability. In this study, we propose a novel hierarchical method for urban air pollution source identification, leveraging deep learning (DL) within an efficient Bayesian inference framework. We rely on observations in the form of two-dimensional (2D) pollutant concentration distributions, and adopt the Wasserstein distance to model the likelihood probability distribution. The hierarchical nature of the framework stems from the integration of two neural networks (NNs). The first one acts as an emulator that replicates the physical dispersion model to predict future pollution observations recursively over a defined timeframe. These predictions are then used as inputs for the second NN that approximates the distance between predicted and observed pollutant concentration distributions to rapidly compute the likelihood probability. The approach adopts a multi-model strategy to mitigate the accumulation of errors, particularly those arising from the recursive prediction steps across multiple time intervals, ensuring the reliability of predictions over extended periods. The proposed framework is implemented on graphics processing units (GPUs), enabling scalable computations for real-world applications and rapid decision making. Through extensive numerical experiments, we demonstrate the suggested method's effectiveness in accurately estimating pollution source parameters, including location, emission rate, and duration, using synthetic observational data. Sensitivity analyses further explore the impact of observational horizons and sampling on solution convergence and accuracy. Numerical results demonstrate robust performances and computational efficiency compared to the conventional approach, particularly in scenarios with limited computational resources and observations availability.
期刊介绍:
The Journal of Advances in Modeling Earth Systems (JAMES) is committed to advancing the science of Earth systems modeling by offering high-quality scientific research through online availability and open access licensing. JAMES invites authors and readers from the international Earth systems modeling community.
Open access. Articles are available free of charge for everyone with Internet access to view and download.
Formal peer review.
Supplemental material, such as code samples, images, and visualizations, is published at no additional charge.
No additional charge for color figures.
Modest page charges to cover production costs.
Articles published in high-quality full text PDF, HTML, and XML.
Internal and external reference linking, DOI registration, and forward linking via CrossRef.