Eliott Lumet , Mélanie C. Rochoux , Thomas Jaravel , Simon Lacroix
{"title":"Uncertainty-aware surrogate modeling for urban air pollutant dispersion prediction","authors":"Eliott Lumet , Mélanie C. Rochoux , Thomas Jaravel , Simon Lacroix","doi":"10.1016/j.buildenv.2024.112287","DOIUrl":null,"url":null,"abstract":"<div><div>This study evaluates a surrogate modeling approach that provides rapid ensemble predictions of air pollutant dispersion in urban environments for varying meteorological forcing, while estimating irreducible and modeling uncertainties. The POD–GPR approach combining Proper Orthogonal Decomposition (POD) and Gaussian Process Regression (GPR) is applied to emulate the response surface of a Large-Eddy Simulation (LES) model of the Mock Urban Setting Test (MUST) field-scale experiment. We design and validate new methods for (i) selecting the POD-latent space dimension to avoid overfitting noisy structures due to atmospheric internal variability, and (ii) estimating the uncertainty in POD–GPR predictions. To train and validate the POD–GPR surrogate in an offline phase, we build a large dataset of 200 LES 3-D time-averaged concentration fields, which are subject to substantial spatial variability from near-source to background concentration and have a very large dimension of several million grid cells. The results show that POD–GPR reaches the best achievable accuracy levels, except for the highest concentration near the source, while predicting full fields at a computational cost five orders of magnitude lower than an LES. The results also show that the proposed mode selection criterion avoids perturbing the surrogate response surface, and that the uncertainty estimate explains a large part of the surrogate error and is spatially consistent with the observed internal variability. Finally, POD–GPR can be robustly trained with much smaller datasets, paving the way for application to realistic urban configurations.</div></div>","PeriodicalId":9273,"journal":{"name":"Building and Environment","volume":"267 ","pages":"Article 112287"},"PeriodicalIF":7.1000,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building and Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360132324011296","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
This study evaluates a surrogate modeling approach that provides rapid ensemble predictions of air pollutant dispersion in urban environments for varying meteorological forcing, while estimating irreducible and modeling uncertainties. The POD–GPR approach combining Proper Orthogonal Decomposition (POD) and Gaussian Process Regression (GPR) is applied to emulate the response surface of a Large-Eddy Simulation (LES) model of the Mock Urban Setting Test (MUST) field-scale experiment. We design and validate new methods for (i) selecting the POD-latent space dimension to avoid overfitting noisy structures due to atmospheric internal variability, and (ii) estimating the uncertainty in POD–GPR predictions. To train and validate the POD–GPR surrogate in an offline phase, we build a large dataset of 200 LES 3-D time-averaged concentration fields, which are subject to substantial spatial variability from near-source to background concentration and have a very large dimension of several million grid cells. The results show that POD–GPR reaches the best achievable accuracy levels, except for the highest concentration near the source, while predicting full fields at a computational cost five orders of magnitude lower than an LES. The results also show that the proposed mode selection criterion avoids perturbing the surrogate response surface, and that the uncertainty estimate explains a large part of the surrogate error and is spatially consistent with the observed internal variability. Finally, POD–GPR can be robustly trained with much smaller datasets, paving the way for application to realistic urban configurations.
期刊介绍:
Building and Environment, an international journal, is dedicated to publishing original research papers, comprehensive review articles, editorials, and short communications in the fields of building science, urban physics, and human interaction with the indoor and outdoor built environment. The journal emphasizes innovative technologies and knowledge verified through measurement and analysis. It covers environmental performance across various spatial scales, from cities and communities to buildings and systems, fostering collaborative, multi-disciplinary research with broader significance.