{"title":"Clustering of emission–dispersion dynamics in a state space defined by pollution sources and meteorological variables","authors":"Yuval , Yoav Levi , Pavel Khain , David M. Broday","doi":"10.1016/j.scitotenv.2025.180051","DOIUrl":null,"url":null,"abstract":"<div><div>The complex relationship between air pollution sources, the meteorological conditions governing their dispersion, and the resulting pollutant concentrations is an interesting scientific topic with significant implications for air resource management. A widely adopted approach to exploring these interactions is the interpretation of statistical models which simulate them. Dispersion conditions can vary markedly—from cold, calm nights with stable atmosphere to stormy periods characterised by intense turbulence and convection. Deeper insights can be obtained by interpreting statistical models trained on data subsets that correspond to specific emission–dispersion conditions. To achieve such distinct subsets, we present a robust and reproducible methodology for partitioning the multidimensional state space, defined by pollution sources and meteorological variables, into a large number of clusters at each monitoring location. Our methodology ensures a systematic and objective analysis across multiple stations. We analyse four years of hourly data, integrating high-resolution meteorological outputs from a numerical weather prediction model with traffic volume data used as proxies for emissions or precursors of NO, NO<sub>2</sub>, NO<sub>x</sub>, PM<sub>2.5</sub>, and O<sub>3</sub> concentrations. These pollutants were observed at 85 air quality monitoring stations across Israel. The resulting clusters capture sub-daily temporal patterns that are indicative of distinct emission–dispersion scenarios in the region. We demonstrate that statistical models trained on these clustered subsets consistently outperform models trained on the full-period datasets. This highlights the value of our clustering approach in improving both predictive performance and scientific understanding of air pollution dispersion dynamics.</div></div>","PeriodicalId":422,"journal":{"name":"Science of the Total Environment","volume":"995 ","pages":"Article 180051"},"PeriodicalIF":8.2000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science of the Total Environment","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0048969725016912","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The complex relationship between air pollution sources, the meteorological conditions governing their dispersion, and the resulting pollutant concentrations is an interesting scientific topic with significant implications for air resource management. A widely adopted approach to exploring these interactions is the interpretation of statistical models which simulate them. Dispersion conditions can vary markedly—from cold, calm nights with stable atmosphere to stormy periods characterised by intense turbulence and convection. Deeper insights can be obtained by interpreting statistical models trained on data subsets that correspond to specific emission–dispersion conditions. To achieve such distinct subsets, we present a robust and reproducible methodology for partitioning the multidimensional state space, defined by pollution sources and meteorological variables, into a large number of clusters at each monitoring location. Our methodology ensures a systematic and objective analysis across multiple stations. We analyse four years of hourly data, integrating high-resolution meteorological outputs from a numerical weather prediction model with traffic volume data used as proxies for emissions or precursors of NO, NO2, NOx, PM2.5, and O3 concentrations. These pollutants were observed at 85 air quality monitoring stations across Israel. The resulting clusters capture sub-daily temporal patterns that are indicative of distinct emission–dispersion scenarios in the region. We demonstrate that statistical models trained on these clustered subsets consistently outperform models trained on the full-period datasets. This highlights the value of our clustering approach in improving both predictive performance and scientific understanding of air pollution dispersion dynamics.
期刊介绍:
The Science of the Total Environment is an international journal dedicated to scientific research on the environment and its interaction with humanity. It covers a wide range of disciplines and seeks to publish innovative, hypothesis-driven, and impactful research that explores the entire environment, including the atmosphere, lithosphere, hydrosphere, biosphere, and anthroposphere.
The journal's updated Aims & Scope emphasizes the importance of interdisciplinary environmental research with broad impact. Priority is given to studies that advance fundamental understanding and explore the interconnectedness of multiple environmental spheres. Field studies are preferred, while laboratory experiments must demonstrate significant methodological advancements or mechanistic insights with direct relevance to the environment.