Combining machine learning to explore the influence of VOCs and meteorological factors on ozone concentration: A case study of a chemical park in Shenyang, China
IF 4.5 2区 地球科学Q1 METEOROLOGY & ATMOSPHERIC SCIENCES
Qi Jiang , Nan Wang , Zhenling Jin , Xuebin Sun , Jiayin Wang , Ru Li
{"title":"Combining machine learning to explore the influence of VOCs and meteorological factors on ozone concentration: A case study of a chemical park in Shenyang, China","authors":"Qi Jiang , Nan Wang , Zhenling Jin , Xuebin Sun , Jiayin Wang , Ru Li","doi":"10.1016/j.atmosres.2025.108113","DOIUrl":null,"url":null,"abstract":"<div><div>Ozone (<span><math><msub><mi>O</mi><mn>3</mn></msub></math></span>) pollution is a pervasive air quality issue, with volatile organic compounds (VOCs) significantly contributing to <span><math><msub><mi>O</mi><mn>3</mn></msub></math></span> formation. Utilizing continuous online monitoring data from a chemical park in Shenyang in 2019, we analyzed variations in VOCs, conventional pollutants, and meteorological conditions, preliminarily determining their apparent effects on <span><math><msub><mi>O</mi><mn>3</mn></msub></math></span>. Results indicated an “M”-shaped monthly average ozone variation, with peaks in summer and troughs in winter, influenced by photochemical reactions and meteorological factors. The daily mean concentrations of alkanes, alkynes, and aromatic hydrocarbons in VOCs displayed clear periodicity, with peaks occurring at 5:00–7:00 and 19:00–20:00, and troughs at 14:00–15:00. The monthly mean concentrations exhibited seasonal trends, with higher levels in fall and winter, and lower levels in spring and summer, demonstrating a pattern opposite to that of <span><math><msub><mi>O</mi><mn>3</mn></msub></math></span>. Using machine learning techniques, we modeled the relationship between key factors and <span><math><msub><mi>O</mi><mn>3</mn></msub></math></span> concentration. The results revealed that the optimized Extreme Gradient Boosting (XGBoost) model achieved the highest correlation coefficient and demonstrated the best performance. Using the feature importance method, we identified the key factors most strongly associated with <span><math><msub><mi>O</mi><mn>3</mn></msub></math></span> concentration. The optimized model was then employed to examine the variation in <span><math><msub><mi>O</mi><mn>3</mn></msub></math></span> concentration across different temperature and humidity conditions. This study provides essential insights for developing effective pollution control strategies and guiding environmental management decisions.</div></div>","PeriodicalId":8600,"journal":{"name":"Atmospheric Research","volume":"322 ","pages":"Article 108113"},"PeriodicalIF":4.5000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169809525002054","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Ozone () pollution is a pervasive air quality issue, with volatile organic compounds (VOCs) significantly contributing to formation. Utilizing continuous online monitoring data from a chemical park in Shenyang in 2019, we analyzed variations in VOCs, conventional pollutants, and meteorological conditions, preliminarily determining their apparent effects on . Results indicated an “M”-shaped monthly average ozone variation, with peaks in summer and troughs in winter, influenced by photochemical reactions and meteorological factors. The daily mean concentrations of alkanes, alkynes, and aromatic hydrocarbons in VOCs displayed clear periodicity, with peaks occurring at 5:00–7:00 and 19:00–20:00, and troughs at 14:00–15:00. The monthly mean concentrations exhibited seasonal trends, with higher levels in fall and winter, and lower levels in spring and summer, demonstrating a pattern opposite to that of . Using machine learning techniques, we modeled the relationship between key factors and concentration. The results revealed that the optimized Extreme Gradient Boosting (XGBoost) model achieved the highest correlation coefficient and demonstrated the best performance. Using the feature importance method, we identified the key factors most strongly associated with concentration. The optimized model was then employed to examine the variation in concentration across different temperature and humidity conditions. This study provides essential insights for developing effective pollution control strategies and guiding environmental management decisions.
期刊介绍:
The journal publishes scientific papers (research papers, review articles, letters and notes) dealing with the part of the atmosphere where meteorological events occur. Attention is given to all processes extending from the earth surface to the tropopause, but special emphasis continues to be devoted to the physics of clouds, mesoscale meteorology and air pollution, i.e. atmospheric aerosols; microphysical processes; cloud dynamics and thermodynamics; numerical simulation, climatology, climate change and weather modification.