{"title":"Forecasting ground-level ozone in Shenyang using interpretable machine learning: Interaction between air pollutants and climate factors","authors":"Shenao Fan, Yunfeng Ma","doi":"10.1016/j.apr.2025.102836","DOIUrl":null,"url":null,"abstract":"<div><div>Ground-level ozone (GLO), produced through reactions between <figure><img></figure> and VOCs under sunlight, is a toxic secondary pollutant that adversely impacts ecosystems and human well-being. This study proposes an interpretable, high-resolution forecasting framework based on a weighted ensemble of machine learning algorithms, including LightGBM, XGBoost, and CatBoost. Using hourly observational data from Shenyang, China (2020–2025), the model incorporates meteorological, temporal, and air quality features to capture annual-scale ozone variability. The ensemble outperformed individual learners, achieving strong predictive accuracy (<span><math><mrow><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup><mo>=</mo><mn>0</mn><mo>.</mo><mn>868</mn></mrow></math></span>, MAE = <span><math><mrow><mn>10</mn><mo>.</mo><mn>3</mn><mspace></mspace><mi>μ</mi><mtext>g</mtext><mo>/</mo><msup><mrow><mtext>m</mtext></mrow><mrow><mn>3</mn></mrow></msup></mrow></math></span>, RMSE = <span><math><mrow><mn>14</mn><mo>.</mo><mn>7</mn><mspace></mspace><mi>μ</mi><mtext>g</mtext><mo>/</mo><msup><mrow><mtext>m</mtext></mrow><mrow><mn>3</mn></mrow></msup></mrow></math></span>). To enhance interpretability, SHAP analysis was used to reveal nonlinear interactions among meteorological and chemical drivers. High temperatures and moderate humidity were found to promote ozone formation. Seasonal transport patterns identified by concentration-weighted trajectory (CWT) analysis revealed dominant local photochemistry in summer and stronger long-range influence in winter. The proposed framework offers both predictive accuracy and physical interpretability, supporting early warning and targeted ozone control strategies. Its transferable design enables application to other urban regions with complex atmospheric conditions.</div></div>","PeriodicalId":8604,"journal":{"name":"Atmospheric Pollution Research","volume":"17 4","pages":"Article 102836"},"PeriodicalIF":3.5000,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Pollution Research","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1309104225004386","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/11/29 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Ground-level ozone (GLO), produced through reactions between and VOCs under sunlight, is a toxic secondary pollutant that adversely impacts ecosystems and human well-being. This study proposes an interpretable, high-resolution forecasting framework based on a weighted ensemble of machine learning algorithms, including LightGBM, XGBoost, and CatBoost. Using hourly observational data from Shenyang, China (2020–2025), the model incorporates meteorological, temporal, and air quality features to capture annual-scale ozone variability. The ensemble outperformed individual learners, achieving strong predictive accuracy (, MAE = , RMSE = ). To enhance interpretability, SHAP analysis was used to reveal nonlinear interactions among meteorological and chemical drivers. High temperatures and moderate humidity were found to promote ozone formation. Seasonal transport patterns identified by concentration-weighted trajectory (CWT) analysis revealed dominant local photochemistry in summer and stronger long-range influence in winter. The proposed framework offers both predictive accuracy and physical interpretability, supporting early warning and targeted ozone control strategies. Its transferable design enables application to other urban regions with complex atmospheric conditions.
期刊介绍:
Atmospheric Pollution Research (APR) is an international journal designed for the publication of articles on air pollution. Papers should present novel experimental results, theory and modeling of air pollution on local, regional, or global scales. Areas covered are research on inorganic, organic, and persistent organic air pollutants, air quality monitoring, air quality management, atmospheric dispersion and transport, air-surface (soil, water, and vegetation) exchange of pollutants, dry and wet deposition, indoor air quality, exposure assessment, health effects, satellite measurements, natural emissions, atmospheric chemistry, greenhouse gases, and effects on climate change.