{"title":"Explainable artificial intelligence-driven model for ultrafine particle (PM0.1) prediction and explanation using meteorological variables","authors":"Apaporn Tipsavak , Thanyabun Phutson , Thanathip Limna , Racha Dejchanchaiwong , Perapong Tekasakul , Kirttayoth Yeranee , Mallika Kliangkhlao , Bukhoree Sahoh","doi":"10.1016/j.envc.2025.101248","DOIUrl":null,"url":null,"abstract":"<div><div>PM<sub>0.1</sub>, an ultrafine urban air pollutant, poses significant health risks due to its ability to penetrate deep into the lungs, enter the bloodstream, and rapidly circulate throughout the human body, potentially causing severe respiratory diseases. Effective monitoring and explanation of PM<sub>0.1</sub> concentrations are essential for implementing targeted air quality management strategies. This research addresses these challenges through a novel soft computing framework that integrates multiple computational intelligence techniques for both accurate prediction and interpretation of PM<sub>0.1</sub> concentrations using meteorological variables. Our methodology employs boosting algorithms with hyperparameter optimization to develop the most effective predictive model. Validated using real-world urban air quality data from Southeast Asia, our framework demonstrates broad applicability for diverse urban environments. Performance evaluation using unseen data demonstrates that XGBoost with Bayesian optimization achieves superior results, with an <em>R<sup>2</sup></em> of 89 %, <em>Mean absolute error</em> (<em>MAE</em>) of 0.28 µg/m<sup>3</sup>, and <em>Root Mean Squared Error</em> (<em>RMSE)</em> of 0.38 µg/m<sup>3</sup>. By integrating SHAP (SHapley Additive exPlanations) with the XGBoost model, we provide global and individual explanations of the model's predictions, highlighting the relative importance of meteorological variables such as temperature, humidity, and wind speed. This hybrid soft computing approach enhances the model's practical utility for environmental decision-making. It sets a foundation for future intelligent environmental monitoring applications across diverse regions, offering a scalable solution for targeted urban air quality management that balances prediction accuracy with interpretability.</div></div>","PeriodicalId":34794,"journal":{"name":"Environmental Challenges","volume":"20 ","pages":"Article 101248"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Challenges","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667010025001672","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Environmental Science","Score":null,"Total":0}
引用次数: 0
Abstract
PM0.1, an ultrafine urban air pollutant, poses significant health risks due to its ability to penetrate deep into the lungs, enter the bloodstream, and rapidly circulate throughout the human body, potentially causing severe respiratory diseases. Effective monitoring and explanation of PM0.1 concentrations are essential for implementing targeted air quality management strategies. This research addresses these challenges through a novel soft computing framework that integrates multiple computational intelligence techniques for both accurate prediction and interpretation of PM0.1 concentrations using meteorological variables. Our methodology employs boosting algorithms with hyperparameter optimization to develop the most effective predictive model. Validated using real-world urban air quality data from Southeast Asia, our framework demonstrates broad applicability for diverse urban environments. Performance evaluation using unseen data demonstrates that XGBoost with Bayesian optimization achieves superior results, with an R2 of 89 %, Mean absolute error (MAE) of 0.28 µg/m3, and Root Mean Squared Error (RMSE) of 0.38 µg/m3. By integrating SHAP (SHapley Additive exPlanations) with the XGBoost model, we provide global and individual explanations of the model's predictions, highlighting the relative importance of meteorological variables such as temperature, humidity, and wind speed. This hybrid soft computing approach enhances the model's practical utility for environmental decision-making. It sets a foundation for future intelligent environmental monitoring applications across diverse regions, offering a scalable solution for targeted urban air quality management that balances prediction accuracy with interpretability.