Balendra V. S. Chauhan, Maureen J. Berg, Kirsty L. Smallbone, Indra Rautela, Suhas Ballal, Kevin P. Wyche
{"title":"Machine Learning Driven Prediction and Analysis of NO2 and its Catalyst Based Reduction in Urban Environments","authors":"Balendra V. S. Chauhan, Maureen J. Berg, Kirsty L. Smallbone, Indra Rautela, Suhas Ballal, Kevin P. Wyche","doi":"10.1007/s11244-025-02161-5","DOIUrl":null,"url":null,"abstract":"<div><p>This study employed machine learning (ML) to predict nitrogen dioxide (NO₂) pollution in Marylebone Road, London a high-traffic urban corridor using historical data from 2015 to 2022 to forecast concentrations for the period January 2023 to January 2025. Four ML models were developed and evaluated: Linear Regression, Random Forest, LightGBM, and an Ensemble Stacking model. These models incorporated meteorological and pollutant data and were assessed using Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and R-squared (R²). The Ensemble Stacking model outperformed the others, achieving an R² of 0.9723, MAE of 3.91 µg/m³, and RMSE of 6.25 µg/m³. In comparison, the Linear Regression model showed the lowest performance (R² = 0.8307, MAE = 11.55, RMSE = 15.45), while Random Forest (R² = 0.9232) and LightGBM (R² = 0.9719) demonstrated intermediate accuracy. The best-performing ensemble model was further used to simulate NO₂ trends with and without titanium dioxide (TiO₂) catalyst intervention, assuming a 28% NO₂ reduction. Temporal analysis revealed that NO, NO₂, and NOₓ concentrations peaked during colder months (November–January) and weekdays. Correlation analysis showed a weak negative relationship between NO₂ and ozone (O₃) (R² = 0.26), moderate positive correlations with black carbon (BC) (R² = 0.597) and sulfur dioxide (SO₂) (R² = 0.654), and a very weak positive correlation with particulate matter (PM2.5) (R² = 0.143). The study concludes that ensemble stacked ML models are effective for predicting NO₂ concentrations and that TiO₂ nanocatalyst interventions hold promise for reducing NO₂, BC, and SO₂ levels in urban environments.</p></div>","PeriodicalId":801,"journal":{"name":"Topics in Catalysis","volume":"68 18-19","pages":"2089 - 2108"},"PeriodicalIF":3.0000,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s11244-025-02161-5.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Topics in Catalysis","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1007/s11244-025-02161-5","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
This study employed machine learning (ML) to predict nitrogen dioxide (NO₂) pollution in Marylebone Road, London a high-traffic urban corridor using historical data from 2015 to 2022 to forecast concentrations for the period January 2023 to January 2025. Four ML models were developed and evaluated: Linear Regression, Random Forest, LightGBM, and an Ensemble Stacking model. These models incorporated meteorological and pollutant data and were assessed using Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and R-squared (R²). The Ensemble Stacking model outperformed the others, achieving an R² of 0.9723, MAE of 3.91 µg/m³, and RMSE of 6.25 µg/m³. In comparison, the Linear Regression model showed the lowest performance (R² = 0.8307, MAE = 11.55, RMSE = 15.45), while Random Forest (R² = 0.9232) and LightGBM (R² = 0.9719) demonstrated intermediate accuracy. The best-performing ensemble model was further used to simulate NO₂ trends with and without titanium dioxide (TiO₂) catalyst intervention, assuming a 28% NO₂ reduction. Temporal analysis revealed that NO, NO₂, and NOₓ concentrations peaked during colder months (November–January) and weekdays. Correlation analysis showed a weak negative relationship between NO₂ and ozone (O₃) (R² = 0.26), moderate positive correlations with black carbon (BC) (R² = 0.597) and sulfur dioxide (SO₂) (R² = 0.654), and a very weak positive correlation with particulate matter (PM2.5) (R² = 0.143). The study concludes that ensemble stacked ML models are effective for predicting NO₂ concentrations and that TiO₂ nanocatalyst interventions hold promise for reducing NO₂, BC, and SO₂ levels in urban environments.
期刊介绍:
Topics in Catalysis publishes topical collections in all fields of catalysis which are composed only of invited articles from leading authors. The journal documents today’s emerging and critical trends in all branches of catalysis. Each themed issue is organized by renowned Guest Editors in collaboration with the Editors-in-Chief. Proposals for new topics are welcome and should be submitted directly to the Editors-in-Chief.
The publication of individual uninvited original research articles can be sent to our sister journal Catalysis Letters. This journal aims for rapid publication of high-impact original research articles in all fields of both applied and theoretical catalysis, including heterogeneous, homogeneous and biocatalysis.