Ravi Patel, Aditya Kumar, Jainath Yadav, Mrityunjay Singh
{"title":"用于比哈尔邦PM2.5水平时间序列预测的堆叠深度学习集成","authors":"Ravi Patel, Aditya Kumar, Jainath Yadav, Mrityunjay Singh","doi":"10.1016/j.uclim.2025.102521","DOIUrl":null,"url":null,"abstract":"<div><div>A major contributor to the deterioration of air quality is Particulate matter (PM<sub>2.5</sub>) with a diameter of less than 2.5 micrometers, which makes air pollution among the most pressing environmental concerns in the world. Bihar, one of India’s most densely populated states, has experienced deteriorating air quality over the past decade, particularly in major urban centers like Patna, Gaya, and Muzaffarpur. Given the severe health and environmental implications of high PM<sub>2.5</sub> levels, accurate forecasting models are essential for proactive pollution control measures. This study explores the application of various time series forecasting models for predicting PM<sub>2.5</sub> concentrations, focusing on deep learning ensemble methods. The framework utilizes five base deep learning models: Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Gated Recurrent Unit (GRU), and Bidirectional LSTM (Bi-LSTM), each trained individually to capture different aspects of temporal dependencies in the data. To enhance predictive accuracy, the predictions from these base models are combined using a stacking-based ensemble approach with XGBoost as the meta-learner. This ensemble method refines the final prediction by leveraging the strengths of each model and mitigating their individual weaknesses. The proposed model exhibits outstanding effectiveness in PM<sub>2.5</sub> estimation, attaining an MSE of 33.72, MAE of 2.56, RMSE of 5.80, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99 for Patna. Similarly, it attains an MSE of 8.90, MAE of 2.12, RMSE of 2.98, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99 for Gaya, while for Muzaffarpur, the model records an MSE of 11.37, MAE of 2.44, RMSE of 3.37, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99. These outcomes demonstrate how accurate and dependable the model is at predicting air quality in various places. The findings aim to assist policymakers and environmental agencies in making informed decisions to reduce air pollution and safeguard the general public’s health.</div></div>","PeriodicalId":48626,"journal":{"name":"Urban Climate","volume":"62 ","pages":"Article 102521"},"PeriodicalIF":6.9000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Stacked deep learning ensemble for time series prediction of PM2.5 levels in Bihar\",\"authors\":\"Ravi Patel, Aditya Kumar, Jainath Yadav, Mrityunjay Singh\",\"doi\":\"10.1016/j.uclim.2025.102521\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>A major contributor to the deterioration of air quality is Particulate matter (PM<sub>2.5</sub>) with a diameter of less than 2.5 micrometers, which makes air pollution among the most pressing environmental concerns in the world. Bihar, one of India’s most densely populated states, has experienced deteriorating air quality over the past decade, particularly in major urban centers like Patna, Gaya, and Muzaffarpur. Given the severe health and environmental implications of high PM<sub>2.5</sub> levels, accurate forecasting models are essential for proactive pollution control measures. This study explores the application of various time series forecasting models for predicting PM<sub>2.5</sub> concentrations, focusing on deep learning ensemble methods. The framework utilizes five base deep learning models: Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Gated Recurrent Unit (GRU), and Bidirectional LSTM (Bi-LSTM), each trained individually to capture different aspects of temporal dependencies in the data. To enhance predictive accuracy, the predictions from these base models are combined using a stacking-based ensemble approach with XGBoost as the meta-learner. This ensemble method refines the final prediction by leveraging the strengths of each model and mitigating their individual weaknesses. The proposed model exhibits outstanding effectiveness in PM<sub>2.5</sub> estimation, attaining an MSE of 33.72, MAE of 2.56, RMSE of 5.80, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99 for Patna. Similarly, it attains an MSE of 8.90, MAE of 2.12, RMSE of 2.98, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99 for Gaya, while for Muzaffarpur, the model records an MSE of 11.37, MAE of 2.44, RMSE of 3.37, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99. These outcomes demonstrate how accurate and dependable the model is at predicting air quality in various places. The findings aim to assist policymakers and environmental agencies in making informed decisions to reduce air pollution and safeguard the general public’s health.</div></div>\",\"PeriodicalId\":48626,\"journal\":{\"name\":\"Urban Climate\",\"volume\":\"62 \",\"pages\":\"Article 102521\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Urban Climate\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2212095525002378\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Urban Climate","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212095525002378","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Stacked deep learning ensemble for time series prediction of PM2.5 levels in Bihar
A major contributor to the deterioration of air quality is Particulate matter (PM2.5) with a diameter of less than 2.5 micrometers, which makes air pollution among the most pressing environmental concerns in the world. Bihar, one of India’s most densely populated states, has experienced deteriorating air quality over the past decade, particularly in major urban centers like Patna, Gaya, and Muzaffarpur. Given the severe health and environmental implications of high PM2.5 levels, accurate forecasting models are essential for proactive pollution control measures. This study explores the application of various time series forecasting models for predicting PM2.5 concentrations, focusing on deep learning ensemble methods. The framework utilizes five base deep learning models: Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Gated Recurrent Unit (GRU), and Bidirectional LSTM (Bi-LSTM), each trained individually to capture different aspects of temporal dependencies in the data. To enhance predictive accuracy, the predictions from these base models are combined using a stacking-based ensemble approach with XGBoost as the meta-learner. This ensemble method refines the final prediction by leveraging the strengths of each model and mitigating their individual weaknesses. The proposed model exhibits outstanding effectiveness in PM2.5 estimation, attaining an MSE of 33.72, MAE of 2.56, RMSE of 5.80, and an R of 0.99 for Patna. Similarly, it attains an MSE of 8.90, MAE of 2.12, RMSE of 2.98, and an R of 0.99 for Gaya, while for Muzaffarpur, the model records an MSE of 11.37, MAE of 2.44, RMSE of 3.37, and an R of 0.99. These outcomes demonstrate how accurate and dependable the model is at predicting air quality in various places. The findings aim to assist policymakers and environmental agencies in making informed decisions to reduce air pollution and safeguard the general public’s health.
期刊介绍:
Urban Climate serves the scientific and decision making communities with the publication of research on theory, science and applications relevant to understanding urban climatic conditions and change in relation to their geography and to demographic, socioeconomic, institutional, technological and environmental dynamics and global change. Targeted towards both disciplinary and interdisciplinary audiences, this journal publishes original research papers, comprehensive review articles, book reviews, and short communications on topics including, but not limited to, the following:
Urban meteorology and climate[...]
Urban environmental pollution[...]
Adaptation to global change[...]
Urban economic and social issues[...]
Research Approaches[...]