用于比哈尔邦PM2.5水平时间序列预测的堆叠深度学习集成

IF 6.9 2区工程技术 Q1 ENVIRONMENTAL SCIENCES

Urban Climate Pub Date : 2025-07-25 DOI:10.1016/j.uclim.2025.102521

Ravi Patel, Aditya Kumar, Jainath Yadav, Mrityunjay Singh

{"title":"用于比哈尔邦PM2.5水平时间序列预测的堆叠深度学习集成","authors":"Ravi Patel, Aditya Kumar, Jainath Yadav, Mrityunjay Singh","doi":"10.1016/j.uclim.2025.102521","DOIUrl":null,"url":null,"abstract":"<div><div>A major contributor to the deterioration of air quality is Particulate matter (PM<sub>2.5</sub>) with a diameter of less than 2.5 micrometers, which makes air pollution among the most pressing environmental concerns in the world. Bihar, one of India’s most densely populated states, has experienced deteriorating air quality over the past decade, particularly in major urban centers like Patna, Gaya, and Muzaffarpur. Given the severe health and environmental implications of high PM<sub>2.5</sub> levels, accurate forecasting models are essential for proactive pollution control measures. This study explores the application of various time series forecasting models for predicting PM<sub>2.5</sub> concentrations, focusing on deep learning ensemble methods. The framework utilizes five base deep learning models: Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Gated Recurrent Unit (GRU), and Bidirectional LSTM (Bi-LSTM), each trained individually to capture different aspects of temporal dependencies in the data. To enhance predictive accuracy, the predictions from these base models are combined using a stacking-based ensemble approach with XGBoost as the meta-learner. This ensemble method refines the final prediction by leveraging the strengths of each model and mitigating their individual weaknesses. The proposed model exhibits outstanding effectiveness in PM<sub>2.5</sub> estimation, attaining an MSE of 33.72, MAE of 2.56, RMSE of 5.80, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99 for Patna. Similarly, it attains an MSE of 8.90, MAE of 2.12, RMSE of 2.98, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99 for Gaya, while for Muzaffarpur, the model records an MSE of 11.37, MAE of 2.44, RMSE of 3.37, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99. These outcomes demonstrate how accurate and dependable the model is at predicting air quality in various places. The findings aim to assist policymakers and environmental agencies in making informed decisions to reduce air pollution and safeguard the general public’s health.</div></div>","PeriodicalId":48626,"journal":{"name":"Urban Climate","volume":"62 ","pages":"Article 102521"},"PeriodicalIF":6.9000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Stacked deep learning ensemble for time series prediction of PM2.5 levels in Bihar\",\"authors\":\"Ravi Patel, Aditya Kumar, Jainath Yadav, Mrityunjay Singh\",\"doi\":\"10.1016/j.uclim.2025.102521\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>A major contributor to the deterioration of air quality is Particulate matter (PM<sub>2.5</sub>) with a diameter of less than 2.5 micrometers, which makes air pollution among the most pressing environmental concerns in the world. Bihar, one of India’s most densely populated states, has experienced deteriorating air quality over the past decade, particularly in major urban centers like Patna, Gaya, and Muzaffarpur. Given the severe health and environmental implications of high PM<sub>2.5</sub> levels, accurate forecasting models are essential for proactive pollution control measures. This study explores the application of various time series forecasting models for predicting PM<sub>2.5</sub> concentrations, focusing on deep learning ensemble methods. The framework utilizes five base deep learning models: Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Gated Recurrent Unit (GRU), and Bidirectional LSTM (Bi-LSTM), each trained individually to capture different aspects of temporal dependencies in the data. To enhance predictive accuracy, the predictions from these base models are combined using a stacking-based ensemble approach with XGBoost as the meta-learner. This ensemble method refines the final prediction by leveraging the strengths of each model and mitigating their individual weaknesses. The proposed model exhibits outstanding effectiveness in PM<sub>2.5</sub> estimation, attaining an MSE of 33.72, MAE of 2.56, RMSE of 5.80, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99 for Patna. Similarly, it attains an MSE of 8.90, MAE of 2.12, RMSE of 2.98, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99 for Gaya, while for Muzaffarpur, the model records an MSE of 11.37, MAE of 2.44, RMSE of 3.37, and an R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99. These outcomes demonstrate how accurate and dependable the model is at predicting air quality in various places. The findings aim to assist policymakers and environmental agencies in making informed decisions to reduce air pollution and safeguard the general public’s health.</div></div>\",\"PeriodicalId\":48626,\"journal\":{\"name\":\"Urban Climate\",\"volume\":\"62 \",\"pages\":\"Article 102521\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Urban Climate\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2212095525002378\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Urban Climate","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212095525002378","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

空气质量恶化的主要原因是直径小于2.5微米的颗粒物（PM2.5），这使得空气污染成为世界上最紧迫的环境问题之一。比哈尔邦是印度人口最稠密的邦之一，在过去十年里，空气质量一直在恶化，尤其是在巴特那、加亚和穆扎法尔普尔等主要城市中心。考虑到高PM2.5水平对健康和环境的严重影响，准确的预测模型对于采取主动的污染控制措施至关重要。本研究探讨了各种时间序列预测模型在PM2.5浓度预测中的应用，重点研究了深度学习集成方法。该框架利用五个基本深度学习模型：长短期记忆（LSTM）、卷积神经网络（cnn）、循环神经网络（rnn）、门通循环单元（GRU）和双向LSTM (Bi-LSTM)，每个模型都单独训练以捕获数据中时间依赖性的不同方面。为了提高预测的准确性，使用基于堆叠的集成方法将这些基本模型的预测与XGBoost作为元学习器相结合。这种集成方法通过利用每个模型的优点和减轻它们各自的缺点来改进最终的预测。该模型在PM2.5估计中表现出出色的有效性，在巴特那的MSE为33.72，MAE为2.56，RMSE为5.80，R2为0.99。同样，Gaya的MSE为8.90，MAE为2.12，RMSE为2.98，R2为0.99，而Muzaffarpur的MSE为11.37，MAE为2.44，RMSE为3.37，R2为0.99。这些结果表明，该模型在预测各地空气质量方面是多么准确和可靠。研究结果旨在帮助决策者和环境机构做出明智的决定，以减少空气污染，保障公众健康。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Stacked deep learning ensemble for time series prediction of PM2.5 levels in Bihar

A major contributor to the deterioration of air quality is Particulate matter (PM_2.5) with a diameter of less than 2.5 micrometers, which makes air pollution among the most pressing environmental concerns in the world. Bihar, one of India’s most densely populated states, has experienced deteriorating air quality over the past decade, particularly in major urban centers like Patna, Gaya, and Muzaffarpur. Given the severe health and environmental implications of high PM_2.5 levels, accurate forecasting models are essential for proactive pollution control measures. This study explores the application of various time series forecasting models for predicting PM_2.5 concentrations, focusing on deep learning ensemble methods. The framework utilizes five base deep learning models: Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Gated Recurrent Unit (GRU), and Bidirectional LSTM (Bi-LSTM), each trained individually to capture different aspects of temporal dependencies in the data. To enhance predictive accuracy, the predictions from these base models are combined using a stacking-based ensemble approach with XGBoost as the meta-learner. This ensemble method refines the final prediction by leveraging the strengths of each model and mitigating their individual weaknesses. The proposed model exhibits outstanding effectiveness in PM_2.5 estimation, attaining an MSE of 33.72, MAE of 2.56, RMSE of 5.80, and an R

^{2}

of 0.99 for Patna. Similarly, it attains an MSE of 8.90, MAE of 2.12, RMSE of 2.98, and an R

^{2}

of 0.99 for Gaya, while for Muzaffarpur, the model records an MSE of 11.37, MAE of 2.44, RMSE of 3.37, and an R

^{2}

of 0.99. These outcomes demonstrate how accurate and dependable the model is at predicting air quality in various places. The findings aim to assist policymakers and environmental agencies in making informed decisions to reduce air pollution and safeguard the general public’s health.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Urban Climate Social Sciences-Urban Studies

CiteScore

9.70

自引率

9.40%

发文量

286

期刊介绍： Urban Climate serves the scientific and decision making communities with the publication of research on theory, science and applications relevant to understanding urban climatic conditions and change in relation to their geography and to demographic, socioeconomic, institutional, technological and environmental dynamics and global change. Targeted towards both disciplinary and interdisciplinary audiences, this journal publishes original research papers, comprehensive review articles, book reviews, and short communications on topics including, but not limited to, the following: Urban meteorology and climate[...] Urban environmental pollution[...] Adaptation to global change[...] Urban economic and social issues[...] Research Approaches[...]