An Exploratory Analysis of Delhi Air Quality Using Statistics and Machine Learning Models

Anwesha Chakravarty, S. S, S. S
{"title":"An Exploratory Analysis of Delhi Air Quality Using Statistics and Machine Learning Models","authors":"Anwesha Chakravarty, S. S, S. S","doi":"10.1109/IATMSI56455.2022.10119423","DOIUrl":null,"url":null,"abstract":"Air pollution is one of the most significant concerns of the present era, which has severe and alarming effects on human health and the environment, thereby escalating the climate change issue. Hence, in-depth analysis of air pollution data and accurate air quality forecasting is crucial in controlling the growing pollution levels. It also aids in designing appropriate policies to prevent exposure to toxic pollutants and taking necessary precautionary measures. Air quality in Delhi, the capital of India, is inferior compared to other major cities in the world. In this study, daily and hourly concentrations of air pollutants in the Delhi region were collected and analyzed using various methods. A comparative analysis is performed based on months, seasons, and the topography of different stations. The effect of the Covid-19 lockdown on the reduction of pollutant levels is also studied. A correlation analysis is performed on the available data to show the relationships and dependencies among different pollutants, their relationship with weather parameters, and the correlations between the stations. Various machine learning models were used for air quality forecasting, like Linear Regression, Vector Auto Regression, Gradient Boosting Machine, Random Forest, and Decision Tree Regression. The performance of these models was compared using RMSE, MAE, and MAPE metrics. This study is focused on the dire state of air pollution in Delhi, the primary reasons behind it, and the efficacy of calculated lockdowns in bringing down pollution levels. It also highlights the potential of Linear Regression and Decision Tree Regression models in predicting the air quality for different time intervals.","PeriodicalId":221211,"journal":{"name":"2022 IEEE Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IATMSI56455.2022.10119423","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Air pollution is one of the most significant concerns of the present era, which has severe and alarming effects on human health and the environment, thereby escalating the climate change issue. Hence, in-depth analysis of air pollution data and accurate air quality forecasting is crucial in controlling the growing pollution levels. It also aids in designing appropriate policies to prevent exposure to toxic pollutants and taking necessary precautionary measures. Air quality in Delhi, the capital of India, is inferior compared to other major cities in the world. In this study, daily and hourly concentrations of air pollutants in the Delhi region were collected and analyzed using various methods. A comparative analysis is performed based on months, seasons, and the topography of different stations. The effect of the Covid-19 lockdown on the reduction of pollutant levels is also studied. A correlation analysis is performed on the available data to show the relationships and dependencies among different pollutants, their relationship with weather parameters, and the correlations between the stations. Various machine learning models were used for air quality forecasting, like Linear Regression, Vector Auto Regression, Gradient Boosting Machine, Random Forest, and Decision Tree Regression. The performance of these models was compared using RMSE, MAE, and MAPE metrics. This study is focused on the dire state of air pollution in Delhi, the primary reasons behind it, and the efficacy of calculated lockdowns in bringing down pollution levels. It also highlights the potential of Linear Regression and Decision Tree Regression models in predicting the air quality for different time intervals.
使用统计和机器学习模型对德里空气质量进行探索性分析
空气污染是当今时代最令人关切的问题之一,它对人类健康和环境产生了严重和令人震惊的影响,从而使气候变化问题升级。因此,深入分析空气污染数据及作出准确的空气质素预测,对控制日益严重的污染程度至为重要。它还有助于制定适当的政策,防止接触有毒污染物并采取必要的预防措施。与世界上其他主要城市相比,印度首都德里的空气质量较差。在本研究中,使用各种方法收集和分析了德里地区每日和每小时的空气污染物浓度。根据月份、季节和不同站点的地形进行对比分析。还研究了新冠肺炎疫情防控对降低污染物水平的影响。对现有数据进行相关分析,以显示不同污染物之间的关系和依赖关系,它们与天气参数的关系,以及台站之间的相关性。各种机器学习模型被用于空气质量预测,如线性回归、向量自回归、梯度增强机、随机森林和决策树回归。使用RMSE、MAE和MAPE指标对这些模型的性能进行比较。这项研究的重点是德里严重的空气污染状况,其背后的主要原因,以及有计划的封锁在降低污染水平方面的效果。它还强调了线性回归和决策树回归模型在预测不同时间间隔的空气质量方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信