空气质量指数预测机器学习算法的比较分析

K. Kekulanadara, B. Kumara, Kuhaneswaran Banujan
{"title":"空气质量指数预测机器学习算法的比较分析","authors":"K. Kekulanadara, B. Kumara, Kuhaneswaran Banujan","doi":"10.1109/fiti54902.2021.9833033","DOIUrl":null,"url":null,"abstract":"Many scientists and researchers have been worried over the past few decades about the issue of air quality analysis and forecasting, because air pollution in the modern world has become a terrible environmental issue. There have been several health, environmental, and climatic changes owing to polluted air. The major causes leading to poor air quality are urbanization and industrialization. The major and significant air pollutants that affect air quality include NOx, SOx, CO, PM2.5, and PM10. Government agencies employ an Air Quality Index (AQI) to communicate to the public how contaminated the air is now or how polluted it is expected to become. The major emphasis of this work is the analysis of the aforementioned concentration of air pollutants, and classifying the different pollutant concentration levels that adversely affect the maintenance of favourable air quality based on a machine learning approach. We used the dataset, which consists of data on several types of air pollutants taken hourly, at different stations across various cities in India. The hourly data were collected from 15 January 2015 to 1 July 2020. There were 16 attributes. We employed the Decision Tree, Support Vector Machine (SVM), and Random Forest. The most accurate classification is done by a random forest classification algorithm. It outperformed the other approaches with a maximum accuracy of 74%. These results will assist to enhance present research, and guide the future.","PeriodicalId":201458,"journal":{"name":"2021 From Innovation To Impact (FITI)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Comparative Analysis of Machine Learning Algorithms for Predicting Air Quality Index\",\"authors\":\"K. Kekulanadara, B. Kumara, Kuhaneswaran Banujan\",\"doi\":\"10.1109/fiti54902.2021.9833033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many scientists and researchers have been worried over the past few decades about the issue of air quality analysis and forecasting, because air pollution in the modern world has become a terrible environmental issue. There have been several health, environmental, and climatic changes owing to polluted air. The major causes leading to poor air quality are urbanization and industrialization. The major and significant air pollutants that affect air quality include NOx, SOx, CO, PM2.5, and PM10. Government agencies employ an Air Quality Index (AQI) to communicate to the public how contaminated the air is now or how polluted it is expected to become. The major emphasis of this work is the analysis of the aforementioned concentration of air pollutants, and classifying the different pollutant concentration levels that adversely affect the maintenance of favourable air quality based on a machine learning approach. We used the dataset, which consists of data on several types of air pollutants taken hourly, at different stations across various cities in India. The hourly data were collected from 15 January 2015 to 1 July 2020. There were 16 attributes. We employed the Decision Tree, Support Vector Machine (SVM), and Random Forest. The most accurate classification is done by a random forest classification algorithm. It outperformed the other approaches with a maximum accuracy of 74%. These results will assist to enhance present research, and guide the future.\",\"PeriodicalId\":201458,\"journal\":{\"name\":\"2021 From Innovation To Impact (FITI)\",\"volume\":\"82 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 From Innovation To Impact (FITI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/fiti54902.2021.9833033\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 From Innovation To Impact (FITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/fiti54902.2021.9833033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在过去的几十年里,许多科学家和研究人员一直担心空气质量分析和预测问题,因为空气污染在现代世界已经成为一个可怕的环境问题。由于空气污染,出现了若干健康、环境和气候变化。导致空气质量差的主要原因是城市化和工业化。影响空气质量的主要和重要的空气污染物包括NOx、SOx、CO、PM2.5和PM10。政府机构采用空气质量指数(AQI)向公众传达空气污染的现状或预期污染程度。这项工作的主要重点是分析上述空气污染物的浓度,并基于机器学习方法对不同的污染物浓度水平进行分类,这些浓度水平会对维持良好的空气质量产生不利影响。我们使用的数据集包括印度不同城市的不同站点每小时采集的几种空气污染物的数据。每小时数据采集时间为2015年1月15日至2020年7月1日。共有16个属性。我们采用决策树、支持向量机(SVM)和随机森林。最准确的分类是由随机森林分类算法完成的。它以74%的最高准确率优于其他方法。这些结果将有助于加强目前的研究,并指导未来。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Comparative Analysis of Machine Learning Algorithms for Predicting Air Quality Index
Many scientists and researchers have been worried over the past few decades about the issue of air quality analysis and forecasting, because air pollution in the modern world has become a terrible environmental issue. There have been several health, environmental, and climatic changes owing to polluted air. The major causes leading to poor air quality are urbanization and industrialization. The major and significant air pollutants that affect air quality include NOx, SOx, CO, PM2.5, and PM10. Government agencies employ an Air Quality Index (AQI) to communicate to the public how contaminated the air is now or how polluted it is expected to become. The major emphasis of this work is the analysis of the aforementioned concentration of air pollutants, and classifying the different pollutant concentration levels that adversely affect the maintenance of favourable air quality based on a machine learning approach. We used the dataset, which consists of data on several types of air pollutants taken hourly, at different stations across various cities in India. The hourly data were collected from 15 January 2015 to 1 July 2020. There were 16 attributes. We employed the Decision Tree, Support Vector Machine (SVM), and Random Forest. The most accurate classification is done by a random forest classification algorithm. It outperformed the other approaches with a maximum accuracy of 74%. These results will assist to enhance present research, and guide the future.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信