{"title":"空气质量指数预测机器学习算法的比较分析","authors":"K. Kekulanadara, B. Kumara, Kuhaneswaran Banujan","doi":"10.1109/fiti54902.2021.9833033","DOIUrl":null,"url":null,"abstract":"Many scientists and researchers have been worried over the past few decades about the issue of air quality analysis and forecasting, because air pollution in the modern world has become a terrible environmental issue. There have been several health, environmental, and climatic changes owing to polluted air. The major causes leading to poor air quality are urbanization and industrialization. The major and significant air pollutants that affect air quality include NOx, SOx, CO, PM2.5, and PM10. Government agencies employ an Air Quality Index (AQI) to communicate to the public how contaminated the air is now or how polluted it is expected to become. The major emphasis of this work is the analysis of the aforementioned concentration of air pollutants, and classifying the different pollutant concentration levels that adversely affect the maintenance of favourable air quality based on a machine learning approach. We used the dataset, which consists of data on several types of air pollutants taken hourly, at different stations across various cities in India. The hourly data were collected from 15 January 2015 to 1 July 2020. There were 16 attributes. We employed the Decision Tree, Support Vector Machine (SVM), and Random Forest. The most accurate classification is done by a random forest classification algorithm. It outperformed the other approaches with a maximum accuracy of 74%. These results will assist to enhance present research, and guide the future.","PeriodicalId":201458,"journal":{"name":"2021 From Innovation To Impact (FITI)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Comparative Analysis of Machine Learning Algorithms for Predicting Air Quality Index\",\"authors\":\"K. Kekulanadara, B. Kumara, Kuhaneswaran Banujan\",\"doi\":\"10.1109/fiti54902.2021.9833033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many scientists and researchers have been worried over the past few decades about the issue of air quality analysis and forecasting, because air pollution in the modern world has become a terrible environmental issue. There have been several health, environmental, and climatic changes owing to polluted air. The major causes leading to poor air quality are urbanization and industrialization. The major and significant air pollutants that affect air quality include NOx, SOx, CO, PM2.5, and PM10. Government agencies employ an Air Quality Index (AQI) to communicate to the public how contaminated the air is now or how polluted it is expected to become. The major emphasis of this work is the analysis of the aforementioned concentration of air pollutants, and classifying the different pollutant concentration levels that adversely affect the maintenance of favourable air quality based on a machine learning approach. We used the dataset, which consists of data on several types of air pollutants taken hourly, at different stations across various cities in India. The hourly data were collected from 15 January 2015 to 1 July 2020. There were 16 attributes. We employed the Decision Tree, Support Vector Machine (SVM), and Random Forest. The most accurate classification is done by a random forest classification algorithm. It outperformed the other approaches with a maximum accuracy of 74%. These results will assist to enhance present research, and guide the future.\",\"PeriodicalId\":201458,\"journal\":{\"name\":\"2021 From Innovation To Impact (FITI)\",\"volume\":\"82 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 From Innovation To Impact (FITI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/fiti54902.2021.9833033\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 From Innovation To Impact (FITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/fiti54902.2021.9833033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparative Analysis of Machine Learning Algorithms for Predicting Air Quality Index
Many scientists and researchers have been worried over the past few decades about the issue of air quality analysis and forecasting, because air pollution in the modern world has become a terrible environmental issue. There have been several health, environmental, and climatic changes owing to polluted air. The major causes leading to poor air quality are urbanization and industrialization. The major and significant air pollutants that affect air quality include NOx, SOx, CO, PM2.5, and PM10. Government agencies employ an Air Quality Index (AQI) to communicate to the public how contaminated the air is now or how polluted it is expected to become. The major emphasis of this work is the analysis of the aforementioned concentration of air pollutants, and classifying the different pollutant concentration levels that adversely affect the maintenance of favourable air quality based on a machine learning approach. We used the dataset, which consists of data on several types of air pollutants taken hourly, at different stations across various cities in India. The hourly data were collected from 15 January 2015 to 1 July 2020. There were 16 attributes. We employed the Decision Tree, Support Vector Machine (SVM), and Random Forest. The most accurate classification is done by a random forest classification algorithm. It outperformed the other approaches with a maximum accuracy of 74%. These results will assist to enhance present research, and guide the future.