P. William, Deepak Paithankar, P. Yawalkar, Sachin K. Korde, Abhijeet Rajendra, Pabale, D. Rakshe
{"title":"Divination of Air Quality Assessment using Ensembling Machine Learning Approach","authors":"P. William, Deepak Paithankar, P. Yawalkar, Sachin K. Korde, Abhijeet Rajendra, Pabale, D. Rakshe","doi":"10.1109/ICECONF57129.2023.10083751","DOIUrl":null,"url":null,"abstract":"Smart cities must address air pollution as a top environmental concern. Real-time monitoring of pollution data enables metropolitan authorities to analyze the city's current traffic conditions and implement necessary corrective actions. The increased usage of Internet of things (IoT)-based sensors has altered the dynamics of air quality prediction significantly. While earlier research has used a number of machine learning techniques to anticipate pollution, it is usually necessary to compare various tactics in order to better understand how long they take to analyse different datasets. The best model for accurately predicting air quality given the amount of data available and the processing time required was determined by a comparative study of four different advanced regression algorithms. Apache Spark was used to perform tests and estimate pollution levels from a range of publicly available data sources. MAE and the root mean square error (RMSE) are often used to compare regression models. In order to find the best-fitting mode on Apache Spark, each method was tested in terms of processing time and error rate using a mix of standalone learning and fitting the hyperparameter tweaks on Apache Spark.","PeriodicalId":436733,"journal":{"name":"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)","volume":"75 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECONF57129.2023.10083751","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 20
Abstract
Smart cities must address air pollution as a top environmental concern. Real-time monitoring of pollution data enables metropolitan authorities to analyze the city's current traffic conditions and implement necessary corrective actions. The increased usage of Internet of things (IoT)-based sensors has altered the dynamics of air quality prediction significantly. While earlier research has used a number of machine learning techniques to anticipate pollution, it is usually necessary to compare various tactics in order to better understand how long they take to analyse different datasets. The best model for accurately predicting air quality given the amount of data available and the processing time required was determined by a comparative study of four different advanced regression algorithms. Apache Spark was used to perform tests and estimate pollution levels from a range of publicly available data sources. MAE and the root mean square error (RMSE) are often used to compare regression models. In order to find the best-fitting mode on Apache Spark, each method was tested in terms of processing time and error rate using a mix of standalone learning and fitting the hyperparameter tweaks on Apache Spark.