基于集成机器学习方法的空气质量评估预测

2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF) Pub Date : 2023-01-05 DOI:10.1109/ICECONF57129.2023.10083751

P. William, Deepak Paithankar, P. Yawalkar, Sachin K. Korde, Abhijeet Rajendra, Pabale, D. Rakshe

{"title":"基于集成机器学习方法的空气质量评估预测","authors":"P. William, Deepak Paithankar, P. Yawalkar, Sachin K. Korde, Abhijeet Rajendra, Pabale, D. Rakshe","doi":"10.1109/ICECONF57129.2023.10083751","DOIUrl":null,"url":null,"abstract":"Smart cities must address air pollution as a top environmental concern. Real-time monitoring of pollution data enables metropolitan authorities to analyze the city's current traffic conditions and implement necessary corrective actions. The increased usage of Internet of things (IoT)-based sensors has altered the dynamics of air quality prediction significantly. While earlier research has used a number of machine learning techniques to anticipate pollution, it is usually necessary to compare various tactics in order to better understand how long they take to analyse different datasets. The best model for accurately predicting air quality given the amount of data available and the processing time required was determined by a comparative study of four different advanced regression algorithms. Apache Spark was used to perform tests and estimate pollution levels from a range of publicly available data sources. MAE and the root mean square error (RMSE) are often used to compare regression models. In order to find the best-fitting mode on Apache Spark, each method was tested in terms of processing time and error rate using a mix of standalone learning and fitting the hyperparameter tweaks on Apache Spark.","PeriodicalId":436733,"journal":{"name":"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)","volume":"75 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":"{\"title\":\"Divination of Air Quality Assessment using Ensembling Machine Learning Approach\",\"authors\":\"P. William, Deepak Paithankar, P. Yawalkar, Sachin K. Korde, Abhijeet Rajendra, Pabale, D. Rakshe\",\"doi\":\"10.1109/ICECONF57129.2023.10083751\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Smart cities must address air pollution as a top environmental concern. Real-time monitoring of pollution data enables metropolitan authorities to analyze the city's current traffic conditions and implement necessary corrective actions. The increased usage of Internet of things (IoT)-based sensors has altered the dynamics of air quality prediction significantly. While earlier research has used a number of machine learning techniques to anticipate pollution, it is usually necessary to compare various tactics in order to better understand how long they take to analyse different datasets. The best model for accurately predicting air quality given the amount of data available and the processing time required was determined by a comparative study of four different advanced regression algorithms. Apache Spark was used to perform tests and estimate pollution levels from a range of publicly available data sources. MAE and the root mean square error (RMSE) are often used to compare regression models. In order to find the best-fitting mode on Apache Spark, each method was tested in terms of processing time and error rate using a mix of standalone learning and fitting the hyperparameter tweaks on Apache Spark.\",\"PeriodicalId\":436733,\"journal\":{\"name\":\"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)\",\"volume\":\"75 2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"20\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECONF57129.2023.10083751\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECONF57129.2023.10083751","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 20

摘要

智慧城市必须解决空气污染这一首要环境问题。污染数据的实时监测使城市当局能够分析城市当前的交通状况并实施必要的纠正措施。基于物联网(IoT)的传感器的使用增加，极大地改变了空气质量预测的动态。虽然早期的研究使用了许多机器学习技术来预测污染，但通常有必要比较各种策略，以便更好地了解分析不同数据集需要多长时间。在给定数据量和所需处理时间的情况下，准确预测空气质量的最佳模型是通过对四种不同的高级回归算法的比较研究确定的。Apache Spark用于从一系列公开可用的数据源执行测试和估计污染水平。MAE和均方根误差(RMSE)常用于比较回归模型。为了找到Apache Spark上的最佳拟合模式，每种方法都在处理时间和错误率方面进行了测试，使用独立学习和拟合Apache Spark上的超参数调整。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Divination of Air Quality Assessment using Ensembling Machine Learning Approach

Smart cities must address air pollution as a top environmental concern. Real-time monitoring of pollution data enables metropolitan authorities to analyze the city's current traffic conditions and implement necessary corrective actions. The increased usage of Internet of things (IoT)-based sensors has altered the dynamics of air quality prediction significantly. While earlier research has used a number of machine learning techniques to anticipate pollution, it is usually necessary to compare various tactics in order to better understand how long they take to analyse different datasets. The best model for accurately predicting air quality given the amount of data available and the processing time required was determined by a comparative study of four different advanced regression algorithms. Apache Spark was used to perform tests and estimate pollution levels from a range of publicly available data sources. MAE and the root mean square error (RMSE) are often used to compare regression models. In order to find the best-fitting mode on Apache Spark, each method was tested in terms of processing time and error rate using a mix of standalone learning and fitting the hyperparameter tweaks on Apache Spark.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)

自引率

0.00%

发文量