基于集成机器学习方法的空气质量评估预测

P. William, Deepak Paithankar, P. Yawalkar, Sachin K. Korde, Abhijeet Rajendra, Pabale, D. Rakshe
{"title":"基于集成机器学习方法的空气质量评估预测","authors":"P. William, Deepak Paithankar, P. Yawalkar, Sachin K. Korde, Abhijeet Rajendra, Pabale, D. Rakshe","doi":"10.1109/ICECONF57129.2023.10083751","DOIUrl":null,"url":null,"abstract":"Smart cities must address air pollution as a top environmental concern. Real-time monitoring of pollution data enables metropolitan authorities to analyze the city's current traffic conditions and implement necessary corrective actions. The increased usage of Internet of things (IoT)-based sensors has altered the dynamics of air quality prediction significantly. While earlier research has used a number of machine learning techniques to anticipate pollution, it is usually necessary to compare various tactics in order to better understand how long they take to analyse different datasets. The best model for accurately predicting air quality given the amount of data available and the processing time required was determined by a comparative study of four different advanced regression algorithms. Apache Spark was used to perform tests and estimate pollution levels from a range of publicly available data sources. MAE and the root mean square error (RMSE) are often used to compare regression models. In order to find the best-fitting mode on Apache Spark, each method was tested in terms of processing time and error rate using a mix of standalone learning and fitting the hyperparameter tweaks on Apache Spark.","PeriodicalId":436733,"journal":{"name":"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)","volume":"75 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":"{\"title\":\"Divination of Air Quality Assessment using Ensembling Machine Learning Approach\",\"authors\":\"P. William, Deepak Paithankar, P. Yawalkar, Sachin K. Korde, Abhijeet Rajendra, Pabale, D. Rakshe\",\"doi\":\"10.1109/ICECONF57129.2023.10083751\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Smart cities must address air pollution as a top environmental concern. Real-time monitoring of pollution data enables metropolitan authorities to analyze the city's current traffic conditions and implement necessary corrective actions. The increased usage of Internet of things (IoT)-based sensors has altered the dynamics of air quality prediction significantly. While earlier research has used a number of machine learning techniques to anticipate pollution, it is usually necessary to compare various tactics in order to better understand how long they take to analyse different datasets. The best model for accurately predicting air quality given the amount of data available and the processing time required was determined by a comparative study of four different advanced regression algorithms. Apache Spark was used to perform tests and estimate pollution levels from a range of publicly available data sources. MAE and the root mean square error (RMSE) are often used to compare regression models. In order to find the best-fitting mode on Apache Spark, each method was tested in terms of processing time and error rate using a mix of standalone learning and fitting the hyperparameter tweaks on Apache Spark.\",\"PeriodicalId\":436733,\"journal\":{\"name\":\"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)\",\"volume\":\"75 2\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"20\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECONF57129.2023.10083751\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECONF57129.2023.10083751","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 20

摘要

智慧城市必须解决空气污染这一首要环境问题。污染数据的实时监测使城市当局能够分析城市当前的交通状况并实施必要的纠正措施。基于物联网(IoT)的传感器的使用增加,极大地改变了空气质量预测的动态。虽然早期的研究使用了许多机器学习技术来预测污染,但通常有必要比较各种策略,以便更好地了解分析不同数据集需要多长时间。在给定数据量和所需处理时间的情况下,准确预测空气质量的最佳模型是通过对四种不同的高级回归算法的比较研究确定的。Apache Spark用于从一系列公开可用的数据源执行测试和估计污染水平。MAE和均方根误差(RMSE)常用于比较回归模型。为了找到Apache Spark上的最佳拟合模式,每种方法都在处理时间和错误率方面进行了测试,使用独立学习和拟合Apache Spark上的超参数调整。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Divination of Air Quality Assessment using Ensembling Machine Learning Approach
Smart cities must address air pollution as a top environmental concern. Real-time monitoring of pollution data enables metropolitan authorities to analyze the city's current traffic conditions and implement necessary corrective actions. The increased usage of Internet of things (IoT)-based sensors has altered the dynamics of air quality prediction significantly. While earlier research has used a number of machine learning techniques to anticipate pollution, it is usually necessary to compare various tactics in order to better understand how long they take to analyse different datasets. The best model for accurately predicting air quality given the amount of data available and the processing time required was determined by a comparative study of four different advanced regression algorithms. Apache Spark was used to perform tests and estimate pollution levels from a range of publicly available data sources. MAE and the root mean square error (RMSE) are often used to compare regression models. In order to find the best-fitting mode on Apache Spark, each method was tested in terms of processing time and error rate using a mix of standalone learning and fitting the hyperparameter tweaks on Apache Spark.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信