Air quality forecasting based on machine and deep learning models: an IoT application

Khalid Khan, Affan Alim, Humayun Qureshi, Imran Sabir, Ibrahim Hassan
{"title":"Air quality forecasting based on machine and deep learning models: an IoT application","authors":"Khalid Khan, Affan Alim, Humayun Qureshi, Imran Sabir, Ibrahim Hassan","doi":"10.30537/sjcms.v7i1.1259","DOIUrl":null,"url":null,"abstract":"Harmful gasoline and particulate objects that exist in the air and above the cut-off values are dangerous for human, animal, and plant health. Essentially, it leads to lung cancer, throat infection, heart attack, and other diseases. The early forecasting of these objects may help for precautions of safety.  In this paper, it is proposed to use the regression-based model auto regression integrated moving average (AIRMA) and deep learning-based model long short-term memory (LSTM) for air quality prediction. The air quality forecasting performance also depends on the quality of the available dataset. In this study, real-time data is collected from 10 different locations based on an IoT system, which is developed locally for a funded project of the Higher Education Commission (HEC). The main idea of this study is to validate the real-time collected dataset. Two objects, particulate PM2.5, and gasoline Ammonia are considered for four different locations for forecasting. Due to several issues such that electricity, Wi-Fi, sensor calibration, and collected data are not in their finest position. A number of prepossessing steps are applied to raw data to bring it into a usable form. Regardless of these issues, proposed models based on data collected by IoT system, outperform two air objects PM2.5 and Ammonia. For the case of Ammonia, an RMSE value of 0.562 is obtained which is very low to the mean value of 5.15 which indicates high performance. Similarly, very close values of 0.186 and 0.133 of RMSE and MAE were achieved respectively, and reflect the low variance in error. The LSTM-based experiment for Ammonia prediction, comparable to a very low RMSE value of 1.948 is achieved from the corresponding mean. A very small difference value of 0.287 between RMSE and MAE is obtained indicating a low variance in predicting error and high precision.","PeriodicalId":32391,"journal":{"name":"Sukkur IBA Journal of Computing and Mathematical Sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sukkur IBA Journal of Computing and Mathematical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30537/sjcms.v7i1.1259","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Harmful gasoline and particulate objects that exist in the air and above the cut-off values are dangerous for human, animal, and plant health. Essentially, it leads to lung cancer, throat infection, heart attack, and other diseases. The early forecasting of these objects may help for precautions of safety.  In this paper, it is proposed to use the regression-based model auto regression integrated moving average (AIRMA) and deep learning-based model long short-term memory (LSTM) for air quality prediction. The air quality forecasting performance also depends on the quality of the available dataset. In this study, real-time data is collected from 10 different locations based on an IoT system, which is developed locally for a funded project of the Higher Education Commission (HEC). The main idea of this study is to validate the real-time collected dataset. Two objects, particulate PM2.5, and gasoline Ammonia are considered for four different locations for forecasting. Due to several issues such that electricity, Wi-Fi, sensor calibration, and collected data are not in their finest position. A number of prepossessing steps are applied to raw data to bring it into a usable form. Regardless of these issues, proposed models based on data collected by IoT system, outperform two air objects PM2.5 and Ammonia. For the case of Ammonia, an RMSE value of 0.562 is obtained which is very low to the mean value of 5.15 which indicates high performance. Similarly, very close values of 0.186 and 0.133 of RMSE and MAE were achieved respectively, and reflect the low variance in error. The LSTM-based experiment for Ammonia prediction, comparable to a very low RMSE value of 1.948 is achieved from the corresponding mean. A very small difference value of 0.287 between RMSE and MAE is obtained indicating a low variance in predicting error and high precision.
基于机器和深度学习模型的空气质量预测:物联网应用
存在于空气中并超过临界值的有害汽油和颗粒物体对人类、动物和植物的健康是危险的。从本质上讲,它会导致肺癌、喉咙感染、心脏病发作和其他疾病。对这些物体的早期预报有助于安全防范。本文提出了基于回归的模型自回归综合移动平均(AIRMA)和基于深度学习的模型长短期记忆(LSTM)的空气质量预测方法。空气质量预报的效果也取决于可用数据集的质量。在这项研究中,基于物联网系统从10个不同地点收集实时数据,该系统是为高等教育委员会(HEC)资助的一个项目在当地开发的。本研究的主要思想是对实时采集的数据集进行验证。PM2.5颗粒和汽油氨这两个目标被考虑用于四个不同地点的预测。由于电力,Wi-Fi,传感器校准和收集的数据等几个问题没有处于最佳位置。对原始数据进行一些处理步骤,使其变成可用的形式。尽管存在这些问题,基于物联网系统收集的数据提出的模型优于PM2.5和氨这两个空气对象。对于氨的情况,RMSE值为0.562,这是非常低的平均值5.15,这表明高性能。同样,RMSE和MAE的值分别为0.186和0.133,非常接近,反映了误差的低方差。基于lstm的氨预测实验,从相应的均值得到了非常低的RMSE值1.948。RMSE与MAE之间的差值非常小,为0.287,表明预测误差方差小,精度高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
10
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信