Short-term air quality prediction using point and interval deep learning systems coupled with multi-factor decomposition and data-driven tree compression

IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Jinxing Che , Kun Hu , Wenxin Xia , Yifan Xu , Yuerong Li
{"title":"Short-term air quality prediction using point and interval deep learning systems coupled with multi-factor decomposition and data-driven tree compression","authors":"Jinxing Che ,&nbsp;Kun Hu ,&nbsp;Wenxin Xia ,&nbsp;Yifan Xu ,&nbsp;Yuerong Li","doi":"10.1016/j.asoc.2024.112191","DOIUrl":null,"url":null,"abstract":"<div><p>Clean air, as a symbol of high-quality air quality, is the most basic requirement for people to maintain health. Moreover, in keeping humans fit, accurate short-term air quality prediction is vital. The decomposition algorithm can better capture the local features and temporal changes of the data. However, it increases the computation time, resource consumption, and complexity of the model. On the other hand, existing forecasting systems overlook instability and uncertainty. To solve the above problems, a deterministic and uncertainty AOA-DBGRU-MDN deep learning systems is proposed, which combines arithmetic optimization algorithm (AOA), double-layer bi-directional GRUs (DBGRU), and mixture density network (MDN). The above systems consider meteorological factors and air pollutants comprehensively. It involves feature selection using maximum information coefficient (MIC), decomposition using complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) algorithm, classification, and compression of decomposed components using entropy-Huffman tree compression. Firstly, the information measurement process reduces the number of components significantly. Following the incorporation of multi-factor data, the optimal DBGRU model is then obtained using AOA. Finally, the training errors are fitted using MDN to obtain interval prediction results. The experiments demonstrate that (1) Using the CEEMDAN algorithm can improve the prediction accuracy; (2) Classifying and reconstructing the data based on entropy-Huffman tree compression can not only decrease the model's training volume and improve training efficiency but also boost the model's prediction accuracy; (3) The AOA-DBGRU-MDN system performs probabilistic prediction to obtain an effective and intuitive prediction interval to improve the point prediction of air quality prediction.</p></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":null,"pages":null},"PeriodicalIF":7.2000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494624009657","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Clean air, as a symbol of high-quality air quality, is the most basic requirement for people to maintain health. Moreover, in keeping humans fit, accurate short-term air quality prediction is vital. The decomposition algorithm can better capture the local features and temporal changes of the data. However, it increases the computation time, resource consumption, and complexity of the model. On the other hand, existing forecasting systems overlook instability and uncertainty. To solve the above problems, a deterministic and uncertainty AOA-DBGRU-MDN deep learning systems is proposed, which combines arithmetic optimization algorithm (AOA), double-layer bi-directional GRUs (DBGRU), and mixture density network (MDN). The above systems consider meteorological factors and air pollutants comprehensively. It involves feature selection using maximum information coefficient (MIC), decomposition using complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) algorithm, classification, and compression of decomposed components using entropy-Huffman tree compression. Firstly, the information measurement process reduces the number of components significantly. Following the incorporation of multi-factor data, the optimal DBGRU model is then obtained using AOA. Finally, the training errors are fitted using MDN to obtain interval prediction results. The experiments demonstrate that (1) Using the CEEMDAN algorithm can improve the prediction accuracy; (2) Classifying and reconstructing the data based on entropy-Huffman tree compression can not only decrease the model's training volume and improve training efficiency but also boost the model's prediction accuracy; (3) The AOA-DBGRU-MDN system performs probabilistic prediction to obtain an effective and intuitive prediction interval to improve the point prediction of air quality prediction.

利用多因素分解和数据驱动树压缩的点和区间深度学习系统进行短期空气质量预测
洁净的空气象征着高质量的空气质量,是人们保持健康的最基本要求。此外,要保持人体健康,准确的短期空气质量预测也至关重要。分解算法能更好地捕捉数据的局部特征和时间变化。但是,它增加了计算时间、资源消耗和模型的复杂性。另一方面,现有的预测系统忽视了不稳定性和不确定性。为解决上述问题,本文提出了一种确定性和不确定性 AOA-DBGRU-MDN 深度学习系统,该系统结合了算术优化算法(AOA)、双层双向 GRU(DBGRU)和混合密度网络(MDN)。上述系统综合考虑了气象因素和空气污染物。它包括使用最大信息系数(MIC)进行特征选择、使用带自适应噪声的完全集合经验模式分解(CEEMDAN)算法进行分解、分类,以及使用熵-哈夫曼树压缩对分解后的成分进行压缩。首先,信息测量过程大大减少了分量的数量。在纳入多因素数据后,使用 AOA 获得 DBGRU 的最优模型。最后,利用 MDN 对训练误差进行拟合,得到区间预测结果。实验证明:(1)使用 CEEMDAN 算法可以提高预测精度;(2)基于熵-哈夫曼树压缩对数据进行分类和重构,不仅可以减少模型的训练量,提高训练效率,还可以提高模型的预测精度;(3)AOA-DBGRU-MDN 系统进行概率预测,得到有效直观的预测区间,提高空气质量预测的点预测效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Applied Soft Computing
Applied Soft Computing 工程技术-计算机:跨学科应用
CiteScore
15.80
自引率
6.90%
发文量
874
审稿时长
10.9 months
期刊介绍: Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信