基于机器和深度学习模型的非监测城市空气质量预测

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications Pub Date : 2025-04-29 DOI:10.1016/j.eswa.2025.127749

Fernando Illescas-Martinez, Laura Garcia, Antonio-Javier Garcia-Sanchez, Rafael Asorey-Cacheda, Joan Garcia-Haro

{"title":"基于机器和深度学习模型的非监测城市空气质量预测","authors":"Fernando Illescas-Martinez, Laura Garcia, Antonio-Javier Garcia-Sanchez, Rafael Asorey-Cacheda, Joan Garcia-Haro","doi":"10.1016/j.eswa.2025.127749","DOIUrl":null,"url":null,"abstract":"<div><div>Air pollution poses a major environmental challenge, raising concerns about human health in urban environments. It leads to diseases such as asthma, exacerbates pulmonary conditions, and creates murky skies, lowering inhabitants’ quality of life. To quantify air pollution, cost-effective IoT (Internet of Things) devices are being deployed in cities, making air quality monitoring available for a wide range of end-users, including public administrations. However, full urban coverage is unfeasible, and awareness of the carbon footprint of IoT deployments is increasing. Therefore, new techniques are needed to maximize the value of IoT networks with reduced infrastructure. To address these challenges, this paper presents an air pollution analytical forecasting solution based on deep-learning/machine-learning techniques to estimate air quality in locations without deployed devices. Different combinations of well-known deep-learning models are compared with machine-learning techniques to determine the best approach for monitoring polluting gases and airborne particles based on well-defined evaluation metrics. Additionally, two new deep-learning techniques, Multipath-CNN-LSTM (M-CNN-LSTM) and Multipath-CNN-BiLSTM (M-CNN-BiLSTM), are proposed to conduct a more exhaustive comparison. Combinations of LSTM (Long Short-Term Memory) techniques give the best results, with different models working best for each pollutant. Specifically, LSTM was optimal for O<sub>3</sub>, and combinations of CNN (Convolutional Neural Networks) and BiLSTM (Bidirectional LSTM) worked best for NO<sub>2</sub>. GRU (Gated Recurrent Unit) was more efficient for PM<sub>2.5</sub>, and BiLSTM performed best for PM<sub>10</sub>. This demonstrates that the best strategy to accurately predict the time evolution of each pollutant’s behavior depends on the selection of the most suitable machine-learning or deep-learning technique.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"284 ","pages":"Article 127749"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Air quality forecasting in non-monitored urban areas through machine and deep-learning model\",\"authors\":\"Fernando Illescas-Martinez, Laura Garcia, Antonio-Javier Garcia-Sanchez, Rafael Asorey-Cacheda, Joan Garcia-Haro\",\"doi\":\"10.1016/j.eswa.2025.127749\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Air pollution poses a major environmental challenge, raising concerns about human health in urban environments. It leads to diseases such as asthma, exacerbates pulmonary conditions, and creates murky skies, lowering inhabitants’ quality of life. To quantify air pollution, cost-effective IoT (Internet of Things) devices are being deployed in cities, making air quality monitoring available for a wide range of end-users, including public administrations. However, full urban coverage is unfeasible, and awareness of the carbon footprint of IoT deployments is increasing. Therefore, new techniques are needed to maximize the value of IoT networks with reduced infrastructure. To address these challenges, this paper presents an air pollution analytical forecasting solution based on deep-learning/machine-learning techniques to estimate air quality in locations without deployed devices. Different combinations of well-known deep-learning models are compared with machine-learning techniques to determine the best approach for monitoring polluting gases and airborne particles based on well-defined evaluation metrics. Additionally, two new deep-learning techniques, Multipath-CNN-LSTM (M-CNN-LSTM) and Multipath-CNN-BiLSTM (M-CNN-BiLSTM), are proposed to conduct a more exhaustive comparison. Combinations of LSTM (Long Short-Term Memory) techniques give the best results, with different models working best for each pollutant. Specifically, LSTM was optimal for O<sub>3</sub>, and combinations of CNN (Convolutional Neural Networks) and BiLSTM (Bidirectional LSTM) worked best for NO<sub>2</sub>. GRU (Gated Recurrent Unit) was more efficient for PM<sub>2.5</sub>, and BiLSTM performed best for PM<sub>10</sub>. This demonstrates that the best strategy to accurately predict the time evolution of each pollutant’s behavior depends on the selection of the most suitable machine-learning or deep-learning technique.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"284 \",\"pages\":\"Article 127749\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425013715\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425013715","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

空气污染是一项重大的环境挑战，引起人们对城市环境中人类健康的关注。它会导致哮喘等疾病，加剧肺部疾病，造成天空浑浊，降低居民的生活质量。为了量化空气污染，成本效益高的物联网设备正在城市中部署，为包括公共管理部门在内的广泛最终用户提供空气质量监测。然而，完全覆盖城市是不可行的，并且对物联网部署的碳足迹的认识正在增加。因此，需要新技术在减少基础设施的情况下最大化物联网网络的价值。为了应对这些挑战，本文提出了一种基于深度学习/机器学习技术的空气污染分析预测解决方案，以估计没有部署设备的地点的空气质量。将知名深度学习模型的不同组合与机器学习技术进行比较，以确定基于定义良好的评估指标监测污染气体和空气中颗粒的最佳方法。此外，本文还提出了两种新的深度学习技术，即Multipath-CNN-LSTM （M-CNN-LSTM）和Multipath-CNN-BiLSTM (M-CNN-BiLSTM)，以进行更详尽的比较。LSTM（长短期记忆）技术的组合给出了最好的结果，不同的模型对每种污染物都有最好的效果。具体而言，LSTM对O3最优，CNN（卷积神经网络）和BiLSTM（双向LSTM）的组合对NO2最优。GRU（门控循环单元）对PM2.5更有效，BiLSTM对PM10效果最好。这表明，准确预测每种污染物行为的时间演变的最佳策略取决于选择最合适的机器学习或深度学习技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Air quality forecasting in non-monitored urban areas through machine and deep-learning model

查看原文本刊更多论文

Air quality forecasting in non-monitored urban areas through machine and deep-learning model

Air pollution poses a major environmental challenge, raising concerns about human health in urban environments. It leads to diseases such as asthma, exacerbates pulmonary conditions, and creates murky skies, lowering inhabitants’ quality of life. To quantify air pollution, cost-effective IoT (Internet of Things) devices are being deployed in cities, making air quality monitoring available for a wide range of end-users, including public administrations. However, full urban coverage is unfeasible, and awareness of the carbon footprint of IoT deployments is increasing. Therefore, new techniques are needed to maximize the value of IoT networks with reduced infrastructure. To address these challenges, this paper presents an air pollution analytical forecasting solution based on deep-learning/machine-learning techniques to estimate air quality in locations without deployed devices. Different combinations of well-known deep-learning models are compared with machine-learning techniques to determine the best approach for monitoring polluting gases and airborne particles based on well-defined evaluation metrics. Additionally, two new deep-learning techniques, Multipath-CNN-LSTM (M-CNN-LSTM) and Multipath-CNN-BiLSTM (M-CNN-BiLSTM), are proposed to conduct a more exhaustive comparison. Combinations of LSTM (Long Short-Term Memory) techniques give the best results, with different models working best for each pollutant. Specifically, LSTM was optimal for O₃, and combinations of CNN (Convolutional Neural Networks) and BiLSTM (Bidirectional LSTM) worked best for NO₂. GRU (Gated Recurrent Unit) was more efficient for PM_2.5, and BiLSTM performed best for PM₁₀. This demonstrates that the best strategy to accurately predict the time evolution of each pollutant’s behavior depends on the selection of the most suitable machine-learning or deep-learning technique.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.