Fernando Illescas-Martinez, Laura Garcia, Antonio-Javier Garcia-Sanchez, Rafael Asorey-Cacheda, Joan Garcia-Haro
{"title":"基于机器和深度学习模型的非监测城市空气质量预测","authors":"Fernando Illescas-Martinez, Laura Garcia, Antonio-Javier Garcia-Sanchez, Rafael Asorey-Cacheda, Joan Garcia-Haro","doi":"10.1016/j.eswa.2025.127749","DOIUrl":null,"url":null,"abstract":"<div><div>Air pollution poses a major environmental challenge, raising concerns about human health in urban environments. It leads to diseases such as asthma, exacerbates pulmonary conditions, and creates murky skies, lowering inhabitants’ quality of life. To quantify air pollution, cost-effective IoT (Internet of Things) devices are being deployed in cities, making air quality monitoring available for a wide range of end-users, including public administrations. However, full urban coverage is unfeasible, and awareness of the carbon footprint of IoT deployments is increasing. Therefore, new techniques are needed to maximize the value of IoT networks with reduced infrastructure. To address these challenges, this paper presents an air pollution analytical forecasting solution based on deep-learning/machine-learning techniques to estimate air quality in locations without deployed devices. Different combinations of well-known deep-learning models are compared with machine-learning techniques to determine the best approach for monitoring polluting gases and airborne particles based on well-defined evaluation metrics. Additionally, two new deep-learning techniques, Multipath-CNN-LSTM (M-CNN-LSTM) and Multipath-CNN-BiLSTM (M-CNN-BiLSTM), are proposed to conduct a more exhaustive comparison. Combinations of LSTM (Long Short-Term Memory) techniques give the best results, with different models working best for each pollutant. Specifically, LSTM was optimal for O<sub>3</sub>, and combinations of CNN (Convolutional Neural Networks) and BiLSTM (Bidirectional LSTM) worked best for NO<sub>2</sub>. GRU (Gated Recurrent Unit) was more efficient for PM<sub>2.5</sub>, and BiLSTM performed best for PM<sub>10</sub>. This demonstrates that the best strategy to accurately predict the time evolution of each pollutant’s behavior depends on the selection of the most suitable machine-learning or deep-learning technique.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"284 ","pages":"Article 127749"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Air quality forecasting in non-monitored urban areas through machine and deep-learning model\",\"authors\":\"Fernando Illescas-Martinez, Laura Garcia, Antonio-Javier Garcia-Sanchez, Rafael Asorey-Cacheda, Joan Garcia-Haro\",\"doi\":\"10.1016/j.eswa.2025.127749\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Air pollution poses a major environmental challenge, raising concerns about human health in urban environments. It leads to diseases such as asthma, exacerbates pulmonary conditions, and creates murky skies, lowering inhabitants’ quality of life. To quantify air pollution, cost-effective IoT (Internet of Things) devices are being deployed in cities, making air quality monitoring available for a wide range of end-users, including public administrations. However, full urban coverage is unfeasible, and awareness of the carbon footprint of IoT deployments is increasing. Therefore, new techniques are needed to maximize the value of IoT networks with reduced infrastructure. To address these challenges, this paper presents an air pollution analytical forecasting solution based on deep-learning/machine-learning techniques to estimate air quality in locations without deployed devices. Different combinations of well-known deep-learning models are compared with machine-learning techniques to determine the best approach for monitoring polluting gases and airborne particles based on well-defined evaluation metrics. Additionally, two new deep-learning techniques, Multipath-CNN-LSTM (M-CNN-LSTM) and Multipath-CNN-BiLSTM (M-CNN-BiLSTM), are proposed to conduct a more exhaustive comparison. Combinations of LSTM (Long Short-Term Memory) techniques give the best results, with different models working best for each pollutant. Specifically, LSTM was optimal for O<sub>3</sub>, and combinations of CNN (Convolutional Neural Networks) and BiLSTM (Bidirectional LSTM) worked best for NO<sub>2</sub>. GRU (Gated Recurrent Unit) was more efficient for PM<sub>2.5</sub>, and BiLSTM performed best for PM<sub>10</sub>. This demonstrates that the best strategy to accurately predict the time evolution of each pollutant’s behavior depends on the selection of the most suitable machine-learning or deep-learning technique.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"284 \",\"pages\":\"Article 127749\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425013715\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425013715","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Air quality forecasting in non-monitored urban areas through machine and deep-learning model
Air pollution poses a major environmental challenge, raising concerns about human health in urban environments. It leads to diseases such as asthma, exacerbates pulmonary conditions, and creates murky skies, lowering inhabitants’ quality of life. To quantify air pollution, cost-effective IoT (Internet of Things) devices are being deployed in cities, making air quality monitoring available for a wide range of end-users, including public administrations. However, full urban coverage is unfeasible, and awareness of the carbon footprint of IoT deployments is increasing. Therefore, new techniques are needed to maximize the value of IoT networks with reduced infrastructure. To address these challenges, this paper presents an air pollution analytical forecasting solution based on deep-learning/machine-learning techniques to estimate air quality in locations without deployed devices. Different combinations of well-known deep-learning models are compared with machine-learning techniques to determine the best approach for monitoring polluting gases and airborne particles based on well-defined evaluation metrics. Additionally, two new deep-learning techniques, Multipath-CNN-LSTM (M-CNN-LSTM) and Multipath-CNN-BiLSTM (M-CNN-BiLSTM), are proposed to conduct a more exhaustive comparison. Combinations of LSTM (Long Short-Term Memory) techniques give the best results, with different models working best for each pollutant. Specifically, LSTM was optimal for O3, and combinations of CNN (Convolutional Neural Networks) and BiLSTM (Bidirectional LSTM) worked best for NO2. GRU (Gated Recurrent Unit) was more efficient for PM2.5, and BiLSTM performed best for PM10. This demonstrates that the best strategy to accurately predict the time evolution of each pollutant’s behavior depends on the selection of the most suitable machine-learning or deep-learning technique.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.