Air Pollution Modelling by Machine Learning Methods

WIT transactions on modelling and simulation Pub Date : 2021-11-17 DOI:10.3390/modelling2040035

P. Vidnerová, Roman Neruda

{"title":"Air Pollution Modelling by Machine Learning Methods","authors":"P. Vidnerová, Roman Neruda","doi":"10.3390/modelling2040035","DOIUrl":null,"url":null,"abstract":"Precise environmental modelling of pollutants distributions represents a key factor for addresing the issue of urban air pollution. Nowadays, urban air pollution monitoring is primarily carried out by employing sparse networks of spatially distributed fixed stations. The work in this paper aims at improving the situation by utilizing machine learning models to process the outputs of multi-sensor devices that are small, cheap, albeit less reliable, thus a massive urban deployment of those devices is possible. The main contribution of the paper is the design of a mathematical model providing sensor fusion to extract the information and transform it into the desired pollutant concentrations. Multi-sensor outputs are used as input information for a particular machine learning model trained to produce the CO, NO2, and NOx concentration estimates. Several state-of-the-art machine learning methods, including original algorithms proposed by the authors, are utilized in this study: kernel methods, regularization networks, regularization networks with composite kernels, and deep neural networks. All methods are augmented with a proper hyper-parameter search to achieve the optimal performance for each model. All the methods considered achieved vital results, deep neural networks exhibited the best generalization ability, and regularization networks with product kernels achieved the best fitting of the training set.","PeriodicalId":89310,"journal":{"name":"WIT transactions on modelling and simulation","volume":"30 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"WIT transactions on modelling and simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/modelling2040035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Precise environmental modelling of pollutants distributions represents a key factor for addresing the issue of urban air pollution. Nowadays, urban air pollution monitoring is primarily carried out by employing sparse networks of spatially distributed fixed stations. The work in this paper aims at improving the situation by utilizing machine learning models to process the outputs of multi-sensor devices that are small, cheap, albeit less reliable, thus a massive urban deployment of those devices is possible. The main contribution of the paper is the design of a mathematical model providing sensor fusion to extract the information and transform it into the desired pollutant concentrations. Multi-sensor outputs are used as input information for a particular machine learning model trained to produce the CO, NO2, and NOx concentration estimates. Several state-of-the-art machine learning methods, including original algorithms proposed by the authors, are utilized in this study: kernel methods, regularization networks, regularization networks with composite kernels, and deep neural networks. All methods are augmented with a proper hyper-parameter search to achieve the optimal performance for each model. All the methods considered achieved vital results, deep neural networks exhibited the best generalization ability, and regularization networks with product kernels achieved the best fitting of the training set.

查看原文本刊更多论文

基于机器学习方法的空气污染建模

对污染物分布进行精确的环境模拟是解决城市空气污染问题的关键因素。目前，城市大气污染监测主要采用空间分布的固定站点稀疏网络进行。本文的工作旨在通过利用机器学习模型来处理小型，廉价，尽管不太可靠的多传感器设备的输出来改善这种情况，因此这些设备的大规模城市部署是可能的。本文的主要贡献是设计了一个数学模型，提供传感器融合来提取信息并将其转换为所需的污染物浓度。多传感器输出被用作特定机器学习模型的输入信息，该模型经过训练后产生CO、NO2和NOx浓度估计。本研究中使用了几种最先进的机器学习方法，包括作者提出的原始算法:核方法、正则化网络、复合核正则化网络和深度神经网络。所有方法都通过适当的超参数搜索来增强，以达到每个模型的最佳性能。结果表明，深度神经网络的泛化能力最好，带乘积核的正则化网络的训练集拟合效果最好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

WIT transactions on modelling and simulation

自引率

0.00%

发文量