利用机器学习分析和预测大气臭氧浓度。

IF 2.4 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS
Frontiers in Big Data Pub Date : 2025-01-15 eCollection Date: 2024-01-01 DOI:10.3389/fdata.2024.1469809
Stephan Räss, Markus C Leuenberger
{"title":"利用机器学习分析和预测大气臭氧浓度。","authors":"Stephan Räss, Markus C Leuenberger","doi":"10.3389/fdata.2024.1469809","DOIUrl":null,"url":null,"abstract":"<p><p>Atmospheric ozone chemistry involves various substances and reactions, which makes it a complex system. We analyzed data recorded by Switzerland's National Air Pollution Monitoring Network (NABEL) to showcase the capabilities of machine learning (ML) for the prediction of ozone concentrations (daily averages) and to document a general approach that can be followed by anyone facing similar problems. We evaluated various artificial neural networks and compared them to linear as well as non-linear models deduced with ML. The main analyses and the training of the models were performed on atmospheric air data recorded from 2016 to 2023 at the NABEL station Lugano-Università in Lugano, TI, Switzerland. As a first step, we used techniques like best subset selection to determine the measurement parameters that might be relevant for the prediction of ozone concentrations; in general, the parameters identified by these methods agree with atmospheric ozone chemistry. Based on these results, we constructed various models and used them to predict ozone concentrations in Lugano for the period between January 1, 2024, and March 31, 2024; then, we compared the output of our models to the actual measurements and repeated this procedure for two NABEL stations situated in northern Switzerland (Dübendorf-Empa and Zürich-Kaserne). For these stations, predictions were made for the aforementioned period and the period between January 1, 2023, and December 31, 2023. In most of the cases, the lowest mean absolute errors (MAE) were provided by a non-linear model with 12 components (different powers and linear combinations of NO<sub>2</sub>, NO<sub>X</sub>, SO<sub>2</sub>, non-methane volatile organic compounds, temperature and radiation); the MAE of predicted ozone concentrations in Lugano was as low as 9 μgm<sup>-3</sup>. For the stations in Zürich and Dübendorf, the lowest MAEs were around 11 μgm<sup>-3</sup> and 13 μgm<sup>-3</sup>, respectively. For the tested periods, the accuracy of the best models was approximately 1 μgm<sup>-3</sup>. Since the aforementioned values are all lower than the standard deviations of the observations we conclude that using ML for complex data analyses can be very helpful and that artificial neural networks do not necessarily outperform simpler models.</p>","PeriodicalId":52859,"journal":{"name":"Frontiers in Big Data","volume":"7 ","pages":"1469809"},"PeriodicalIF":2.4000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11774898/pdf/","citationCount":"0","resultStr":"{\"title\":\"Analysis and prediction of atmospheric ozone concentrations using machine learning.\",\"authors\":\"Stephan Räss, Markus C Leuenberger\",\"doi\":\"10.3389/fdata.2024.1469809\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Atmospheric ozone chemistry involves various substances and reactions, which makes it a complex system. We analyzed data recorded by Switzerland's National Air Pollution Monitoring Network (NABEL) to showcase the capabilities of machine learning (ML) for the prediction of ozone concentrations (daily averages) and to document a general approach that can be followed by anyone facing similar problems. We evaluated various artificial neural networks and compared them to linear as well as non-linear models deduced with ML. The main analyses and the training of the models were performed on atmospheric air data recorded from 2016 to 2023 at the NABEL station Lugano-Università in Lugano, TI, Switzerland. As a first step, we used techniques like best subset selection to determine the measurement parameters that might be relevant for the prediction of ozone concentrations; in general, the parameters identified by these methods agree with atmospheric ozone chemistry. Based on these results, we constructed various models and used them to predict ozone concentrations in Lugano for the period between January 1, 2024, and March 31, 2024; then, we compared the output of our models to the actual measurements and repeated this procedure for two NABEL stations situated in northern Switzerland (Dübendorf-Empa and Zürich-Kaserne). For these stations, predictions were made for the aforementioned period and the period between January 1, 2023, and December 31, 2023. In most of the cases, the lowest mean absolute errors (MAE) were provided by a non-linear model with 12 components (different powers and linear combinations of NO<sub>2</sub>, NO<sub>X</sub>, SO<sub>2</sub>, non-methane volatile organic compounds, temperature and radiation); the MAE of predicted ozone concentrations in Lugano was as low as 9 μgm<sup>-3</sup>. For the stations in Zürich and Dübendorf, the lowest MAEs were around 11 μgm<sup>-3</sup> and 13 μgm<sup>-3</sup>, respectively. For the tested periods, the accuracy of the best models was approximately 1 μgm<sup>-3</sup>. Since the aforementioned values are all lower than the standard deviations of the observations we conclude that using ML for complex data analyses can be very helpful and that artificial neural networks do not necessarily outperform simpler models.</p>\",\"PeriodicalId\":52859,\"journal\":{\"name\":\"Frontiers in Big Data\",\"volume\":\"7 \",\"pages\":\"1469809\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-01-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11774898/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Big Data\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3389/fdata.2024.1469809\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Big Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fdata.2024.1469809","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

大气臭氧化学涉及多种物质和反应,是一个复杂的系统。我们分析了瑞士国家空气污染监测网络(NABEL)记录的数据,以展示机器学习(ML)预测臭氧浓度(日平均值)的能力,并记录了任何面临类似问题的人都可以遵循的一般方法。我们评估了各种人工神经网络,并将其与ML推导的线性和非线性模型进行了比较。模型的主要分析和训练是在瑞士德州卢加诺的NABEL站卢加诺大学(Lugano- universit) 2016年至2023年记录的大气数据上进行的。作为第一步,我们使用了最佳子集选择等技术来确定可能与臭氧浓度预测相关的测量参数;总的来说,这些方法确定的参数与大气臭氧化学一致。基于这些结果,我们构建了各种模型,并利用它们预测了2024年1月1日至2024年3月31日期间卢加诺的臭氧浓度;然后,我们将模型的输出与实际测量结果进行比较,并对位于瑞士北部的两个NABEL站(d本多夫-恩帕和z rich- kaserne)重复此过程。对这些台站进行了上述时期和2023年1月1日至2023年12月31日期间的预测。在大多数情况下,由NO2、NOX、SO2、非甲烷挥发性有机化合物、温度和辐射的不同幂次和线性组合组成的非线性模型提供了最低的平均绝对误差(MAE);卢加诺臭氧浓度预测MAE低至9 μgm-3。zrich和dd bendorf站点的最低MAEs分别在11 μgm-3和13 μgm-3左右。在测试周期内,最佳模型的精度约为1 μgm-3。由于上述值都低于观测值的标准差,我们得出结论,使用ML进行复杂数据分析可能非常有帮助,人工神经网络不一定优于简单模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Analysis and prediction of atmospheric ozone concentrations using machine learning.

Analysis and prediction of atmospheric ozone concentrations using machine learning.

Analysis and prediction of atmospheric ozone concentrations using machine learning.

Analysis and prediction of atmospheric ozone concentrations using machine learning.

Atmospheric ozone chemistry involves various substances and reactions, which makes it a complex system. We analyzed data recorded by Switzerland's National Air Pollution Monitoring Network (NABEL) to showcase the capabilities of machine learning (ML) for the prediction of ozone concentrations (daily averages) and to document a general approach that can be followed by anyone facing similar problems. We evaluated various artificial neural networks and compared them to linear as well as non-linear models deduced with ML. The main analyses and the training of the models were performed on atmospheric air data recorded from 2016 to 2023 at the NABEL station Lugano-Università in Lugano, TI, Switzerland. As a first step, we used techniques like best subset selection to determine the measurement parameters that might be relevant for the prediction of ozone concentrations; in general, the parameters identified by these methods agree with atmospheric ozone chemistry. Based on these results, we constructed various models and used them to predict ozone concentrations in Lugano for the period between January 1, 2024, and March 31, 2024; then, we compared the output of our models to the actual measurements and repeated this procedure for two NABEL stations situated in northern Switzerland (Dübendorf-Empa and Zürich-Kaserne). For these stations, predictions were made for the aforementioned period and the period between January 1, 2023, and December 31, 2023. In most of the cases, the lowest mean absolute errors (MAE) were provided by a non-linear model with 12 components (different powers and linear combinations of NO2, NOX, SO2, non-methane volatile organic compounds, temperature and radiation); the MAE of predicted ozone concentrations in Lugano was as low as 9 μgm-3. For the stations in Zürich and Dübendorf, the lowest MAEs were around 11 μgm-3 and 13 μgm-3, respectively. For the tested periods, the accuracy of the best models was approximately 1 μgm-3. Since the aforementioned values are all lower than the standard deviations of the observations we conclude that using ML for complex data analyses can be very helpful and that artificial neural networks do not necessarily outperform simpler models.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.20
自引率
3.20%
发文量
122
审稿时长
13 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信