Harnessing open remote sensing data and machine learning for daily ground-level ozone prediction models: Spatio-temporal insights in the continental biogeographical region
{"title":"Harnessing open remote sensing data and machine learning for daily ground-level ozone prediction models: Spatio-temporal insights in the continental biogeographical region","authors":"Luka Mamić , Francesco Pirotti","doi":"10.1016/j.apr.2025.102514","DOIUrl":null,"url":null,"abstract":"<div><div>Ground-level ozone (O<sub>3</sub>) pollution poses significant environmental and public health challenges and requires accurate predictive models for effective monitoring and management. In this study we observe that 91 % of the observed ground-level O<sub>3</sub> variance can potentially be explained using time-lagged data from Sentinel-5P TROPOMI and data from ERA5-Land datasets on a trained artificial intelligence (AI) model deployed by machine learning (ML) in the continental part of the Veneto region in Italy. Data from local air quality monitoring stations were used as ground truth data. The study period is from January 2019 to December 2022. Spatio-temporal ML models predicted daily O<sub>3</sub> concentrations with RMSE of 9.05 μg/m<sup>3</sup>, 8.87 μg/m<sup>3</sup> and 10.87 μg/m<sup>3</sup> respectively for RF, XGB and LSTM. Models without spatio-temporal information gave lower accuracy, with RMSE of 10.88 μg/m<sup>3</sup>, 11.45 μg/m<sup>3</sup> and 12.06 μg/m<sup>3</sup> respectively, showing that spatio-temporal information can improve performance more than 10 %. However, spatio-temporal independent models are more transferable across continental region and different seasons. Results provide spatially continuous maps of ground-level O<sub>3</sub> with a spatial resolution of ∼11.13 km (0.1°), helping to estimate pollution levels in areas without ground stations. Spatial analysis of the models’ performance showed consistent high accuracy across all stations, while temporal analysis revealed lower performance in summer months. Overall, while the spatial resolution of the models developed in this study is insufficient for risk management in urban areas, they have practical implications for daily ground-level O<sub>3</sub> monitoring in areas without ground stations in the continental region.</div></div>","PeriodicalId":8604,"journal":{"name":"Atmospheric Pollution Research","volume":"16 6","pages":"Article 102514"},"PeriodicalIF":3.9000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Pollution Research","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1309104225001163","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Ground-level ozone (O3) pollution poses significant environmental and public health challenges and requires accurate predictive models for effective monitoring and management. In this study we observe that 91 % of the observed ground-level O3 variance can potentially be explained using time-lagged data from Sentinel-5P TROPOMI and data from ERA5-Land datasets on a trained artificial intelligence (AI) model deployed by machine learning (ML) in the continental part of the Veneto region in Italy. Data from local air quality monitoring stations were used as ground truth data. The study period is from January 2019 to December 2022. Spatio-temporal ML models predicted daily O3 concentrations with RMSE of 9.05 μg/m3, 8.87 μg/m3 and 10.87 μg/m3 respectively for RF, XGB and LSTM. Models without spatio-temporal information gave lower accuracy, with RMSE of 10.88 μg/m3, 11.45 μg/m3 and 12.06 μg/m3 respectively, showing that spatio-temporal information can improve performance more than 10 %. However, spatio-temporal independent models are more transferable across continental region and different seasons. Results provide spatially continuous maps of ground-level O3 with a spatial resolution of ∼11.13 km (0.1°), helping to estimate pollution levels in areas without ground stations. Spatial analysis of the models’ performance showed consistent high accuracy across all stations, while temporal analysis revealed lower performance in summer months. Overall, while the spatial resolution of the models developed in this study is insufficient for risk management in urban areas, they have practical implications for daily ground-level O3 monitoring in areas without ground stations in the continental region.
期刊介绍:
Atmospheric Pollution Research (APR) is an international journal designed for the publication of articles on air pollution. Papers should present novel experimental results, theory and modeling of air pollution on local, regional, or global scales. Areas covered are research on inorganic, organic, and persistent organic air pollutants, air quality monitoring, air quality management, atmospheric dispersion and transport, air-surface (soil, water, and vegetation) exchange of pollutants, dry and wet deposition, indoor air quality, exposure assessment, health effects, satellite measurements, natural emissions, atmospheric chemistry, greenhouse gases, and effects on climate change.