评估基于xgboost的能见度预报的台站、卫星和组合数据

IF 4.4 2区 地球科学 Q1 METEOROLOGY & ATMOSPHERIC SCIENCES
Michaela Schütz, Adrian Schütz, Jörg Bendix, Jonas Müller und Boris Thies
{"title":"评估基于xgboost的能见度预报的台站、卫星和组合数据","authors":"Michaela Schütz,&nbsp;Adrian Schütz,&nbsp;Jörg Bendix,&nbsp;Jonas Müller und Boris Thies","doi":"10.1016/j.atmosres.2025.108395","DOIUrl":null,"url":null,"abstract":"<div><div>Radiation fog poses challenges for the very short-term weather forecasting due to its complex atmospheric dynamics. Accurate and spatially available visibility predictions are crucial for sectors where visibility conditions directly impact safety and operational efficiency. Traditional numerical weather prediction models lack real-time forecasting capabilities, so this study investigates a machine-learning-based visibility forecast with XGBoost for a station in Germany. Two data sources were used and compared: high-resolution station data and nationwide available Meteosat Second Generation (MSG) satellite data. The analysis investigates how the coarser spatial resolution of MSG data compares to finer station data in predicting fog formation and dissipation. Therefore, station-based predictors were substituted with MSG satellite data. Additionally, the study addresses data imbalances during training and evaluation by focusing on critical low-visibility conditions and specifically fog formation and dissipation. XGBoost significantly outperforms the three baseline models – pure visibility driven forecast, Persistence Model and Linear Regression. The mean absolute error (MAE) is less than 150 m in the low visibility range. For the predominantly MSG-variable-based model only 3 % of fog formations and 6 % of fog dissipations are completely missed. Furthermore, the MSG-model predicts 50 % of fog formations and 60 % of dissipations within the 30-min window of their actual occurrence. The model utilizing MSG data as substitutes for station-based predictors delivers comparable performance to the purely station-data-based forecast highlighting the potential of area-wide accessible MSG data. However, visibility measurements remain necessary for forecasting. Therefore, future research should develop satellite-derived products to replace visibility, enabling fully spatial forecasts.</div></div>","PeriodicalId":8600,"journal":{"name":"Atmospheric Research","volume":"328 ","pages":"Article 108395"},"PeriodicalIF":4.4000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating station, satellite, & combined data for XGBoost-based visibility forecast\",\"authors\":\"Michaela Schütz,&nbsp;Adrian Schütz,&nbsp;Jörg Bendix,&nbsp;Jonas Müller und Boris Thies\",\"doi\":\"10.1016/j.atmosres.2025.108395\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Radiation fog poses challenges for the very short-term weather forecasting due to its complex atmospheric dynamics. Accurate and spatially available visibility predictions are crucial for sectors where visibility conditions directly impact safety and operational efficiency. Traditional numerical weather prediction models lack real-time forecasting capabilities, so this study investigates a machine-learning-based visibility forecast with XGBoost for a station in Germany. Two data sources were used and compared: high-resolution station data and nationwide available Meteosat Second Generation (MSG) satellite data. The analysis investigates how the coarser spatial resolution of MSG data compares to finer station data in predicting fog formation and dissipation. Therefore, station-based predictors were substituted with MSG satellite data. Additionally, the study addresses data imbalances during training and evaluation by focusing on critical low-visibility conditions and specifically fog formation and dissipation. XGBoost significantly outperforms the three baseline models – pure visibility driven forecast, Persistence Model and Linear Regression. The mean absolute error (MAE) is less than 150 m in the low visibility range. For the predominantly MSG-variable-based model only 3 % of fog formations and 6 % of fog dissipations are completely missed. Furthermore, the MSG-model predicts 50 % of fog formations and 60 % of dissipations within the 30-min window of their actual occurrence. The model utilizing MSG data as substitutes for station-based predictors delivers comparable performance to the purely station-data-based forecast highlighting the potential of area-wide accessible MSG data. However, visibility measurements remain necessary for forecasting. Therefore, future research should develop satellite-derived products to replace visibility, enabling fully spatial forecasts.</div></div>\",\"PeriodicalId\":8600,\"journal\":{\"name\":\"Atmospheric Research\",\"volume\":\"328 \",\"pages\":\"Article 108395\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Atmospheric Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169809525004879\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"METEOROLOGY & ATMOSPHERIC SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169809525004879","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

辐射雾由于其复杂的大气动力学特性,给极短期天气预报带来了挑战。准确和空间可用的能见度预测对于能见度条件直接影响安全和运营效率的部门至关重要。传统的数值天气预报模型缺乏实时预报能力,因此本研究利用XGBoost对德国某气象站进行了基于机器学习的能见度预报。使用并比较了两种数据来源:高分辨率台站数据和全国可用的气象卫星第二代(MSG)卫星数据。分析了MSG数据较粗的空间分辨率与较精细的台站数据在预测雾的形成和消散方面的比较。因此,用MSG卫星数据代替基于台站的预测。此外,该研究通过关注关键的低能见度条件,特别是雾的形成和消散,解决了训练和评估期间的数据不平衡问题。XGBoost显著优于三个基线模型——纯可见性驱动预测、持久性模型和线性回归。在低能见度范围内,平均绝对误差(MAE)小于150 m。对于主要基于msg变量的模型,只有3%的雾形成和6%的雾消散被完全遗漏。此外,msg模式预测50%的雾形成和60%的消散在实际发生的30分钟窗口内。利用MSG数据替代基于站点的预测器的模型提供了与纯粹基于站点数据的预测相当的性能,突出了区域范围内可访问MSG数据的潜力。然而,能见度测量对于预报仍然是必要的。因此,未来的研究应该开发卫星衍生产品来取代能见度,实现全空间预报。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Evaluating station, satellite, & combined data for XGBoost-based visibility forecast

Evaluating station, satellite, & combined data for XGBoost-based visibility forecast
Radiation fog poses challenges for the very short-term weather forecasting due to its complex atmospheric dynamics. Accurate and spatially available visibility predictions are crucial for sectors where visibility conditions directly impact safety and operational efficiency. Traditional numerical weather prediction models lack real-time forecasting capabilities, so this study investigates a machine-learning-based visibility forecast with XGBoost for a station in Germany. Two data sources were used and compared: high-resolution station data and nationwide available Meteosat Second Generation (MSG) satellite data. The analysis investigates how the coarser spatial resolution of MSG data compares to finer station data in predicting fog formation and dissipation. Therefore, station-based predictors were substituted with MSG satellite data. Additionally, the study addresses data imbalances during training and evaluation by focusing on critical low-visibility conditions and specifically fog formation and dissipation. XGBoost significantly outperforms the three baseline models – pure visibility driven forecast, Persistence Model and Linear Regression. The mean absolute error (MAE) is less than 150 m in the low visibility range. For the predominantly MSG-variable-based model only 3 % of fog formations and 6 % of fog dissipations are completely missed. Furthermore, the MSG-model predicts 50 % of fog formations and 60 % of dissipations within the 30-min window of their actual occurrence. The model utilizing MSG data as substitutes for station-based predictors delivers comparable performance to the purely station-data-based forecast highlighting the potential of area-wide accessible MSG data. However, visibility measurements remain necessary for forecasting. Therefore, future research should develop satellite-derived products to replace visibility, enabling fully spatial forecasts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Atmospheric Research
Atmospheric Research 地学-气象与大气科学
CiteScore
9.40
自引率
10.90%
发文量
460
审稿时长
47 days
期刊介绍: The journal publishes scientific papers (research papers, review articles, letters and notes) dealing with the part of the atmosphere where meteorological events occur. Attention is given to all processes extending from the earth surface to the tropopause, but special emphasis continues to be devoted to the physics of clouds, mesoscale meteorology and air pollution, i.e. atmospheric aerosols; microphysical processes; cloud dynamics and thermodynamics; numerical simulation, climatology, climate change and weather modification.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信