Zied Ben Bouallègue, Mariana C A Clare, Linus Magnusson, Estibaliz Gascón, Michael Maier-Gerber, Martin Janoušek, Mark Rodwell, Florian Pinault, Jesper S Dramsch, Simon T K Lang, Baudouin Raoult, Florence Rabier, Matthieu Chevallier, Irina Sandu, Peter Dueben, Matthew Chantry, Florian Pappenberger
{"title":"The rise of data-driven weather forecasting: A first statistical assessment of machine learning-based weather forecasts in an operational-like context","authors":"Zied Ben Bouallègue, Mariana C A Clare, Linus Magnusson, Estibaliz Gascón, Michael Maier-Gerber, Martin Janoušek, Mark Rodwell, Florian Pinault, Jesper S Dramsch, Simon T K Lang, Baudouin Raoult, Florence Rabier, Matthieu Chevallier, Irina Sandu, Peter Dueben, Matthew Chantry, Florian Pappenberger","doi":"10.1175/bams-d-23-0162.1","DOIUrl":null,"url":null,"abstract":"Abstract Data-driven modeling based on machine learning (ML) is showing enormous potential for weather forecasting. Rapid progress has been made with impressive results for some applications. The uptake of ML methods could be a game-changer for the incremental progress in traditional numerical weather prediction (NWP) known as the “quiet revolution” of weather forecasting. The computational cost of running a forecast with standard NWP systems greatly hinders the improvements that can be made from increasing model resolution and ensemble sizes. An emerging new generation of ML models, developed using high-quality reanalysis datasets like ERA5 for training, allow forecasts that require much lower computational costs and that are highly-competitive in terms of accuracy. Here, we compare for the first time ML-generated forecasts with standard NWP-based forecasts in an operational-like context, initialized from the same initial conditions. Focusing on deterministic forecasts, we apply common forecast verification tools to assess to what extent a data-driven forecast produced with one of the recently developed ML models (PanguWeather) matches the quality and attributes of a forecast from one of the leading global NWP systems (the ECMWF IFS). The results are very promising, with comparable accuracy for both global metrics and extreme events, when verified against both the operational IFS analysis and synoptic observations. Overly smooth forecasts, increasing bias with forecast lead time, and poor performance in predicting tropical cyclone intensity are identified as current drawbacks of ML-based forecasts. A new NWP paradigm is emerging relying on inference from ML models and state-of-the-art analysis and reanalysis datasets for forecast initialization and model training.","PeriodicalId":9464,"journal":{"name":"Bulletin of the American Meteorological Society","volume":"171 1","pages":""},"PeriodicalIF":6.9000,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of the American Meteorological Society","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1175/bams-d-23-0162.1","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract Data-driven modeling based on machine learning (ML) is showing enormous potential for weather forecasting. Rapid progress has been made with impressive results for some applications. The uptake of ML methods could be a game-changer for the incremental progress in traditional numerical weather prediction (NWP) known as the “quiet revolution” of weather forecasting. The computational cost of running a forecast with standard NWP systems greatly hinders the improvements that can be made from increasing model resolution and ensemble sizes. An emerging new generation of ML models, developed using high-quality reanalysis datasets like ERA5 for training, allow forecasts that require much lower computational costs and that are highly-competitive in terms of accuracy. Here, we compare for the first time ML-generated forecasts with standard NWP-based forecasts in an operational-like context, initialized from the same initial conditions. Focusing on deterministic forecasts, we apply common forecast verification tools to assess to what extent a data-driven forecast produced with one of the recently developed ML models (PanguWeather) matches the quality and attributes of a forecast from one of the leading global NWP systems (the ECMWF IFS). The results are very promising, with comparable accuracy for both global metrics and extreme events, when verified against both the operational IFS analysis and synoptic observations. Overly smooth forecasts, increasing bias with forecast lead time, and poor performance in predicting tropical cyclone intensity are identified as current drawbacks of ML-based forecasts. A new NWP paradigm is emerging relying on inference from ML models and state-of-the-art analysis and reanalysis datasets for forecast initialization and model training.
摘要 基于机器学习(ML)的数据驱动建模在天气预报方面显示出巨大的潜力。在某些应用领域,已经取得了快速进展和令人印象深刻的成果。对于被称为天气预报 "静悄悄的革命 "的传统数值天气预报(NWP)而言,采用 ML 方法可能会改变其渐进式发展。使用标准 NWP 系统进行预报的计算成本极大地阻碍了提高模式分辨率和集合规模所能带来的改进。利用高质量再分析数据集(如ERA5)进行训练开发的新一代 ML 模式,可使预报所需的计算成本大大降低,而且在准确性方面具有很强的竞争力。在这里,我们首次将 ML 生成的预报与基于标准 NWP 的预报进行了类似业务化的比较,这些预报是在相同的初始条件下初始化的。以确定性预报为重点,我们应用常用的预报验证工具,评估使用最近开发的一种 ML 模型(盘古天气)生成的数据驱动预报在多大程度上与全球领先的 NWP 系统(ECMWF IFS)的预报质量和属性相匹配。结果很有希望,在与运行中的 IFS 分析和同步观测进行验证时,全球指标和极端事件的准确性都相当高。基于 ML 的预报目前存在的缺点是预报过于平滑、预报偏差随着预报准备时间的延长而增大以及热带气旋强度预报性能不佳。一种新的 NWP 模式正在出现,它依赖于 ML 模式的推断以及用于预报初始化和模式训练的最新分析和再分析数据集。
期刊介绍:
The Bulletin of the American Meteorological Society (BAMS) is the flagship magazine of AMS and publishes articles of interest and significance for the weather, water, and climate community as well as news, editorials, and reviews for AMS members.