CoDiCast: Conditional Diffusion Model for Weather Prediction with Uncertainty Quantification

Jimeng Shi, Bowen Jin, Jiawei Han, Giri Narasimhan
{"title":"CoDiCast: Conditional Diffusion Model for Weather Prediction with Uncertainty Quantification","authors":"Jimeng Shi, Bowen Jin, Jiawei Han, Giri Narasimhan","doi":"arxiv-2409.05975","DOIUrl":null,"url":null,"abstract":"Accurate weather forecasting is critical for science and society. Yet,\nexisting methods have not managed to simultaneously have the properties of high\naccuracy, low uncertainty, and high computational efficiency. On one hand, to\nquantify the uncertainty in weather predictions, the strategy of ensemble\nforecast (i.e., generating a set of diverse predictions) is often employed.\nHowever, traditional ensemble numerical weather prediction (NWP) is\ncomputationally intensive. On the other hand, most existing machine\nlearning-based weather prediction (MLWP) approaches are efficient and accurate.\nNevertheless, they are deterministic and cannot capture the uncertainty of\nweather forecasting. In this work, we propose CoDiCast, a conditional diffusion\nmodel to generate accurate global weather prediction, while achieving\nuncertainty quantification with ensemble forecasts and modest computational\ncost. The key idea is to simulate a conditional version of the reverse\ndenoising process in diffusion models, which starts from pure Gaussian noise to\ngenerate realistic weather scenarios for a future time point. Each denoising\nstep is conditioned on observations from the recent past. Ensemble forecasts\nare achieved by repeatedly sampling from stochastic Gaussian noise to represent\nuncertainty quantification. CoDiCast is trained on a decade of ERA5 reanalysis\ndata from the European Centre for Medium-Range Weather Forecasts (ECMWF).\nExperimental results demonstrate that our approach outperforms several existing\ndata-driven methods in accuracy. Our conditional diffusion model, CoDiCast, can\ngenerate 3-day global weather forecasts, at 6-hour steps and $5.625^\\circ$\nlatitude-longitude resolution, for over 5 variables, in about 12 minutes on a\ncommodity A100 GPU machine with 80GB memory. The open-souced code is provided\nat \\url{https://github.com/JimengShi/CoDiCast}.","PeriodicalId":501166,"journal":{"name":"arXiv - PHYS - Atmospheric and Oceanic Physics","volume":"8 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Atmospheric and Oceanic Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05975","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate weather forecasting is critical for science and society. Yet, existing methods have not managed to simultaneously have the properties of high accuracy, low uncertainty, and high computational efficiency. On one hand, to quantify the uncertainty in weather predictions, the strategy of ensemble forecast (i.e., generating a set of diverse predictions) is often employed. However, traditional ensemble numerical weather prediction (NWP) is computationally intensive. On the other hand, most existing machine learning-based weather prediction (MLWP) approaches are efficient and accurate. Nevertheless, they are deterministic and cannot capture the uncertainty of weather forecasting. In this work, we propose CoDiCast, a conditional diffusion model to generate accurate global weather prediction, while achieving uncertainty quantification with ensemble forecasts and modest computational cost. The key idea is to simulate a conditional version of the reverse denoising process in diffusion models, which starts from pure Gaussian noise to generate realistic weather scenarios for a future time point. Each denoising step is conditioned on observations from the recent past. Ensemble forecasts are achieved by repeatedly sampling from stochastic Gaussian noise to represent uncertainty quantification. CoDiCast is trained on a decade of ERA5 reanalysis data from the European Centre for Medium-Range Weather Forecasts (ECMWF). Experimental results demonstrate that our approach outperforms several existing data-driven methods in accuracy. Our conditional diffusion model, CoDiCast, can generate 3-day global weather forecasts, at 6-hour steps and $5.625^\circ$ latitude-longitude resolution, for over 5 variables, in about 12 minutes on a commodity A100 GPU machine with 80GB memory. The open-souced code is provided at \url{https://github.com/JimengShi/CoDiCast}.
CoDiCast:带不确定性量化的天气预报条件扩散模型
准确的天气预报对科学和社会至关重要。然而,现有的方法还无法同时具备高准确度、低不确定性和高计算效率的特性。一方面,为了量化天气预报的不确定性,通常采用集合预报(即生成一组不同的预报)的策略,但传统的集合数值天气预报(NWP)需要大量的计算。另一方面,现有的基于机器学习的天气预报(MLWP)方法大多高效准确,但它们都是确定性的,无法捕捉天气预报的不确定性。在这项工作中,我们提出了一种条件扩散模型 CoDiCast,用于生成准确的全球天气预报,同时通过集合预报和适度的计算成本实现不确定性量化。其关键思路是模拟扩散模型中条件版的反向去噪过程,该过程从纯高斯噪声开始,生成未来时间点的真实天气情况。每个去噪步骤都以近期的观测结果为条件。通过从随机高斯噪声中反复采样来表示不确定性量化,从而实现集合预报。CoDiCast 是在欧洲中期天气预报中心(ECMWF)十年ERA5 再分析数据的基础上进行训练的。我们的条件扩散模型CoDiCast可以生成3天的全球天气预报,步长为6小时,纬度-经度分辨率为5.625^\circ$,变量超过5个,在80GB内存的A100 GPU机器上只需约12分钟。开放式代码在 \url{https://github.com/JimengShi/CoDiCast} 上提供。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信