Guillaume Bertoli, Salman Mohebi, Firat Ozdemir, Jonas Jucker, Stefan Rüdisühli, Fernando Perez-Cruz, Mathieu Salzmann, Sebastian Schemm
{"title":"Revisiting Machine Learning Approaches for Short- and Longwave Radiation Inference in Weather and Climate Models","authors":"Guillaume Bertoli, Salman Mohebi, Firat Ozdemir, Jonas Jucker, Stefan Rüdisühli, Fernando Perez-Cruz, Mathieu Salzmann, Sebastian Schemm","doi":"10.1029/2025MS004956","DOIUrl":null,"url":null,"abstract":"<p>This paper explores Machine Learning (ML) parameterizations for radiative transfer in the ICOsahedral Nonhydrostatic weather and climate model (ICON) and investigates the achieved ML model speed-up with ICON running on graphics processing units (GPUs). Five ML models, with varying complexity and size, are coupled to ICON; more specifically, a multilayer perceptron (MLP), a Unet model, a bidirectional recurrent neural network with long short-term memory (BiLSTM), a vision transformer (ViT), and a random forest (RF) as a baseline. The ML parameterizations are coupled to the ICON code that includes OpenACC compiler directives to enable GPU support. The coupling is done with the PyTorch-Fortran coupler developed at NVIDIA. The most accurate model is the BiLSTM with a physics-informed normalization strategy, a penalty for the heating rates during training, a Gaussian smoothing as postprocessing and a simplified computation of the fluxes at the upper levels to ensure stability of the ICON model top. The presented setup enables stable aquaplanet simulations with ICON for several weeks at a resolution of about 80 km and compares well with the physics-based default radiative transfer parameterization, ecRad. Our results indicate that the compute requirements of the ML models that can ensure the stability of ICON are comparable to GPU optimized classical physics parameterizations in terms of memory consumption and computational speed.</p>","PeriodicalId":14881,"journal":{"name":"Journal of Advances in Modeling Earth Systems","volume":"17 9","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://agupubs.onlinelibrary.wiley.com/doi/epdf/10.1029/2025MS004956","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advances in Modeling Earth Systems","FirstCategoryId":"89","ListUrlMain":"https://agupubs.onlinelibrary.wiley.com/doi/10.1029/2025MS004956","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
This paper explores Machine Learning (ML) parameterizations for radiative transfer in the ICOsahedral Nonhydrostatic weather and climate model (ICON) and investigates the achieved ML model speed-up with ICON running on graphics processing units (GPUs). Five ML models, with varying complexity and size, are coupled to ICON; more specifically, a multilayer perceptron (MLP), a Unet model, a bidirectional recurrent neural network with long short-term memory (BiLSTM), a vision transformer (ViT), and a random forest (RF) as a baseline. The ML parameterizations are coupled to the ICON code that includes OpenACC compiler directives to enable GPU support. The coupling is done with the PyTorch-Fortran coupler developed at NVIDIA. The most accurate model is the BiLSTM with a physics-informed normalization strategy, a penalty for the heating rates during training, a Gaussian smoothing as postprocessing and a simplified computation of the fluxes at the upper levels to ensure stability of the ICON model top. The presented setup enables stable aquaplanet simulations with ICON for several weeks at a resolution of about 80 km and compares well with the physics-based default radiative transfer parameterization, ecRad. Our results indicate that the compute requirements of the ML models that can ensure the stability of ICON are comparable to GPU optimized classical physics parameterizations in terms of memory consumption and computational speed.
本文探讨了ICOsahedral non - hydrostatic weather and climate model (ICON)中辐射传输的机器学习(ML)参数化,并研究了在图形处理单元(gpu)上运行ICON所实现的ML模型加速。五个ML模型,具有不同的复杂性和大小,耦合到ICON;更具体地说,是一个多层感知器(MLP)、一个Unet模型、一个具有长短期记忆(BiLSTM)的双向循环神经网络、一个视觉变压器(ViT)和一个随机森林(RF)作为基线。ML参数化与ICON代码耦合,其中包括OpenACC编译器指令以启用GPU支持。耦合是通过NVIDIA开发的PyTorch-Fortran耦合器完成的。最准确的模型是BiLSTM,它采用了物理信息归一化策略,在训练期间对加热速率进行惩罚,作为后处理的高斯平滑和上层通量的简化计算,以确保ICON模型顶部的稳定性。所提出的设置可以用ICON进行数周的稳定水行星模拟,分辨率约为80公里,与基于物理的默认辐射传输参数化ecRad相比,效果很好。我们的研究结果表明,在内存消耗和计算速度方面,能够确保ICON稳定性的ML模型的计算需求与GPU优化的经典物理参数化相当。
期刊介绍:
The Journal of Advances in Modeling Earth Systems (JAMES) is committed to advancing the science of Earth systems modeling by offering high-quality scientific research through online availability and open access licensing. JAMES invites authors and readers from the international Earth systems modeling community.
Open access. Articles are available free of charge for everyone with Internet access to view and download.
Formal peer review.
Supplemental material, such as code samples, images, and visualizations, is published at no additional charge.
No additional charge for color figures.
Modest page charges to cover production costs.
Articles published in high-quality full text PDF, HTML, and XML.
Internal and external reference linking, DOI registration, and forward linking via CrossRef.