Vertically Recurrent Neural Networks for Sub-Grid Parameterization
P. Ukkonen, M. Chantry
Journal of Advances in Modeling Earth Systems, 17(6), published 2025-06-10. DOI: 10.1029/2024MS004833
Open-access PDF: https://onlinelibrary.wiley.com/doi/epdf/10.1029/2024MS004833
Citations: 0
Abstract
Machine learning has the potential to improve the physical realism and/or computational efficiency of parameterizations. A typical approach has been to feed concatenated vertical profiles to a dense neural network. However, feed-forward networks lack the connections to propagate information sequentially through the vertical column. Here we examine whether predictions can be improved by instead traversing the column with recurrent neural networks (RNNs) such as Long Short-Term Memory (LSTM) networks. This method encodes physical priors (locality) and uses parameters more efficiently. First, we test RNN-based radiation emulators in the Integrated Forecasting System. We achieve near-perfect offline accuracy, and the forecast skill of a suite of global weather simulations using the emulator is for the most part statistically indistinguishable from reference runs. But can radiation emulators provide both high accuracy and a speed-up? We find that optimized, state-of-the-art radiation code on CPU is generally faster than RNN-based emulators on GPU, although the latter can be more energy efficient. To test the method more broadly, and to explore recent challenges in parameterization, we also adapt it to data sets from other studies. RNNs outperform reference feed-forward networks in emulating gravity waves and, when combined with horizontal convolutions, in non-local unified parameterization. In emulation of moist physics with memory, the RNNs have offline accuracy similar to that of ResNets, the previous state of the art. However, the RNNs are more efficient, and more stable in autoregressive semi-prognostic tests. Multi-step autoregressive training improves performance in these tests and enables a latent representation of convective memory. Recently proposed linearly recurrent models achieve similar performance to LSTMs.
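The contrast the abstract draws can be made concrete with a minimal sketch: a dense baseline sees the whole flattened column at once, while a recurrent model steps through the levels one by one, carrying a hidden state down the column. This is an illustrative toy with random weights and a plain tanh recurrent cell rather than the paper's LSTM; all sizes and names are assumptions, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

n_levels, n_features, n_hidden = 60, 5, 16  # illustrative sizes

# One atmospheric column: per-level inputs (e.g. temperature, humidity, pressure)
column = rng.standard_normal((n_levels, n_features))

# Dense baseline: flatten the entire column into one input vector.
W_dense = rng.standard_normal((n_levels * n_features, n_levels)) * 0.1
dense_out = np.tanh(column.reshape(-1) @ W_dense)  # one output per level

# Recurrent alternative: traverse the column level by level,
# propagating information sequentially (the locality prior).
W_in = rng.standard_normal((n_features, n_hidden)) * 0.1
W_h = rng.standard_normal((n_hidden, n_hidden)) * 0.1
W_out = rng.standard_normal((n_hidden, 1)) * 0.1

h = np.zeros(n_hidden)
rnn_out = np.empty(n_levels)
for k in range(n_levels):  # e.g. top of atmosphere -> surface
    h = np.tanh(column[k] @ W_in + h @ W_h)
    rnn_out[k] = (h @ W_out).item()

# Parameter efficiency: the dense layer's size scales with n_levels**2,
# while the recurrent cell's size is independent of column depth.
dense_params = W_dense.size
rnn_params = W_in.size + W_h.size + W_out.size
print(dense_params, rnn_params)
```

The parameter count also illustrates the abstract's efficiency claim: the same recurrent cell is reused at every level, so adding vertical levels adds no weights, whereas the dense layer grows with the square of the column depth.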
About the journal:
The Journal of Advances in Modeling Earth Systems (JAMES) is committed to advancing the science of Earth systems modeling by offering high-quality scientific research through online availability and open access licensing. JAMES invites authors and readers from the international Earth systems modeling community.
Open access. Articles are available free of charge for everyone with Internet access to view and download.
Formal peer review.
Supplemental material, such as code samples, images, and visualizations, is published at no additional charge.
No additional charge for color figures.
Modest page charges to cover production costs.
Articles published in high-quality full text PDF, HTML, and XML.
Internal and external reference linking, DOI registration, and forward linking via CrossRef.