Vertically Recurrent Neural Networks for Sub-Grid Parameterization

IF 4.4 2区 地球科学 Q1 METEOROLOGY & ATMOSPHERIC SCIENCES
P. Ukkonen, M. Chantry
{"title":"Vertically Recurrent Neural Networks for Sub-Grid Parameterization","authors":"P. Ukkonen,&nbsp;M. Chantry","doi":"10.1029/2024MS004833","DOIUrl":null,"url":null,"abstract":"<p>Machine learning has the potential to improve the physical realism and/or computational efficiency of parameterizations. A typical approach has been to feed concatenated vertical profiles to a dense neural network. However, feed-forward networks lack the connections to propagate information sequentially through the vertical column. Here we examine if predictions can be improved by instead traversing the column with recurrent neural networks (RNNs) such as Long Short-Term Memory (LSTMs). This method encodes physical priors (locality) and uses parameters more efficiently. Firstly, we test RNN-based radiation emulators in the Integrated Forecasting System. We achieve near-perfect offline accuracy, and the forecast skill of a suite of global weather simulations using the emulator are for the most part statistically indistinguishable from reference runs. But can radiation emulators provide both high accuracy and a speed-up? We find optimized, state-of-the-art radiation code on CPU generally faster than RNN-based emulators on GPU, although the latter can be more energy efficient. To test the method more broadly, and explore recent challenges in parameterization, we also adapt it to data sets from other studies. RNNs outperform reference feed-forward networks in emulating gravity waves, and when combined with horizontal convolutions, for non-local unified parameterization. In emulation of moist physics with memory, the RNNs have similar offline accuracy as ResNets, the previous state-of-the-art. However, the RNNs are more efficient, and more stable in autoregressive semi-prognostic tests. Multi-step autoregressive training improves performance in these tests and enables a latent representation of convective memory. Recently proposed linearly recurrent models achieve similar performance to LSTMs.</p>","PeriodicalId":14881,"journal":{"name":"Journal of Advances in Modeling Earth Systems","volume":"17 6","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2024MS004833","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advances in Modeling Earth Systems","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1029/2024MS004833","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Machine learning has the potential to improve the physical realism and/or computational efficiency of parameterizations. A typical approach has been to feed concatenated vertical profiles to a dense neural network. However, feed-forward networks lack the connections to propagate information sequentially through the vertical column. Here we examine if predictions can be improved by instead traversing the column with recurrent neural networks (RNNs) such as Long Short-Term Memory (LSTMs). This method encodes physical priors (locality) and uses parameters more efficiently. Firstly, we test RNN-based radiation emulators in the Integrated Forecasting System. We achieve near-perfect offline accuracy, and the forecast skill of a suite of global weather simulations using the emulator are for the most part statistically indistinguishable from reference runs. But can radiation emulators provide both high accuracy and a speed-up? We find optimized, state-of-the-art radiation code on CPU generally faster than RNN-based emulators on GPU, although the latter can be more energy efficient. To test the method more broadly, and explore recent challenges in parameterization, we also adapt it to data sets from other studies. RNNs outperform reference feed-forward networks in emulating gravity waves, and when combined with horizontal convolutions, for non-local unified parameterization. In emulation of moist physics with memory, the RNNs have similar offline accuracy as ResNets, the previous state-of-the-art. However, the RNNs are more efficient, and more stable in autoregressive semi-prognostic tests. Multi-step autoregressive training improves performance in these tests and enables a latent representation of convective memory. Recently proposed linearly recurrent models achieve similar performance to LSTMs.

用于子网格参数化的垂直递归神经网络
机器学习有可能提高物理真实感和/或参数化的计算效率。一种典型的方法是将连接的垂直剖面馈送到密集的神经网络中。然而,前馈网络缺乏通过垂直列依次传播信息的连接。在这里,我们研究是否可以通过使用循环神经网络(rnn)(如长短期记忆(LSTMs))来遍历列来改进预测。该方法对物理先验(局部性)进行编码,并更有效地利用参数。首先,我们在综合预报系统中对基于rnn的辐射模拟器进行了测试。我们实现了近乎完美的离线精度,并且使用模拟器的一套全球天气模拟的预测技能在很大程度上与参考运行没有统计学上的区别。但是辐射模拟器能同时提供高精度和加速吗?我们发现CPU上优化的、最先进的辐射代码通常比GPU上基于rnn的模拟器更快,尽管后者可能更节能。为了更广泛地测试该方法,并探索参数化中的最新挑战,我们还将其应用于其他研究的数据集。rnn在模拟重力波方面优于参考前馈网络,当与水平卷积相结合时,用于非局部统一参数化。在模拟具有记忆的潮湿物理时,rnn具有与先前最先进的ResNets相似的离线准确性。然而,rnn在自回归半预后测试中更有效,更稳定。多步自回归训练提高了这些测试的性能,并实现了对流记忆的潜在表征。最近提出的线性循环模型达到了与lstm相似的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Advances in Modeling Earth Systems
Journal of Advances in Modeling Earth Systems METEOROLOGY & ATMOSPHERIC SCIENCES-
CiteScore
11.40
自引率
11.80%
发文量
241
审稿时长
>12 weeks
期刊介绍: The Journal of Advances in Modeling Earth Systems (JAMES) is committed to advancing the science of Earth systems modeling by offering high-quality scientific research through online availability and open access licensing. JAMES invites authors and readers from the international Earth systems modeling community. Open access. Articles are available free of charge for everyone with Internet access to view and download. Formal peer review. Supplemental material, such as code samples, images, and visualizations, is published at no additional charge. No additional charge for color figures. Modest page charges to cover production costs. Articles published in high-quality full text PDF, HTML, and XML. Internal and external reference linking, DOI registration, and forward linking via CrossRef.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信