Divide and conquer: Learning chaotic dynamical systems with multistep penalty neural ordinary differential equations

IF 6.9 1区工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY

Computer Methods in Applied Mechanics and Engineering Pub Date : 2024-10-14 DOI:10.1016/j.cma.2024.117442

Dibyajyoti Chakraborty , Seung Whan Chung , Troy Arcomano , Romit Maulik

{"title":"Divide and conquer: Learning chaotic dynamical systems with multistep penalty neural ordinary differential equations","authors":"Dibyajyoti Chakraborty , Seung Whan Chung , Troy Arcomano , Romit Maulik","doi":"10.1016/j.cma.2024.117442","DOIUrl":null,"url":null,"abstract":"<div><div>Forecasting high-dimensional dynamical systems is a fundamental challenge in various fields, such as geosciences and engineering. Neural Ordinary Differential Equations (NODEs), which combine the power of neural networks and numerical solvers, have emerged as a promising algorithm for forecasting complex nonlinear dynamical systems. However, classical techniques used for NODE training are ineffective for learning chaotic dynamical systems. In this work, we propose a novel NODE-training approach that allows for robust learning of chaotic dynamical systems. Our method addresses the challenges of non-convexity and exploding gradients associated with underlying chaotic dynamics. Training data trajectories from such systems are split into multiple, non-overlapping time windows. In addition to the deviation from the training data, the optimization loss term further penalizes the discontinuities of the predicted trajectory between the time windows. The window size is selected based on the fastest Lyapunov time scale of the system. Multi-step penalty(MP) method is first demonstrated on Lorenz equation, to illustrate how it improves the loss landscape and thereby accelerates the optimization convergence. MP method can optimize chaotic systems in a manner similar to least-squares shadowing with significantly lower computational costs. Our proposed algorithm, denoted the Multistep Penalty NODE, is applied to chaotic systems such as the Kuramoto–Sivashinsky equation, the two-dimensional Kolmogorov flow, and ERA5 reanalysis data for the atmosphere. It is observed that MP-NODE provide viable performance for such chaotic systems, not only for short-term trajectory predictions but also for invariant statistics that are hallmarks of the chaotic nature of these dynamics.</div></div>","PeriodicalId":55222,"journal":{"name":"Computer Methods in Applied Mechanics and Engineering","volume":"432 ","pages":"Article 117442"},"PeriodicalIF":6.9000,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Methods in Applied Mechanics and Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045782524006972","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Forecasting high-dimensional dynamical systems is a fundamental challenge in various fields, such as geosciences and engineering. Neural Ordinary Differential Equations (NODEs), which combine the power of neural networks and numerical solvers, have emerged as a promising algorithm for forecasting complex nonlinear dynamical systems. However, classical techniques used for NODE training are ineffective for learning chaotic dynamical systems. In this work, we propose a novel NODE-training approach that allows for robust learning of chaotic dynamical systems. Our method addresses the challenges of non-convexity and exploding gradients associated with underlying chaotic dynamics. Training data trajectories from such systems are split into multiple, non-overlapping time windows. In addition to the deviation from the training data, the optimization loss term further penalizes the discontinuities of the predicted trajectory between the time windows. The window size is selected based on the fastest Lyapunov time scale of the system. Multi-step penalty(MP) method is first demonstrated on Lorenz equation, to illustrate how it improves the loss landscape and thereby accelerates the optimization convergence. MP method can optimize chaotic systems in a manner similar to least-squares shadowing with significantly lower computational costs. Our proposed algorithm, denoted the Multistep Penalty NODE, is applied to chaotic systems such as the Kuramoto–Sivashinsky equation, the two-dimensional Kolmogorov flow, and ERA5 reanalysis data for the atmosphere. It is observed that MP-NODE provide viable performance for such chaotic systems, not only for short-term trajectory predictions but also for invariant statistics that are hallmarks of the chaotic nature of these dynamics.

查看原文本刊更多论文

分而治之：利用多步惩罚神经常微分方程学习混沌动力系统

预测高维动态系统是地球科学和工程学等各个领域面临的一项基本挑战。神经常微分方程（NODE）结合了神经网络和数值求解器的力量，已成为预测复杂非线性动力系统的一种有前途的算法。然而，用于 NODE 训练的经典技术对学习混沌动力系统无效。在这项工作中，我们提出了一种新颖的 NODE 训练方法，可以对混沌动力学系统进行稳健学习。我们的方法解决了与底层混沌动力学相关的非凸性和梯度爆炸的难题。来自此类系统的训练数据轨迹被分割成多个不重叠的时间窗口。除了与训练数据的偏差外，优化损失项还会进一步惩罚时间窗口之间预测轨迹的不连续性。窗口大小根据系统最快的 Lyapunov 时间尺度来选择。首先在洛伦兹方程上演示了多步惩罚（MP）方法，以说明该方法如何改善损失景观，从而加速优化收敛。MP 方法能以类似于最小二乘阴影法的方式优化混沌系统，而计算成本却大大降低。我们提出的算法被命名为 "多步惩罚 NODE"，它被应用于混沌系统，如 Kuramoto-Sivashinsky 方程、二维 Kolmogorov 流和 ERA5 大气再分析数据。结果表明，MP-NODE 为这类混沌系统提供了可行的性能，不仅适用于短期轨迹预测，而且适用于作为这些动力学混沌特性标志的不变统计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer Methods in Applied Mechanics and Engineering 工程技术-工程：综合

CiteScore

12.70

自引率

15.30%

发文量

719

审稿时长

44 days

期刊介绍： Computer Methods in Applied Mechanics and Engineering stands as a cornerstone in the realm of computational science and engineering. With a history spanning over five decades, the journal has been a key platform for disseminating papers on advanced mathematical modeling and numerical solutions. Interdisciplinary in nature, these contributions encompass mechanics, mathematics, computer science, and various scientific disciplines. The journal welcomes a broad range of computational methods addressing the simulation, analysis, and design of complex physical problems, making it a vital resource for researchers in the field.