Reconstructing coupled time series in climate systems using three kinds of machine-learning methods

Yu Huang, Lichao Yang, Zuntao Fu
{"title":"Reconstructing coupled time series in climate systems using three kinds of machine-learning methods","authors":"Yu Huang, Lichao Yang, Zuntao Fu","doi":"10.5194/ESD-11-835-2020","DOIUrl":null,"url":null,"abstract":"Abstract. Despite the great success of machine learning, its application in climate\ndynamics has not been well developed. One concern might be how well the\ntrained neural networks could learn a dynamical system and what will be the\npotential application of this kind of learning. In this paper, three machine-learning methods are used: reservoir computer (RC), backpropagation-based (BP)\nartificial neural network, and long short-term memory (LSTM) neural network. It shows that the coupling relations or dynamics among variables in\nlinear or nonlinear systems can be inferred by RC and LSTM, which can be\nfurther applied to reconstruct one time series from the other. Specifically,\nwe analyzed the climatic toy models to address two questions: (i) what\nfactors significantly influence machine-learning reconstruction and (ii)\nhow do we select suitable explanatory variables for machine-learning\nreconstruction. The results reveal that both linear and nonlinear coupling\nrelations between variables do influence the reconstruction quality of\nmachine learning. If there is a strong linear coupling between two\nvariables, then the reconstruction can be bidirectional, and both of these\ntwo variables can be an explanatory variable for reconstructing the other.\nWhen the linear coupling among variables is absent but with the significant\nnonlinear coupling, the machine-learning reconstruction between two\nvariables is direction dependent, and it may be only unidirectional. Then\nthe convergent cross mapping (CCM) causality index is proposed to determine\nwhich variable can be taken as the reconstructed one and which as the\nexplanatory variable. In a real-world example, the Pearson correlation\nbetween the average tropical surface air temperature (TSAT) and the average\nNorthern Hemisphere SAT (NHSAT) is weak (0.08), but the CCM index of NHSAT\ncross mapped with TSAT is large (0.70). And this indicates that TSAT can be well\nreconstructed from NHSAT through machine learning. All results shown in this study could provide insights into machine-learning\napproaches for paleoclimate reconstruction, parameterization scheme, and\nprediction in related climate research. Highlights: i The coupling dynamics learned by machine learning can be used to reconstruct\ntime series. ii Reconstruction quality is direction dependent and variable dependent for nonlinear\nsystems. iii The CCM index is a potential indicator to choose reconstructed and\nexplanatory variables. iv The tropical average SAT can be well reconstructed from the average Northern\nHemisphere SAT.","PeriodicalId":11466,"journal":{"name":"Earth System Dynamics Discussions","volume":"1 1","pages":"835-853"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth System Dynamics Discussions","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/ESD-11-835-2020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

Abstract

Abstract. Despite the great success of machine learning, its application in climate dynamics has not been well developed. One concern might be how well the trained neural networks could learn a dynamical system and what will be the potential application of this kind of learning. In this paper, three machine-learning methods are used: reservoir computer (RC), backpropagation-based (BP) artificial neural network, and long short-term memory (LSTM) neural network. It shows that the coupling relations or dynamics among variables in linear or nonlinear systems can be inferred by RC and LSTM, which can be further applied to reconstruct one time series from the other. Specifically, we analyzed the climatic toy models to address two questions: (i) what factors significantly influence machine-learning reconstruction and (ii) how do we select suitable explanatory variables for machine-learning reconstruction. The results reveal that both linear and nonlinear coupling relations between variables do influence the reconstruction quality of machine learning. If there is a strong linear coupling between two variables, then the reconstruction can be bidirectional, and both of these two variables can be an explanatory variable for reconstructing the other. When the linear coupling among variables is absent but with the significant nonlinear coupling, the machine-learning reconstruction between two variables is direction dependent, and it may be only unidirectional. Then the convergent cross mapping (CCM) causality index is proposed to determine which variable can be taken as the reconstructed one and which as the explanatory variable. In a real-world example, the Pearson correlation between the average tropical surface air temperature (TSAT) and the average Northern Hemisphere SAT (NHSAT) is weak (0.08), but the CCM index of NHSAT cross mapped with TSAT is large (0.70). And this indicates that TSAT can be well reconstructed from NHSAT through machine learning. All results shown in this study could provide insights into machine-learning approaches for paleoclimate reconstruction, parameterization scheme, and prediction in related climate research. Highlights: i The coupling dynamics learned by machine learning can be used to reconstruct time series. ii Reconstruction quality is direction dependent and variable dependent for nonlinear systems. iii The CCM index is a potential indicator to choose reconstructed and explanatory variables. iv The tropical average SAT can be well reconstructed from the average Northern Hemisphere SAT.
利用三种机器学习方法重建气候系统中的耦合时间序列
摘要尽管机器学习取得了巨大的成功,但它在气候动力学中的应用还没有得到很好的发展。一个问题可能是经过训练的神经网络如何很好地学习动态系统,以及这种学习的潜在应用将是什么。本文采用了三种机器学习方法:水库计算机(RC)、基于反向传播(BP)的人工神经网络和长短期记忆(LSTM)神经网络。结果表明,RC和LSTM可以推断出变量间的耦合关系或动态关系,并可以进一步应用于从一个时间序列重构另一个时间序列。具体来说,我们分析了气候玩具模型来解决两个问题:(i)什么因素显著影响机器学习重建和(ii)我们如何为机器学习重建选择合适的解释变量。结果表明,变量之间的线性和非线性耦合关系都会影响机器学习的重建质量。如果两个变量之间存在强线性耦合,则重构可以是双向的,并且这两个变量都可以作为重构另一个变量的解释变量。当变量之间不存在线性耦合但存在显著的非线性耦合时,两个变量之间的机器学习重构是方向相关的,可能只是单向的。然后提出了收敛交叉映射(CCM)因果关系指数来确定哪些变量可以作为重构变量,哪些变量可以作为解释变量。在实际例子中,热带平均地表气温(TSAT)与北半球平均地表气温(NHSAT)之间的Pearson相关性较弱(0.08),但NHSATcross与TSAT的CCM指数较大(0.70)。这表明通过机器学习可以很好地从NHSAT重构出TSAT。本研究结果可为相关气候研究中古气候重建、参数化方案和预测的机器学习方法提供参考。i通过机器学习学习到的耦合动力学可以用来重建时间序列。非线性系统的重构质量是方向相关和变量相关的。CCM指数是选择重构变量和解释变量的潜在指标。从北半球平均SAT可以很好地重建热带平均SAT。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信