{"title":"Reconstructing coupled time series in climate systems using three kinds of machine-learning methods","authors":"Yu Huang, Lichao Yang, Zuntao Fu","doi":"10.5194/ESD-11-835-2020","DOIUrl":null,"url":null,"abstract":"Abstract. Despite the great success of machine learning, its application in climate\ndynamics has not been well developed. One concern might be how well the\ntrained neural networks could learn a dynamical system and what will be the\npotential application of this kind of learning. In this paper, three machine-learning methods are used: reservoir computer (RC), backpropagation-based (BP)\nartificial neural network, and long short-term memory (LSTM) neural network. It shows that the coupling relations or dynamics among variables in\nlinear or nonlinear systems can be inferred by RC and LSTM, which can be\nfurther applied to reconstruct one time series from the other. Specifically,\nwe analyzed the climatic toy models to address two questions: (i) what\nfactors significantly influence machine-learning reconstruction and (ii)\nhow do we select suitable explanatory variables for machine-learning\nreconstruction. The results reveal that both linear and nonlinear coupling\nrelations between variables do influence the reconstruction quality of\nmachine learning. If there is a strong linear coupling between two\nvariables, then the reconstruction can be bidirectional, and both of these\ntwo variables can be an explanatory variable for reconstructing the other.\nWhen the linear coupling among variables is absent but with the significant\nnonlinear coupling, the machine-learning reconstruction between two\nvariables is direction dependent, and it may be only unidirectional. Then\nthe convergent cross mapping (CCM) causality index is proposed to determine\nwhich variable can be taken as the reconstructed one and which as the\nexplanatory variable. In a real-world example, the Pearson correlation\nbetween the average tropical surface air temperature (TSAT) and the average\nNorthern Hemisphere SAT (NHSAT) is weak (0.08), but the CCM index of NHSAT\ncross mapped with TSAT is large (0.70). And this indicates that TSAT can be well\nreconstructed from NHSAT through machine learning. All results shown in this study could provide insights into machine-learning\napproaches for paleoclimate reconstruction, parameterization scheme, and\nprediction in related climate research. Highlights: i The coupling dynamics learned by machine learning can be used to reconstruct\ntime series. ii Reconstruction quality is direction dependent and variable dependent for nonlinear\nsystems. iii The CCM index is a potential indicator to choose reconstructed and\nexplanatory variables. iv The tropical average SAT can be well reconstructed from the average Northern\nHemisphere SAT.","PeriodicalId":11466,"journal":{"name":"Earth System Dynamics Discussions","volume":"1 1","pages":"835-853"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth System Dynamics Discussions","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/ESD-11-835-2020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
Abstract. Despite the great success of machine learning, its application in climate
dynamics has not been well developed. One concern might be how well the
trained neural networks could learn a dynamical system and what will be the
potential application of this kind of learning. In this paper, three machine-learning methods are used: reservoir computer (RC), backpropagation-based (BP)
artificial neural network, and long short-term memory (LSTM) neural network. It shows that the coupling relations or dynamics among variables in
linear or nonlinear systems can be inferred by RC and LSTM, which can be
further applied to reconstruct one time series from the other. Specifically,
we analyzed the climatic toy models to address two questions: (i) what
factors significantly influence machine-learning reconstruction and (ii)
how do we select suitable explanatory variables for machine-learning
reconstruction. The results reveal that both linear and nonlinear coupling
relations between variables do influence the reconstruction quality of
machine learning. If there is a strong linear coupling between two
variables, then the reconstruction can be bidirectional, and both of these
two variables can be an explanatory variable for reconstructing the other.
When the linear coupling among variables is absent but with the significant
nonlinear coupling, the machine-learning reconstruction between two
variables is direction dependent, and it may be only unidirectional. Then
the convergent cross mapping (CCM) causality index is proposed to determine
which variable can be taken as the reconstructed one and which as the
explanatory variable. In a real-world example, the Pearson correlation
between the average tropical surface air temperature (TSAT) and the average
Northern Hemisphere SAT (NHSAT) is weak (0.08), but the CCM index of NHSAT
cross mapped with TSAT is large (0.70). And this indicates that TSAT can be well
reconstructed from NHSAT through machine learning. All results shown in this study could provide insights into machine-learning
approaches for paleoclimate reconstruction, parameterization scheme, and
prediction in related climate research. Highlights: i The coupling dynamics learned by machine learning can be used to reconstruct
time series. ii Reconstruction quality is direction dependent and variable dependent for nonlinear
systems. iii The CCM index is a potential indicator to choose reconstructed and
explanatory variables. iv The tropical average SAT can be well reconstructed from the average Northern
Hemisphere SAT.