Shuang Zhou , Shiyu Wang , Herbert Ho-Ching Iu , Yingqian Zhang , Hao Zhang
{"title":"Machine learning-based estimation of correlation dimension from short chaotic time series","authors":"Shuang Zhou , Shiyu Wang , Herbert Ho-Ching Iu , Yingqian Zhang , Hao Zhang","doi":"10.1016/j.physd.2025.134869","DOIUrl":null,"url":null,"abstract":"<div><div>Fractal dimension is an important tool for describing complex systems, while correlation dimension is a type of fractal dimension of time series. It can give a more accurate understanding of the characteristics of the system and plays an important role in practical applications. To address the challenge of accurately calculating the correlation dimension of short chaotic data, this paper explores a machine learning-based approach. This method leverages the universal approximation theory of neural networks to compare the Long Short-Term Memory (LSTM) network, the Transformer architecture, and the Backpropagation (BP) network in deep learning, and then selects the optimal approach. Ultimately, the method integrates the backpropagation (BP) neural network with a genetic programming (GP) algorithm for the calculation of the correlation dimension, thereby constructing a novel model. This method aims to expand the volume of chaotic short data and thereby better utilize data information. Initially, a BP neural network with ten hidden layer neurons is constructed, combined with the calculation of correlation dimension to form a new method. Subsequently, simulations are conducted on various commonly encountered chaotic systems, including Lorenz, Hénon, Chen, Logistic systems, solar activity time series and daily female births time series. And their mean squared error (MSE) values are recorded after data augmentation. Finally, the correlation dimensions of these expanded data sets are recalculated and compared with their original values. The findings reveal a remarkable proximity to the theoretical values, thereby validating the method's efficacy in augmenting chaotic short-term sequences. This approach not only enhances the utilization of original chaotic time series data but also paves the way for extracting deeper insights and greater value from short chaotic data.</div></div>","PeriodicalId":20050,"journal":{"name":"Physica D: Nonlinear Phenomena","volume":"482 ","pages":"Article 134869"},"PeriodicalIF":2.9000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physica D: Nonlinear Phenomena","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016727892500346X","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
Fractal dimension is an important tool for describing complex systems, while correlation dimension is a type of fractal dimension of time series. It can give a more accurate understanding of the characteristics of the system and plays an important role in practical applications. To address the challenge of accurately calculating the correlation dimension of short chaotic data, this paper explores a machine learning-based approach. This method leverages the universal approximation theory of neural networks to compare the Long Short-Term Memory (LSTM) network, the Transformer architecture, and the Backpropagation (BP) network in deep learning, and then selects the optimal approach. Ultimately, the method integrates the backpropagation (BP) neural network with a genetic programming (GP) algorithm for the calculation of the correlation dimension, thereby constructing a novel model. This method aims to expand the volume of chaotic short data and thereby better utilize data information. Initially, a BP neural network with ten hidden layer neurons is constructed, combined with the calculation of correlation dimension to form a new method. Subsequently, simulations are conducted on various commonly encountered chaotic systems, including Lorenz, Hénon, Chen, Logistic systems, solar activity time series and daily female births time series. And their mean squared error (MSE) values are recorded after data augmentation. Finally, the correlation dimensions of these expanded data sets are recalculated and compared with their original values. The findings reveal a remarkable proximity to the theoretical values, thereby validating the method's efficacy in augmenting chaotic short-term sequences. This approach not only enhances the utilization of original chaotic time series data but also paves the way for extracting deeper insights and greater value from short chaotic data.
期刊介绍:
Physica D (Nonlinear Phenomena) publishes research and review articles reporting on experimental and theoretical works, techniques and ideas that advance the understanding of nonlinear phenomena. Topics encompass wave motion in physical, chemical and biological systems; physical or biological phenomena governed by nonlinear field equations, including hydrodynamics and turbulence; pattern formation and cooperative phenomena; instability, bifurcations, chaos, and space-time disorder; integrable/Hamiltonian systems; asymptotic analysis and, more generally, mathematical methods for nonlinear systems.