{"title":"Data-driven enhancement of the Hastings–Powell model using sparse identification algorithm","authors":"Nitu Kumari, Anurag Singh","doi":"10.1016/j.jocs.2025.102682","DOIUrl":null,"url":null,"abstract":"<div><div>A significant challenge in various fields of science and engineering is extracting governing equations from data. Prey-predator models are particularly complex due to their nonlinear behavior, making traditional analytical methods insufficient for accurately capturing their dynamics. In this study, we introduce a data-driven approach to model the intricate dynamics of Hastings–Powell model solely from time series data. This article explores the application of the sparse identification of nonlinear dynamics (SINDy) and its extension, the SINDy-PI (parallel, implicit) method, in a model representing a chaotic food chain. The main goal is to determine the governing equations that describe the chaotic dynamics of the prey-predator populations. Hence, this study uses the parameters wherein the dynamics exhibit chaotic behavior. The method of SINDy was developed with the aim of identifying governing equations of nonlinear dynamical systems. In both methods, a library of potential terms are created and then a regression problem is solved. We have employed both methods as our model incorporates not only nonlinear terms but also rational terms. Our results shows that SINDy method is unable to find the exact form of governing equations but SINDy-PI method has the capability to accurately capture the authentic structure of the governing equations. In addition, we applied model selection techniques to identify the most parsimonious model possible. Through the application of SINDy and SINDy-PI, this research contributes to the advancement of data-centric approaches in ecological modeling, offering insights into the intricate dynamics of multi-species interactions within ecosystems. Further, for this study to be more realistic, utilizing real-world data from three-species would have been ideal. However, due to non-availability of three species real data, simulated data set has been used for validation purpose.</div></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"91 ","pages":"Article 102682"},"PeriodicalIF":3.7000,"publicationDate":"2025-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Science","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877750325001590","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
A significant challenge in various fields of science and engineering is extracting governing equations from data. Prey-predator models are particularly complex due to their nonlinear behavior, making traditional analytical methods insufficient for accurately capturing their dynamics. In this study, we introduce a data-driven approach to model the intricate dynamics of Hastings–Powell model solely from time series data. This article explores the application of the sparse identification of nonlinear dynamics (SINDy) and its extension, the SINDy-PI (parallel, implicit) method, in a model representing a chaotic food chain. The main goal is to determine the governing equations that describe the chaotic dynamics of the prey-predator populations. Hence, this study uses the parameters wherein the dynamics exhibit chaotic behavior. The method of SINDy was developed with the aim of identifying governing equations of nonlinear dynamical systems. In both methods, a library of potential terms are created and then a regression problem is solved. We have employed both methods as our model incorporates not only nonlinear terms but also rational terms. Our results shows that SINDy method is unable to find the exact form of governing equations but SINDy-PI method has the capability to accurately capture the authentic structure of the governing equations. In addition, we applied model selection techniques to identify the most parsimonious model possible. Through the application of SINDy and SINDy-PI, this research contributes to the advancement of data-centric approaches in ecological modeling, offering insights into the intricate dynamics of multi-species interactions within ecosystems. Further, for this study to be more realistic, utilizing real-world data from three-species would have been ideal. However, due to non-availability of three species real data, simulated data set has been used for validation purpose.
期刊介绍:
Computational Science is a rapidly growing multi- and interdisciplinary field that uses advanced computing and data analysis to understand and solve complex problems. It has reached a level of predictive capability that now firmly complements the traditional pillars of experimentation and theory.
The recent advances in experimental techniques such as detectors, on-line sensor networks and high-resolution imaging techniques, have opened up new windows into physical and biological processes at many levels of detail. The resulting data explosion allows for detailed data driven modeling and simulation.
This new discipline in science combines computational thinking, modern computational methods, devices and collateral technologies to address problems far beyond the scope of traditional numerical methods.
Computational science typically unifies three distinct elements:
• Modeling, Algorithms and Simulations (e.g. numerical and non-numerical, discrete and continuous);
• Software developed to solve science (e.g., biological, physical, and social), engineering, medicine, and humanities problems;
• Computer and information science that develops and optimizes the advanced system hardware, software, networking, and data management components (e.g. problem solving environments).