Novel mathematical expression for dynamic stage-discharge relationship of Rivers under flow unsteadiness explored through machine learning models via symbolic regression in PySR
{"title":"Novel mathematical expression for dynamic stage-discharge relationship of Rivers under flow unsteadiness explored through machine learning models via symbolic regression in PySR","authors":"Rijurekha Dasgupta, Archisha Bhar, Subhasish Das, Rajib Das, Gourab Banerjee, Asis Mazumdar","doi":"10.1016/j.jhydrol.2025.133947","DOIUrl":null,"url":null,"abstract":"<div><div>Stage-discharge (<em>h</em>-<em>Q</em>) rating curve (<em>Q</em> = <em>ah<sup>b</sup></em>) defined by parameters <em>a</em> and <em>b</em> is a crucial tool for measuring riverflow. However, this relationship struggles with flow fluctuations, backwater effects, tides, and changes in cross-sectional geometry, impacting its accuracy. Machine learning (ML) algorithms model the discharge with sequential stages as input, capturing intricate non-linear relationships, despite their “black box” nature that lacks explicit mathematical expressions. This study aims, therefore, to explore ML models trained on stage-discharge data using symbolic regression to produce actual mathematical expressions for stage-discharge relationships. By using sequential stage data as independent variables, the effect of unsteadiness is captured on stage-discharge relationships for River Churni (in India), Brays Bayou, Little Fishing Creek, and Obion (in USA). Symbolic regression using PySR tool is newly implemented to derive the mathematical expressions, followed by ML-based rating curve models. For each river, five generic mathematical expressions have been derived through symbolic regression where discharge is evaluated as polynomials of powered terms of sequential stages, rather than relying on conventional relationships. These expressions are assessed based on their modeling performance, prediction accuracy, and physical relevance. The results indicate that the coefficient of determination has significantly increased by 9–56 % with ML models compared to conventional relationships. The evaluation results show the best performance of new stage-discharge equation, recognizing the unique mathematical framework of stage-discharge relationship at river cross-sections and establishing a methodology for deriving it. This study executes a novel application of symbolic regression to evaluate explicit mathematical expressions relating to the sequential stage and discharge.</div></div>","PeriodicalId":362,"journal":{"name":"Journal of Hydrology","volume":"662 ","pages":"Article 133947"},"PeriodicalIF":5.9000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022169425012855","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
Stage-discharge (h-Q) rating curve (Q = ahb) defined by parameters a and b is a crucial tool for measuring riverflow. However, this relationship struggles with flow fluctuations, backwater effects, tides, and changes in cross-sectional geometry, impacting its accuracy. Machine learning (ML) algorithms model the discharge with sequential stages as input, capturing intricate non-linear relationships, despite their “black box” nature that lacks explicit mathematical expressions. This study aims, therefore, to explore ML models trained on stage-discharge data using symbolic regression to produce actual mathematical expressions for stage-discharge relationships. By using sequential stage data as independent variables, the effect of unsteadiness is captured on stage-discharge relationships for River Churni (in India), Brays Bayou, Little Fishing Creek, and Obion (in USA). Symbolic regression using PySR tool is newly implemented to derive the mathematical expressions, followed by ML-based rating curve models. For each river, five generic mathematical expressions have been derived through symbolic regression where discharge is evaluated as polynomials of powered terms of sequential stages, rather than relying on conventional relationships. These expressions are assessed based on their modeling performance, prediction accuracy, and physical relevance. The results indicate that the coefficient of determination has significantly increased by 9–56 % with ML models compared to conventional relationships. The evaluation results show the best performance of new stage-discharge equation, recognizing the unique mathematical framework of stage-discharge relationship at river cross-sections and establishing a methodology for deriving it. This study executes a novel application of symbolic regression to evaluate explicit mathematical expressions relating to the sequential stage and discharge.
期刊介绍:
The Journal of Hydrology publishes original research papers and comprehensive reviews in all the subfields of the hydrological sciences including water based management and policy issues that impact on economics and society. These comprise, but are not limited to the physical, chemical, biogeochemical, stochastic and systems aspects of surface and groundwater hydrology, hydrometeorology and hydrogeology. Relevant topics incorporating the insights and methodologies of disciplines such as climatology, water resource systems, hydraulics, agrohydrology, geomorphology, soil science, instrumentation and remote sensing, civil and environmental engineering are included. Social science perspectives on hydrological problems such as resource and ecological economics, environmental sociology, psychology and behavioural science, management and policy analysis are also invited. Multi-and interdisciplinary analyses of hydrological problems are within scope. The science published in the Journal of Hydrology is relevant to catchment scales rather than exclusively to a local scale or site.