Min Gan , Yongping Chen , Shunqi Pan , Xijun Lai , Haidong Pan , Yuncheng Wen , Mingyan Xia
{"title":"An improved machine learning-based model to predict estuarine water levels","authors":"Min Gan , Yongping Chen , Shunqi Pan , Xijun Lai , Haidong Pan , Yuncheng Wen , Mingyan Xia","doi":"10.1016/j.ocemod.2024.102376","DOIUrl":null,"url":null,"abstract":"<div><p>The areas around estuaries are typically densely populated and economically developed. Therefore, robust flood risk assessment in these areas is critical. One of the key elements of flood risk assessment is the accurate prediction of estuarine water levels. However, the nonlinear interactions between riverine (i.e., upstream river discharge) and marine (i.e., tides) forces complicate the prediction of estuarine water levels. Traditional physics-based and data-driven models have made significant progress in predicting estuarine water levels, but they require upstream river discharge data as inputs. Considering the lack of such data, the development of new approaches is crucial. This study investigated a machine-learning-based light gradient boosting machine (LightGBM) framework for predicting estuarine water levels using historical water levels as the only inputs. Two prediction models based on the LightGBM framework, denoted as LightGBM1 and LightGBM2, are developed. The LightGBM1 model constructs only a single regression model and uses a recursive approach to generate multidimensional outputs. The LightGBM2 model constructs multiple regression models between the same inputs and outputs in each dimension. The LightGBM1 and LightGBM2 models were applied to the Yangtze estuary as a test case. The results demonstrate that both models are effective at predicting short-term (within 48 hours) estuarine water levels, but the statistical performance of LightGBM2 is better overall. For 24-hour prediction, the root-mean-squared errors of the LightGBM1 and LightGBM2 models are in the ranges of 0.14–0.17 m and 0.12–0.15 m, respectively.</p></div>","PeriodicalId":19457,"journal":{"name":"Ocean Modelling","volume":"190 ","pages":"Article 102376"},"PeriodicalIF":3.1000,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Modelling","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1463500324000635","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The areas around estuaries are typically densely populated and economically developed. Therefore, robust flood risk assessment in these areas is critical. One of the key elements of flood risk assessment is the accurate prediction of estuarine water levels. However, the nonlinear interactions between riverine (i.e., upstream river discharge) and marine (i.e., tides) forces complicate the prediction of estuarine water levels. Traditional physics-based and data-driven models have made significant progress in predicting estuarine water levels, but they require upstream river discharge data as inputs. Considering the lack of such data, the development of new approaches is crucial. This study investigated a machine-learning-based light gradient boosting machine (LightGBM) framework for predicting estuarine water levels using historical water levels as the only inputs. Two prediction models based on the LightGBM framework, denoted as LightGBM1 and LightGBM2, are developed. The LightGBM1 model constructs only a single regression model and uses a recursive approach to generate multidimensional outputs. The LightGBM2 model constructs multiple regression models between the same inputs and outputs in each dimension. The LightGBM1 and LightGBM2 models were applied to the Yangtze estuary as a test case. The results demonstrate that both models are effective at predicting short-term (within 48 hours) estuarine water levels, but the statistical performance of LightGBM2 is better overall. For 24-hour prediction, the root-mean-squared errors of the LightGBM1 and LightGBM2 models are in the ranges of 0.14–0.17 m and 0.12–0.15 m, respectively.
期刊介绍:
The main objective of Ocean Modelling is to provide rapid communication between those interested in ocean modelling, whether through direct observation, or through analytical, numerical or laboratory models, and including interactions between physical and biogeochemical or biological phenomena. Because of the intimate links between ocean and atmosphere, involvement of scientists interested in influences of either medium on the other is welcome. The journal has a wide scope and includes ocean-atmosphere interaction in various forms as well as pure ocean results. In addition to primary peer-reviewed papers, the journal provides review papers, preliminary communications, and discussions.