An improved machine learning-based model to predict estuarine water levels

IF 3.1 3区地球科学 Q2 METEOROLOGY & ATMOSPHERIC SCIENCES

Ocean Modelling Pub Date : 2024-05-06 DOI:10.1016/j.ocemod.2024.102376

Min Gan , Yongping Chen , Shunqi Pan , Xijun Lai , Haidong Pan , Yuncheng Wen , Mingyan Xia

{"title":"An improved machine learning-based model to predict estuarine water levels","authors":"Min Gan , Yongping Chen , Shunqi Pan , Xijun Lai , Haidong Pan , Yuncheng Wen , Mingyan Xia","doi":"10.1016/j.ocemod.2024.102376","DOIUrl":null,"url":null,"abstract":"<div><p>The areas around estuaries are typically densely populated and economically developed. Therefore, robust flood risk assessment in these areas is critical. One of the key elements of flood risk assessment is the accurate prediction of estuarine water levels. However, the nonlinear interactions between riverine (i.e., upstream river discharge) and marine (i.e., tides) forces complicate the prediction of estuarine water levels. Traditional physics-based and data-driven models have made significant progress in predicting estuarine water levels, but they require upstream river discharge data as inputs. Considering the lack of such data, the development of new approaches is crucial. This study investigated a machine-learning-based light gradient boosting machine (LightGBM) framework for predicting estuarine water levels using historical water levels as the only inputs. Two prediction models based on the LightGBM framework, denoted as LightGBM1 and LightGBM2, are developed. The LightGBM1 model constructs only a single regression model and uses a recursive approach to generate multidimensional outputs. The LightGBM2 model constructs multiple regression models between the same inputs and outputs in each dimension. The LightGBM1 and LightGBM2 models were applied to the Yangtze estuary as a test case. The results demonstrate that both models are effective at predicting short-term (within 48 hours) estuarine water levels, but the statistical performance of LightGBM2 is better overall. For 24-hour prediction, the root-mean-squared errors of the LightGBM1 and LightGBM2 models are in the ranges of 0.14–0.17 m and 0.12–0.15 m, respectively.</p></div>","PeriodicalId":19457,"journal":{"name":"Ocean Modelling","volume":"190 ","pages":"Article 102376"},"PeriodicalIF":3.1000,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Modelling","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1463500324000635","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

The areas around estuaries are typically densely populated and economically developed. Therefore, robust flood risk assessment in these areas is critical. One of the key elements of flood risk assessment is the accurate prediction of estuarine water levels. However, the nonlinear interactions between riverine (i.e., upstream river discharge) and marine (i.e., tides) forces complicate the prediction of estuarine water levels. Traditional physics-based and data-driven models have made significant progress in predicting estuarine water levels, but they require upstream river discharge data as inputs. Considering the lack of such data, the development of new approaches is crucial. This study investigated a machine-learning-based light gradient boosting machine (LightGBM) framework for predicting estuarine water levels using historical water levels as the only inputs. Two prediction models based on the LightGBM framework, denoted as LightGBM1 and LightGBM2, are developed. The LightGBM1 model constructs only a single regression model and uses a recursive approach to generate multidimensional outputs. The LightGBM2 model constructs multiple regression models between the same inputs and outputs in each dimension. The LightGBM1 and LightGBM2 models were applied to the Yangtze estuary as a test case. The results demonstrate that both models are effective at predicting short-term (within 48 hours) estuarine water levels, but the statistical performance of LightGBM2 is better overall. For 24-hour prediction, the root-mean-squared errors of the LightGBM1 and LightGBM2 models are in the ranges of 0.14–0.17 m and 0.12–0.15 m, respectively.

查看原文本刊更多论文

基于机器学习的河口水位预测改进模型

河口周围地区通常人口稠密，经济发达。因此，在这些地区进行稳健的洪水风险评估至关重要。洪水风险评估的关键要素之一是准确预测河口水位。然而，河水（即上游河流排水量）和海洋（即潮汐）力量之间的非线性相互作用使河口水位预测变得复杂。传统的物理模型和数据驱动模型在预测河口水位方面取得了重大进展，但这些模型需要上游河流排水量数据作为输入。考虑到此类数据的缺乏，开发新方法至关重要。本研究研究了基于机器学习的光梯度提升机（LightGBM）框架，以历史水位作为唯一输入来预测河口水位。基于 LightGBM 框架开发了两个预测模型，分别称为 LightGBM1 和 LightGBM2。LightGBM1 模型仅构建一个回归模型，并使用递归方法生成多维输出。LightGBM2 模型在每个维度的相同输入和输出之间构建多个回归模型。将 LightGBM1 和 LightGBM2 模型作为一个测试案例应用于长江河口。结果表明，两个模型都能有效预测短期（48 小时内）河口水位，但 LightGBM2 的统计性能总体上更好。在 24 小时预测中，LightGBM1 和 LightGBM2 模型的均方根误差分别为 0.14-0.17 米和 0.12-0.15 米。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Ocean Modelling 地学-海洋学

CiteScore

5.50

自引率

9.40%

发文量

审稿时长

19.6 weeks

期刊介绍： The main objective of Ocean Modelling is to provide rapid communication between those interested in ocean modelling, whether through direct observation, or through analytical, numerical or laboratory models, and including interactions between physical and biogeochemical or biological phenomena. Because of the intimate links between ocean and atmosphere, involvement of scientists interested in influences of either medium on the other is welcome. The journal has a wide scope and includes ocean-atmosphere interaction in various forms as well as pure ocean results. In addition to primary peer-reviewed papers, the journal provides review papers, preliminary communications, and discussions.