Comparative evaluation of machine learning models for extreme river water level forecasting in Bangladesh: Implications for flood and drought resilience

IF 3.8 Q3 ENVIRONMENTAL SCIENCES
Md Touhidul Islam , Sujan Chandra Roy , Nusrat Jahan , Al-Mahmud , Md Mazharul Islam , Abdullah Al Ferdaus , Kazunori Fujisawa , A.K.M. Adham
{"title":"Comparative evaluation of machine learning models for extreme river water level forecasting in Bangladesh: Implications for flood and drought resilience","authors":"Md Touhidul Islam ,&nbsp;Sujan Chandra Roy ,&nbsp;Nusrat Jahan ,&nbsp;Al-Mahmud ,&nbsp;Md Mazharul Islam ,&nbsp;Abdullah Al Ferdaus ,&nbsp;Kazunori Fujisawa ,&nbsp;A.K.M. Adham","doi":"10.1016/j.pdisas.2025.100449","DOIUrl":null,"url":null,"abstract":"<div><div>Reliable forecasting of extreme river water levels is vital for managing flood and drought risks in Bangladesh, a deltaic nation highly vulnerable to climate change. This study compares nine machine learning (ML) models for predicting monthly maximum and minimum water levels at three key stations along the Old Brahmaputra River using a 34-year dataset (1990–2024). Performance was assessed using ten metrics including RMSE, R<sup>2</sup>, and NSE. Random Forest Regression (RFR) consistently outperformed other models, achieving the highest accuracy for both maximum (RMSE: 0.64–0.77 m; R<sup>2</sup>: 0.87–0.92) and minimum water levels (RMSE: 0.49–0.66 m; R<sup>2</sup>: 0.82–0.92), while linear models underperformed in capturing nonlinear patterns. A PCA-based framework further validated RFR's robustness, with average normalized composite scores of 1.00 (maximum) and 0.99 (minimum), significantly surpassing Ensemble Regression (0.89/0.84), Support Vector Regression (0.88/0.88), and other models. Spatially, midstream stations showed higher accuracy (R<sup>2</sup> &gt; 0.90) due to stable hydrodynamics, while downstream performance decreased from tidal effects. Key innovations including autoregressive lag features, sliding windows, and a multivariate evaluation framework significantly improved prediction accuracy. These findings demonstrate that ML models can enhance water level forecasting and disaster resilience in climate-vulnerable regions, even with limited data.</div></div>","PeriodicalId":52341,"journal":{"name":"Progress in Disaster Science","volume":"27 ","pages":"Article 100449"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Progress in Disaster Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590061725000468","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Reliable forecasting of extreme river water levels is vital for managing flood and drought risks in Bangladesh, a deltaic nation highly vulnerable to climate change. This study compares nine machine learning (ML) models for predicting monthly maximum and minimum water levels at three key stations along the Old Brahmaputra River using a 34-year dataset (1990–2024). Performance was assessed using ten metrics including RMSE, R2, and NSE. Random Forest Regression (RFR) consistently outperformed other models, achieving the highest accuracy for both maximum (RMSE: 0.64–0.77 m; R2: 0.87–0.92) and minimum water levels (RMSE: 0.49–0.66 m; R2: 0.82–0.92), while linear models underperformed in capturing nonlinear patterns. A PCA-based framework further validated RFR's robustness, with average normalized composite scores of 1.00 (maximum) and 0.99 (minimum), significantly surpassing Ensemble Regression (0.89/0.84), Support Vector Regression (0.88/0.88), and other models. Spatially, midstream stations showed higher accuracy (R2 > 0.90) due to stable hydrodynamics, while downstream performance decreased from tidal effects. Key innovations including autoregressive lag features, sliding windows, and a multivariate evaluation framework significantly improved prediction accuracy. These findings demonstrate that ML models can enhance water level forecasting and disaster resilience in climate-vulnerable regions, even with limited data.
孟加拉国极端河流水位预测的机器学习模型的比较评估:对洪水和干旱恢复能力的影响
孟加拉国是一个极易受到气候变化影响的三角洲国家,对极端河流水位的可靠预测对于管理洪水和干旱风险至关重要。本研究比较了九种机器学习(ML)模型,使用34年数据集(1990-2024)预测老雅鲁藏布江沿岸三个关键站点的每月最高和最低水位。使用RMSE、R2和NSE等10个指标评估性能。随机森林回归(RFR)始终优于其他模型,在两个最大值(RMSE: 0.64-0.77 m;R2: 0.87-0.92)和最小水位(RMSE: 0.49-0.66 m;R2: 0.82-0.92),而线性模型在捕获非线性模式方面表现不佳。基于pca的框架进一步验证了RFR的稳健性,平均归一化综合得分为1.00(最大值)和0.99(最小值),显著超过Ensemble Regression(0.89/0.84)、Support Vector Regression(0.88/0.88)等模型。空间上,中游站精度较高(R2 >;0.90)由于稳定的流体力学,而下游性能则因潮汐效应而下降。关键创新包括自回归滞后特征、滑动窗口和多变量评估框架,显著提高了预测精度。这些发现表明,即使数据有限,ML模型也可以增强气候脆弱地区的水位预测和抗灾能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Progress in Disaster Science
Progress in Disaster Science Social Sciences-Safety Research
CiteScore
14.60
自引率
3.20%
发文量
51
审稿时长
12 weeks
期刊介绍: Progress in Disaster Science is a Gold Open Access journal focusing on integrating research and policy in disaster research, and publishes original research papers and invited viewpoint articles on disaster risk reduction; response; emergency management and recovery. A key part of the Journal's Publication output will see key experts invited to assess and comment on the current trends in disaster research, as well as highlight key papers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信