Comparative evaluation of machine learning models for extreme river water level forecasting in Bangladesh: Implications for flood and drought resilience
Md Touhidul Islam , Sujan Chandra Roy , Nusrat Jahan , Al-Mahmud , Md Mazharul Islam , Abdullah Al Ferdaus , Kazunori Fujisawa , A.K.M. Adham
{"title":"Comparative evaluation of machine learning models for extreme river water level forecasting in Bangladesh: Implications for flood and drought resilience","authors":"Md Touhidul Islam , Sujan Chandra Roy , Nusrat Jahan , Al-Mahmud , Md Mazharul Islam , Abdullah Al Ferdaus , Kazunori Fujisawa , A.K.M. Adham","doi":"10.1016/j.pdisas.2025.100449","DOIUrl":null,"url":null,"abstract":"<div><div>Reliable forecasting of extreme river water levels is vital for managing flood and drought risks in Bangladesh, a deltaic nation highly vulnerable to climate change. This study compares nine machine learning (ML) models for predicting monthly maximum and minimum water levels at three key stations along the Old Brahmaputra River using a 34-year dataset (1990–2024). Performance was assessed using ten metrics including RMSE, R<sup>2</sup>, and NSE. Random Forest Regression (RFR) consistently outperformed other models, achieving the highest accuracy for both maximum (RMSE: 0.64–0.77 m; R<sup>2</sup>: 0.87–0.92) and minimum water levels (RMSE: 0.49–0.66 m; R<sup>2</sup>: 0.82–0.92), while linear models underperformed in capturing nonlinear patterns. A PCA-based framework further validated RFR's robustness, with average normalized composite scores of 1.00 (maximum) and 0.99 (minimum), significantly surpassing Ensemble Regression (0.89/0.84), Support Vector Regression (0.88/0.88), and other models. Spatially, midstream stations showed higher accuracy (R<sup>2</sup> > 0.90) due to stable hydrodynamics, while downstream performance decreased from tidal effects. Key innovations including autoregressive lag features, sliding windows, and a multivariate evaluation framework significantly improved prediction accuracy. These findings demonstrate that ML models can enhance water level forecasting and disaster resilience in climate-vulnerable regions, even with limited data.</div></div>","PeriodicalId":52341,"journal":{"name":"Progress in Disaster Science","volume":"27 ","pages":"Article 100449"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Progress in Disaster Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590061725000468","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Reliable forecasting of extreme river water levels is vital for managing flood and drought risks in Bangladesh, a deltaic nation highly vulnerable to climate change. This study compares nine machine learning (ML) models for predicting monthly maximum and minimum water levels at three key stations along the Old Brahmaputra River using a 34-year dataset (1990–2024). Performance was assessed using ten metrics including RMSE, R2, and NSE. Random Forest Regression (RFR) consistently outperformed other models, achieving the highest accuracy for both maximum (RMSE: 0.64–0.77 m; R2: 0.87–0.92) and minimum water levels (RMSE: 0.49–0.66 m; R2: 0.82–0.92), while linear models underperformed in capturing nonlinear patterns. A PCA-based framework further validated RFR's robustness, with average normalized composite scores of 1.00 (maximum) and 0.99 (minimum), significantly surpassing Ensemble Regression (0.89/0.84), Support Vector Regression (0.88/0.88), and other models. Spatially, midstream stations showed higher accuracy (R2 > 0.90) due to stable hydrodynamics, while downstream performance decreased from tidal effects. Key innovations including autoregressive lag features, sliding windows, and a multivariate evaluation framework significantly improved prediction accuracy. These findings demonstrate that ML models can enhance water level forecasting and disaster resilience in climate-vulnerable regions, even with limited data.
期刊介绍:
Progress in Disaster Science is a Gold Open Access journal focusing on integrating research and policy in disaster research, and publishes original research papers and invited viewpoint articles on disaster risk reduction; response; emergency management and recovery.
A key part of the Journal's Publication output will see key experts invited to assess and comment on the current trends in disaster research, as well as highlight key papers.