Xuemei Wang , Ronghua Liu , Chaoxing Sun , Xiaoyan Zhai , Liuqian Ding , Xiao Liu , Xiaolei Zhang
{"title":"Optimizing flood resilience in China’s mountainous areas: Design flood estimation using advanced machine learning techniques","authors":"Xuemei Wang , Ronghua Liu , Chaoxing Sun , Xiaoyan Zhai , Liuqian Ding , Xiao Liu , Xiaolei Zhang","doi":"10.1016/j.ejrh.2025.102345","DOIUrl":null,"url":null,"abstract":"<div><h3>Study region</h3><div>China</div></div><div><h3>Study focus</h3><div>We developed machine learning (ML) models for design flood estimation in mountainous catchments (≤ 500 km²) across China. This process considered different ML algorithms (random forest, extreme gradient boosting, and support vector regression), model scopes (nation and hydrological zones), and feature input sets (1–14 features) to optimize model development strategies.</div></div><div><h3>New hydrological insights for the region</h3><div>Based on estimation performance and hyperparameter tuning efficiency, random forest was found to be the optimal algorithm. The optimal model scope resulted in five distinct models: a single lumped model encompassing six eastern zones and four separate zonal models for the western zones. Considering both accuracy and efficiency, the optimal number of input features ranged from 5 to 14 for different models. High estimation accuracy was observed in the Qinba-Dabie North, Southeast, Southwest, and Yunnan-Tibet Zone, with average <em>RMSE</em>, <em>R</em>², <em>MQE</em>, and <em>QR</em> ranging from 55.90 to 103.97, 0.83–0.93, 45.62–65.77 %, and 55.90–60.98 %, respectively, for the test set across different return periods. The remaining zones exhibited moderate accuracy, with the Northwest Basin Zone demonstrating particularly low accuracy due to fewer catchments. Notably, catchments with areas > 100 km² demonstrated higher estimation accuracy, with an average 60 % reduction in <em>MQE</em> and a 30 % increase in <em>QR</em> compared to catchments of all sizes. This study provides crucial reference and data support for national flash flood prevention efforts.</div></div>","PeriodicalId":48620,"journal":{"name":"Journal of Hydrology-Regional Studies","volume":"59 ","pages":"Article 102345"},"PeriodicalIF":4.7000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrology-Regional Studies","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214581825001703","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"WATER RESOURCES","Score":null,"Total":0}
引用次数: 0
Abstract
Study region
China
Study focus
We developed machine learning (ML) models for design flood estimation in mountainous catchments (≤ 500 km²) across China. This process considered different ML algorithms (random forest, extreme gradient boosting, and support vector regression), model scopes (nation and hydrological zones), and feature input sets (1–14 features) to optimize model development strategies.
New hydrological insights for the region
Based on estimation performance and hyperparameter tuning efficiency, random forest was found to be the optimal algorithm. The optimal model scope resulted in five distinct models: a single lumped model encompassing six eastern zones and four separate zonal models for the western zones. Considering both accuracy and efficiency, the optimal number of input features ranged from 5 to 14 for different models. High estimation accuracy was observed in the Qinba-Dabie North, Southeast, Southwest, and Yunnan-Tibet Zone, with average RMSE, R², MQE, and QR ranging from 55.90 to 103.97, 0.83–0.93, 45.62–65.77 %, and 55.90–60.98 %, respectively, for the test set across different return periods. The remaining zones exhibited moderate accuracy, with the Northwest Basin Zone demonstrating particularly low accuracy due to fewer catchments. Notably, catchments with areas > 100 km² demonstrated higher estimation accuracy, with an average 60 % reduction in MQE and a 30 % increase in QR compared to catchments of all sizes. This study provides crucial reference and data support for national flash flood prevention efforts.
期刊介绍:
Journal of Hydrology: Regional Studies publishes original research papers enhancing the science of hydrology and aiming at region-specific problems, past and future conditions, analysis, review and solutions. The journal particularly welcomes research papers that deliver new insights into region-specific hydrological processes and responses to changing conditions, as well as contributions that incorporate interdisciplinarity and translational science.