Identifying homogeneous hydrological zones for flood prediction using multivariable statistical methods and machine learning

IF 5.7 3区 环境科学与生态学 Q1 WATER RESOURCES
Sahar Safari, Mohammad Sadegh Sadeghian, Hooman Hajikandi, S. Sajad Mehdizadeh
{"title":"Identifying homogeneous hydrological zones for flood prediction using multivariable statistical methods and machine learning","authors":"Sahar Safari,&nbsp;Mohammad Sadegh Sadeghian,&nbsp;Hooman Hajikandi,&nbsp;S. Sajad Mehdizadeh","doi":"10.1007/s13201-024-02316-x","DOIUrl":null,"url":null,"abstract":"<div><p>One method for estimating floods in areas lacking statistical data is the use of regional frequency analysis based on machine learning. In this study, statistical and clustering-based approaches were evaluated for flood estimation in the Karkheh watershed. The hydrological homogeneity of the obtained zones was then assessed using linear moments and heterogeneity adjustment methods proposed by Hosking and Wallis. Then, the ZDIST statistic was used to calculate the three-parameter distributions for stations within each hydrologically homogeneous cluster. These parameters were computed using linear moments, and floods with different return periods at each station were estimated using regional relationships. The results indicated the creation of two clusters in this area, with five stations in cluster one and 11 stations in cluster two. The statistical homogeneity values for clusters one and two were calculated as 0.33 and 0.17, respectively, indicating the homogeneity of each region. Generalized Pearson type III and generalized extreme value distributions were selected as the best regional distributions for clusters 1 and 2, respectively. The results also showed that floods could be estimated for return periods of 2, 5, 25 years, and more. The highest estimated flood is predicted at the Jelugir-e Majin station, where the flood with a 2-year return period reaches 1034 m<sup>3</sup> s<sup>−1</sup>. This increases to 5360 m<sup>3</sup> s<sup>−1</sup> for a 100-year return period. The approach presented in this study is recommended for similar regions lacking complete information.</p></div>","PeriodicalId":8374,"journal":{"name":"Applied Water Science","volume":"14 12","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s13201-024-02316-x.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Water Science","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s13201-024-02316-x","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"WATER RESOURCES","Score":null,"Total":0}
引用次数: 0

Abstract

One method for estimating floods in areas lacking statistical data is the use of regional frequency analysis based on machine learning. In this study, statistical and clustering-based approaches were evaluated for flood estimation in the Karkheh watershed. The hydrological homogeneity of the obtained zones was then assessed using linear moments and heterogeneity adjustment methods proposed by Hosking and Wallis. Then, the ZDIST statistic was used to calculate the three-parameter distributions for stations within each hydrologically homogeneous cluster. These parameters were computed using linear moments, and floods with different return periods at each station were estimated using regional relationships. The results indicated the creation of two clusters in this area, with five stations in cluster one and 11 stations in cluster two. The statistical homogeneity values for clusters one and two were calculated as 0.33 and 0.17, respectively, indicating the homogeneity of each region. Generalized Pearson type III and generalized extreme value distributions were selected as the best regional distributions for clusters 1 and 2, respectively. The results also showed that floods could be estimated for return periods of 2, 5, 25 years, and more. The highest estimated flood is predicted at the Jelugir-e Majin station, where the flood with a 2-year return period reaches 1034 m3 s−1. This increases to 5360 m3 s−1 for a 100-year return period. The approach presented in this study is recommended for similar regions lacking complete information.

利用多变量统计方法和机器学习确定洪水预测的同质水文区
在缺乏统计数据的地区估算洪水的一种方法是使用基于机器学习的区域频率分析。在这项研究中,对基于统计和聚类的方法进行了评估,以估算 Karkheh 流域的洪水。然后,使用 Hosking 和 Wallis 提出的线性矩和异质性调整方法评估所获得区域的水文同质性。然后,使用 ZDIST 统计法计算每个水文同质性群组内各站的三参数分布。利用线性矩计算这些参数,并利用区域关系估算各站不同重现期的洪水。结果表明,该地区形成了两个群组,群组一有 5 个站点,群组二有 11 个站点。经计算,群组一和群组二的统计同质性值分别为 0.33 和 0.17,表明各区域具有同质性。广义皮尔逊 III 型分布和广义极值分布分别被选为群组 1 和群组 2 的最佳区域分布。结果还显示,可以估算出重现期为 2 年、5 年、25 年及以上的洪水。杰卢吉尔-马金站预测的洪水流量最大,2 年重现期的洪水流量达到 1034 立方米/秒。重现期为 100 年的洪水则增至 5360 立方米/秒。建议缺乏完整信息的类似地区采用本研究提出的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Applied Water Science
Applied Water Science WATER RESOURCES-
CiteScore
9.90
自引率
3.60%
发文量
268
审稿时长
13 weeks
期刊介绍:
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信