Nikunj K. Mangukiya, Abhishek Kumar, Ashutosh Sharma
{"title":"基于机器学习的印度流域区域洪水频率分析","authors":"Nikunj K. Mangukiya, Abhishek Kumar, Ashutosh Sharma","doi":"10.1002/clen.202300120","DOIUrl":null,"url":null,"abstract":"Abstract Estimates of the flood quantile for ungauged watersheds are crucial for water resources management but challenging due to the nonlinear complex hydrological system. For ungauged watersheds, estimating flood quantiles relies on various interdependent physiometeorological variables, many of which are not adequately considered in regional flood frequency analysis (RFFA). In this study, we utilized the random forest (RF) and support vector regression (SVR) algorithms, which can learn the nonlinear relationship between the physiometeorological variables and flood quantiles for RFFA. Thirteen physiometeorological variables that were not collectively employed before were used to estimate the 10‐year, 50‐year, and 100‐year return period flood quantiles (Q10, Q50, and Q100), respectively, for 39 watersheds spread across India. The RF and SVR models were trained on 29 (75%) watersheds to estimate individual flood quantiles and were subsequently tested on the remaining ten (25%) ungauged watersheds. The R 2 achieved by RF is 0.862, 0.813, and 0.845, and SVR is 0.807, 0.793, and 0.789 for Q10, Q50, and Q100, respectively. Overall, the results indicate that RF can effectively learn the nonlinear relationships, while SVR with a linear kernel requires further improvement to estimate reliable flood quantiles. The study demonstrates that machine learning algorithms, with appropriate physiometeorological input datasets, can be used to estimate flood quantiles even in the sparse data region.","PeriodicalId":10306,"journal":{"name":"Clean-soil Air Water","volume":"54 1","pages":"0"},"PeriodicalIF":1.5000,"publicationDate":"2023-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning‐based regional flood frequency analysis of Indian watersheds\",\"authors\":\"Nikunj K. Mangukiya, Abhishek Kumar, Ashutosh Sharma\",\"doi\":\"10.1002/clen.202300120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Estimates of the flood quantile for ungauged watersheds are crucial for water resources management but challenging due to the nonlinear complex hydrological system. For ungauged watersheds, estimating flood quantiles relies on various interdependent physiometeorological variables, many of which are not adequately considered in regional flood frequency analysis (RFFA). In this study, we utilized the random forest (RF) and support vector regression (SVR) algorithms, which can learn the nonlinear relationship between the physiometeorological variables and flood quantiles for RFFA. Thirteen physiometeorological variables that were not collectively employed before were used to estimate the 10‐year, 50‐year, and 100‐year return period flood quantiles (Q10, Q50, and Q100), respectively, for 39 watersheds spread across India. The RF and SVR models were trained on 29 (75%) watersheds to estimate individual flood quantiles and were subsequently tested on the remaining ten (25%) ungauged watersheds. The R 2 achieved by RF is 0.862, 0.813, and 0.845, and SVR is 0.807, 0.793, and 0.789 for Q10, Q50, and Q100, respectively. Overall, the results indicate that RF can effectively learn the nonlinear relationships, while SVR with a linear kernel requires further improvement to estimate reliable flood quantiles. The study demonstrates that machine learning algorithms, with appropriate physiometeorological input datasets, can be used to estimate flood quantiles even in the sparse data region.\",\"PeriodicalId\":10306,\"journal\":{\"name\":\"Clean-soil Air Water\",\"volume\":\"54 1\",\"pages\":\"0\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2023-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clean-soil Air Water\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/clen.202300120\",\"RegionNum\":4,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clean-soil Air Water","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/clen.202300120","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Machine learning‐based regional flood frequency analysis of Indian watersheds
Abstract Estimates of the flood quantile for ungauged watersheds are crucial for water resources management but challenging due to the nonlinear complex hydrological system. For ungauged watersheds, estimating flood quantiles relies on various interdependent physiometeorological variables, many of which are not adequately considered in regional flood frequency analysis (RFFA). In this study, we utilized the random forest (RF) and support vector regression (SVR) algorithms, which can learn the nonlinear relationship between the physiometeorological variables and flood quantiles for RFFA. Thirteen physiometeorological variables that were not collectively employed before were used to estimate the 10‐year, 50‐year, and 100‐year return period flood quantiles (Q10, Q50, and Q100), respectively, for 39 watersheds spread across India. The RF and SVR models were trained on 29 (75%) watersheds to estimate individual flood quantiles and were subsequently tested on the remaining ten (25%) ungauged watersheds. The R 2 achieved by RF is 0.862, 0.813, and 0.845, and SVR is 0.807, 0.793, and 0.789 for Q10, Q50, and Q100, respectively. Overall, the results indicate that RF can effectively learn the nonlinear relationships, while SVR with a linear kernel requires further improvement to estimate reliable flood quantiles. The study demonstrates that machine learning algorithms, with appropriate physiometeorological input datasets, can be used to estimate flood quantiles even in the sparse data region.
期刊介绍:
CLEAN covers all aspects of Sustainability and Environmental Safety. The journal focuses on organ/human--environment interactions giving interdisciplinary insights on a broad range of topics including air pollution, waste management, the water cycle, and environmental conservation. With a 2019 Journal Impact Factor of 1.603 (Journal Citation Reports (Clarivate Analytics, 2020), the journal publishes an attractive mixture of peer-reviewed scientific reviews, research papers, and short communications.
Papers dealing with environmental sustainability issues from such fields as agriculture, biological sciences, energy, food sciences, geography, geology, meteorology, nutrition, soil and water sciences, etc., are welcome.