{"title":"利用监督机器学习进行法国大都市区域水文灾害评估","authors":"Qifan Ding, Patrick Arnaud","doi":"10.1016/j.ejrh.2024.101872","DOIUrl":null,"url":null,"abstract":"<div><h3>Study region</h3><p>This study is carried out for 1929 gauged catchments in France, ranging from 1 to 10,000 km², where quality hydrometric observations are available for flood frequency analysis.</p></div><div><h3>Study focus</h3><p>The regional estimation of hydrological hazards is studied for flood risk management and prevention in hydrology. For gauged catchments, flow quantiles can be estimated from observations using statistical approaches based on suitable probability distributions or simulation approaches based on rainfall-runoff transformation models. For ungauged catchments, the lack of hydrological observations means that we have to extrapolate our knowledge of hazards from gauged catchments to ungauged catchments, using regionalization methods. It is therefore necessary to combine regionalization methods with the implemented hazard estimation approach. In this paper, two popular machine learning methods, Random Forest and Neural Networks, are tested and compared as regionalization methods. A classical regionalization method using multiple linear regression is also applied as a benchmark to evaluate the performance of all configurations. All these regionalization methods are applied to a simulation-based approach (the SHYREG method) and to a statistical-based approach using generalized extreme value distribution (GEV).</p></div><div><h3>New hydrological insights</h3><p></p><ul><li><span>•</span><span><p>Regionalization approaches based on multiple linear regression have limitations to explain parameters with environmental descriptors in Regional Flood Frequency Analysis (RFFA) domain.</p></span></li></ul><ul><li><span>•</span><span><p>Regionalizing RFFA parameters using Random Forest allows more explanatory variables to be considered through non-linear relationships, resulting in better parameter estimation.</p></span></li></ul><p></p><ul><li><span>•</span><span><p>Machine learning techniques can better handle environmental descriptors for regionalization, this providing a notable performance improvement, especially for the statistical approach.</p></span></li></ul><ul><li><span>•</span><span><p>The tested simulation-based approach is less sensitive to the choice of spatial interpolation method than the studied statistical approach.</p></span></li></ul></div>","PeriodicalId":48620,"journal":{"name":"Journal of Hydrology-Regional Studies","volume":null,"pages":null},"PeriodicalIF":4.7000,"publicationDate":"2024-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2214581824002209/pdfft?md5=faec473c186362781cbf896606b11f35&pid=1-s2.0-S2214581824002209-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Using supervised machine learning for regional hydrological hazard estimation in metropolitan France\",\"authors\":\"Qifan Ding, Patrick Arnaud\",\"doi\":\"10.1016/j.ejrh.2024.101872\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Study region</h3><p>This study is carried out for 1929 gauged catchments in France, ranging from 1 to 10,000 km², where quality hydrometric observations are available for flood frequency analysis.</p></div><div><h3>Study focus</h3><p>The regional estimation of hydrological hazards is studied for flood risk management and prevention in hydrology. For gauged catchments, flow quantiles can be estimated from observations using statistical approaches based on suitable probability distributions or simulation approaches based on rainfall-runoff transformation models. For ungauged catchments, the lack of hydrological observations means that we have to extrapolate our knowledge of hazards from gauged catchments to ungauged catchments, using regionalization methods. It is therefore necessary to combine regionalization methods with the implemented hazard estimation approach. In this paper, two popular machine learning methods, Random Forest and Neural Networks, are tested and compared as regionalization methods. A classical regionalization method using multiple linear regression is also applied as a benchmark to evaluate the performance of all configurations. All these regionalization methods are applied to a simulation-based approach (the SHYREG method) and to a statistical-based approach using generalized extreme value distribution (GEV).</p></div><div><h3>New hydrological insights</h3><p></p><ul><li><span>•</span><span><p>Regionalization approaches based on multiple linear regression have limitations to explain parameters with environmental descriptors in Regional Flood Frequency Analysis (RFFA) domain.</p></span></li></ul><ul><li><span>•</span><span><p>Regionalizing RFFA parameters using Random Forest allows more explanatory variables to be considered through non-linear relationships, resulting in better parameter estimation.</p></span></li></ul><p></p><ul><li><span>•</span><span><p>Machine learning techniques can better handle environmental descriptors for regionalization, this providing a notable performance improvement, especially for the statistical approach.</p></span></li></ul><ul><li><span>•</span><span><p>The tested simulation-based approach is less sensitive to the choice of spatial interpolation method than the studied statistical approach.</p></span></li></ul></div>\",\"PeriodicalId\":48620,\"journal\":{\"name\":\"Journal of Hydrology-Regional Studies\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2024-06-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2214581824002209/pdfft?md5=faec473c186362781cbf896606b11f35&pid=1-s2.0-S2214581824002209-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hydrology-Regional Studies\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2214581824002209\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"WATER RESOURCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrology-Regional Studies","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214581824002209","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"WATER RESOURCES","Score":null,"Total":0}
Using supervised machine learning for regional hydrological hazard estimation in metropolitan France
Study region
This study is carried out for 1929 gauged catchments in France, ranging from 1 to 10,000 km², where quality hydrometric observations are available for flood frequency analysis.
Study focus
The regional estimation of hydrological hazards is studied for flood risk management and prevention in hydrology. For gauged catchments, flow quantiles can be estimated from observations using statistical approaches based on suitable probability distributions or simulation approaches based on rainfall-runoff transformation models. For ungauged catchments, the lack of hydrological observations means that we have to extrapolate our knowledge of hazards from gauged catchments to ungauged catchments, using regionalization methods. It is therefore necessary to combine regionalization methods with the implemented hazard estimation approach. In this paper, two popular machine learning methods, Random Forest and Neural Networks, are tested and compared as regionalization methods. A classical regionalization method using multiple linear regression is also applied as a benchmark to evaluate the performance of all configurations. All these regionalization methods are applied to a simulation-based approach (the SHYREG method) and to a statistical-based approach using generalized extreme value distribution (GEV).
New hydrological insights
•
Regionalization approaches based on multiple linear regression have limitations to explain parameters with environmental descriptors in Regional Flood Frequency Analysis (RFFA) domain.
•
Regionalizing RFFA parameters using Random Forest allows more explanatory variables to be considered through non-linear relationships, resulting in better parameter estimation.
•
Machine learning techniques can better handle environmental descriptors for regionalization, this providing a notable performance improvement, especially for the statistical approach.
•
The tested simulation-based approach is less sensitive to the choice of spatial interpolation method than the studied statistical approach.
期刊介绍:
Journal of Hydrology: Regional Studies publishes original research papers enhancing the science of hydrology and aiming at region-specific problems, past and future conditions, analysis, review and solutions. The journal particularly welcomes research papers that deliver new insights into region-specific hydrological processes and responses to changing conditions, as well as contributions that incorporate interdisciplinarity and translational science.