{"title":"利用机器学习方法估算改良半圆筒形围堰的排流系数","authors":"Reza Fatahi-Alkouhi, Ehsan Afaridegan, Nosratollah Amanian","doi":"10.1007/s00477-024-02739-7","DOIUrl":null,"url":null,"abstract":"<p>Based on the principles design of hydrofoil weirs, Modified Semi-Cylindrical Weirs (MSCWs) incorporate an innovative tangential ramp along the downstream crest contour, thereby significantly enhancing their performance compared to conventional semi-cylindrical weirs. A pivotal parameter in the calculation of flow discharge over the weir is the discharge coefficient (<i>C</i><sub><i>d</i></sub>). This study involves a comprehensive comparative analysis of various <i>C</i><sub><i>d</i></sub> estimation methodologies for MSCWs, employing a range of machine learning-based models, notably including Artificial Neural Network (ANN), Multivariate Adaptive Regression Splines (MARS), M5 tree, Locally Weighted Polynomial Regression (LWPR), and Support Vector Machine (SVM) models. To begin, a feature selection analysis utilizing the Gamma Test (GT) method was conducted to identify the optimal input configuration for modeling the discharge of MSCWs. The results of the feature selection revealed that the <i>C</i><sub><i>d</i></sub> of the MSCWs is primarily influenced by the ratio of upstream flow depth (<i>y</i><sub><i>up</i></sub>) to crest radius (<i>R</i>), while showing negligible sensitivity to the slope of the downstream ramp (<i>θ</i>). The dataset was partitioned into two segments: 70% were assigned to the training stage, while the remaining 30% were allocated to the testing stage. The precision of <i>C</i><sub><i>d</i></sub> predictions is evaluated through four key statistical metrics: Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Square Error (RMSE), Correlation Coefficient (<i>R</i><sup>2</sup>), and Nash –Sutcliff Efficiency (NSE). The outcomes reveal that, for the training and testing phases, the <i>R</i><sup>2</sup> values for the ANN, MARS, M5 tree, LWPR and SVM models are respectively 0.967, 0.931, 0.974, 0.937, and 0.933, and 0.925, 0.953, 0.953, 0.980, and 0.954. Notably, the LWPR model outperforms the ANN, MARS, M5 tree, and SVM models, boasting MAE, MSE, RMSE, and NSE values of 0.0167, 0.0005, 0.0217, and 0.942 during training, and 0.0107, 0.0002, 0.0136, and 0.949 during testing. As a result, the LWPR model clearly emerges as the superior model, followed by the M5 model tree.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"12 1","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discharge coefficient estimation of modified semi-cylindrical weirs using machine learning approaches\",\"authors\":\"Reza Fatahi-Alkouhi, Ehsan Afaridegan, Nosratollah Amanian\",\"doi\":\"10.1007/s00477-024-02739-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Based on the principles design of hydrofoil weirs, Modified Semi-Cylindrical Weirs (MSCWs) incorporate an innovative tangential ramp along the downstream crest contour, thereby significantly enhancing their performance compared to conventional semi-cylindrical weirs. A pivotal parameter in the calculation of flow discharge over the weir is the discharge coefficient (<i>C</i><sub><i>d</i></sub>). This study involves a comprehensive comparative analysis of various <i>C</i><sub><i>d</i></sub> estimation methodologies for MSCWs, employing a range of machine learning-based models, notably including Artificial Neural Network (ANN), Multivariate Adaptive Regression Splines (MARS), M5 tree, Locally Weighted Polynomial Regression (LWPR), and Support Vector Machine (SVM) models. To begin, a feature selection analysis utilizing the Gamma Test (GT) method was conducted to identify the optimal input configuration for modeling the discharge of MSCWs. The results of the feature selection revealed that the <i>C</i><sub><i>d</i></sub> of the MSCWs is primarily influenced by the ratio of upstream flow depth (<i>y</i><sub><i>up</i></sub>) to crest radius (<i>R</i>), while showing negligible sensitivity to the slope of the downstream ramp (<i>θ</i>). The dataset was partitioned into two segments: 70% were assigned to the training stage, while the remaining 30% were allocated to the testing stage. The precision of <i>C</i><sub><i>d</i></sub> predictions is evaluated through four key statistical metrics: Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Square Error (RMSE), Correlation Coefficient (<i>R</i><sup>2</sup>), and Nash –Sutcliff Efficiency (NSE). The outcomes reveal that, for the training and testing phases, the <i>R</i><sup>2</sup> values for the ANN, MARS, M5 tree, LWPR and SVM models are respectively 0.967, 0.931, 0.974, 0.937, and 0.933, and 0.925, 0.953, 0.953, 0.980, and 0.954. Notably, the LWPR model outperforms the ANN, MARS, M5 tree, and SVM models, boasting MAE, MSE, RMSE, and NSE values of 0.0167, 0.0005, 0.0217, and 0.942 during training, and 0.0107, 0.0002, 0.0136, and 0.949 during testing. As a result, the LWPR model clearly emerges as the superior model, followed by the M5 model tree.</p>\",\"PeriodicalId\":21987,\"journal\":{\"name\":\"Stochastic Environmental Research and Risk Assessment\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-05-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stochastic Environmental Research and Risk Assessment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1007/s00477-024-02739-7\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Environmental Research and Risk Assessment","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s00477-024-02739-7","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Discharge coefficient estimation of modified semi-cylindrical weirs using machine learning approaches
Based on the principles design of hydrofoil weirs, Modified Semi-Cylindrical Weirs (MSCWs) incorporate an innovative tangential ramp along the downstream crest contour, thereby significantly enhancing their performance compared to conventional semi-cylindrical weirs. A pivotal parameter in the calculation of flow discharge over the weir is the discharge coefficient (Cd). This study involves a comprehensive comparative analysis of various Cd estimation methodologies for MSCWs, employing a range of machine learning-based models, notably including Artificial Neural Network (ANN), Multivariate Adaptive Regression Splines (MARS), M5 tree, Locally Weighted Polynomial Regression (LWPR), and Support Vector Machine (SVM) models. To begin, a feature selection analysis utilizing the Gamma Test (GT) method was conducted to identify the optimal input configuration for modeling the discharge of MSCWs. The results of the feature selection revealed that the Cd of the MSCWs is primarily influenced by the ratio of upstream flow depth (yup) to crest radius (R), while showing negligible sensitivity to the slope of the downstream ramp (θ). The dataset was partitioned into two segments: 70% were assigned to the training stage, while the remaining 30% were allocated to the testing stage. The precision of Cd predictions is evaluated through four key statistical metrics: Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Square Error (RMSE), Correlation Coefficient (R2), and Nash –Sutcliff Efficiency (NSE). The outcomes reveal that, for the training and testing phases, the R2 values for the ANN, MARS, M5 tree, LWPR and SVM models are respectively 0.967, 0.931, 0.974, 0.937, and 0.933, and 0.925, 0.953, 0.953, 0.980, and 0.954. Notably, the LWPR model outperforms the ANN, MARS, M5 tree, and SVM models, boasting MAE, MSE, RMSE, and NSE values of 0.0167, 0.0005, 0.0217, and 0.942 during training, and 0.0107, 0.0002, 0.0136, and 0.949 during testing. As a result, the LWPR model clearly emerges as the superior model, followed by the M5 model tree.
期刊介绍:
Stochastic Environmental Research and Risk Assessment (SERRA) will publish research papers, reviews and technical notes on stochastic and probabilistic approaches to environmental sciences and engineering, including interactions of earth and atmospheric environments with people and ecosystems. The basic idea is to bring together research papers on stochastic modelling in various fields of environmental sciences and to provide an interdisciplinary forum for the exchange of ideas, for communicating on issues that cut across disciplinary barriers, and for the dissemination of stochastic techniques used in different fields to the community of interested researchers. Original contributions will be considered dealing with modelling (theoretical and computational), measurements and instrumentation in one or more of the following topical areas:
- Spatiotemporal analysis and mapping of natural processes.
- Enviroinformatics.
- Environmental risk assessment, reliability analysis and decision making.
- Surface and subsurface hydrology and hydraulics.
- Multiphase porous media domains and contaminant transport modelling.
- Hazardous waste site characterization.
- Stochastic turbulence and random hydrodynamic fields.
- Chaotic and fractal systems.
- Random waves and seafloor morphology.
- Stochastic atmospheric and climate processes.
- Air pollution and quality assessment research.
- Modern geostatistics.
- Mechanisms of pollutant formation, emission, exposure and absorption.
- Physical, chemical and biological analysis of human exposure from single and multiple media and routes; control and protection.
- Bioinformatics.
- Probabilistic methods in ecology and population biology.
- Epidemiological investigations.
- Models using stochastic differential equations stochastic or partial differential equations.
- Hazardous waste site characterization.