利用机器学习方法估算改良半圆筒形围堰的排流系数

IF 3.9 3区 环境科学与生态学 Q1 ENGINEERING, CIVIL
Reza Fatahi-Alkouhi, Ehsan Afaridegan, Nosratollah Amanian
{"title":"利用机器学习方法估算改良半圆筒形围堰的排流系数","authors":"Reza Fatahi-Alkouhi, Ehsan Afaridegan, Nosratollah Amanian","doi":"10.1007/s00477-024-02739-7","DOIUrl":null,"url":null,"abstract":"<p>Based on the principles design of hydrofoil weirs, Modified Semi-Cylindrical Weirs (MSCWs) incorporate an innovative tangential ramp along the downstream crest contour, thereby significantly enhancing their performance compared to conventional semi-cylindrical weirs. A pivotal parameter in the calculation of flow discharge over the weir is the discharge coefficient (<i>C</i><sub><i>d</i></sub>). This study involves a comprehensive comparative analysis of various <i>C</i><sub><i>d</i></sub> estimation methodologies for MSCWs, employing a range of machine learning-based models, notably including Artificial Neural Network (ANN), Multivariate Adaptive Regression Splines (MARS), M5 tree, Locally Weighted Polynomial Regression (LWPR), and Support Vector Machine (SVM) models. To begin, a feature selection analysis utilizing the Gamma Test (GT) method was conducted to identify the optimal input configuration for modeling the discharge of MSCWs. The results of the feature selection revealed that the <i>C</i><sub><i>d</i></sub> of the MSCWs is primarily influenced by the ratio of upstream flow depth (<i>y</i><sub><i>up</i></sub>) to crest radius (<i>R</i>), while showing negligible sensitivity to the slope of the downstream ramp (<i>θ</i>). The dataset was partitioned into two segments: 70% were assigned to the training stage, while the remaining 30% were allocated to the testing stage. The precision of <i>C</i><sub><i>d</i></sub> predictions is evaluated through four key statistical metrics: Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Square Error (RMSE), Correlation Coefficient (<i>R</i><sup>2</sup>), and Nash –Sutcliff Efficiency (NSE). The outcomes reveal that, for the training and testing phases, the <i>R</i><sup>2</sup> values for the ANN, MARS, M5 tree, LWPR and SVM models are respectively 0.967, 0.931, 0.974, 0.937, and 0.933, and 0.925, 0.953, 0.953, 0.980, and 0.954. Notably, the LWPR model outperforms the ANN, MARS, M5 tree, and SVM models, boasting MAE, MSE, RMSE, and NSE values of 0.0167, 0.0005, 0.0217, and 0.942 during training, and 0.0107, 0.0002, 0.0136, and 0.949 during testing. As a result, the LWPR model clearly emerges as the superior model, followed by the M5 model tree.</p>","PeriodicalId":21987,"journal":{"name":"Stochastic Environmental Research and Risk Assessment","volume":"12 1","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discharge coefficient estimation of modified semi-cylindrical weirs using machine learning approaches\",\"authors\":\"Reza Fatahi-Alkouhi, Ehsan Afaridegan, Nosratollah Amanian\",\"doi\":\"10.1007/s00477-024-02739-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Based on the principles design of hydrofoil weirs, Modified Semi-Cylindrical Weirs (MSCWs) incorporate an innovative tangential ramp along the downstream crest contour, thereby significantly enhancing their performance compared to conventional semi-cylindrical weirs. A pivotal parameter in the calculation of flow discharge over the weir is the discharge coefficient (<i>C</i><sub><i>d</i></sub>). This study involves a comprehensive comparative analysis of various <i>C</i><sub><i>d</i></sub> estimation methodologies for MSCWs, employing a range of machine learning-based models, notably including Artificial Neural Network (ANN), Multivariate Adaptive Regression Splines (MARS), M5 tree, Locally Weighted Polynomial Regression (LWPR), and Support Vector Machine (SVM) models. To begin, a feature selection analysis utilizing the Gamma Test (GT) method was conducted to identify the optimal input configuration for modeling the discharge of MSCWs. The results of the feature selection revealed that the <i>C</i><sub><i>d</i></sub> of the MSCWs is primarily influenced by the ratio of upstream flow depth (<i>y</i><sub><i>up</i></sub>) to crest radius (<i>R</i>), while showing negligible sensitivity to the slope of the downstream ramp (<i>θ</i>). The dataset was partitioned into two segments: 70% were assigned to the training stage, while the remaining 30% were allocated to the testing stage. The precision of <i>C</i><sub><i>d</i></sub> predictions is evaluated through four key statistical metrics: Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Square Error (RMSE), Correlation Coefficient (<i>R</i><sup>2</sup>), and Nash –Sutcliff Efficiency (NSE). The outcomes reveal that, for the training and testing phases, the <i>R</i><sup>2</sup> values for the ANN, MARS, M5 tree, LWPR and SVM models are respectively 0.967, 0.931, 0.974, 0.937, and 0.933, and 0.925, 0.953, 0.953, 0.980, and 0.954. Notably, the LWPR model outperforms the ANN, MARS, M5 tree, and SVM models, boasting MAE, MSE, RMSE, and NSE values of 0.0167, 0.0005, 0.0217, and 0.942 during training, and 0.0107, 0.0002, 0.0136, and 0.949 during testing. As a result, the LWPR model clearly emerges as the superior model, followed by the M5 model tree.</p>\",\"PeriodicalId\":21987,\"journal\":{\"name\":\"Stochastic Environmental Research and Risk Assessment\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-05-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stochastic Environmental Research and Risk Assessment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1007/s00477-024-02739-7\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stochastic Environmental Research and Risk Assessment","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s00477-024-02739-7","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

摘要

根据水翼堰的设计原理,改良半圆筒形堰(MSCW)沿下游堰顶轮廓线设计了一个创新的切向斜坡,从而使其性能大大优于传统的半圆筒形堰。计算堰上水流排量的一个关键参数是排量系数(Cd)。本研究采用一系列基于机器学习的模型,主要包括人工神经网络 (ANN)、多元自适应回归样条 (MARS)、M5 树、局部加权多项式回归 (LWPR) 和支持向量机 (SVM) 模型,对用于 MSCW 的各种 Cd 估算方法进行了全面的比较分析。首先,利用伽马测试(GT)方法进行了特征选择分析,以确定中层海洋水体排放建模的最佳输入配置。特征选择的结果表明,中横向水道的 Cd 主要受上游水流深度(yup)与坡顶半径(R)之比的影响,而对下游坡道坡度(θ)的敏感性可忽略不计。数据集被分为两部分:其中 70% 分配给训练阶段,其余 30% 分配给测试阶段。Cd 预测精度通过四个关键统计指标进行评估:平均绝对误差 (MAE)、平均平方误差 (MSE)、均方根误差 (RMSE)、相关系数 (R2) 和 Nash -Sutcliff 效率 (NSE)。结果显示,在训练和测试阶段,ANN、MARS、M5 树、LWPR 和 SVM 模型的 R2 值分别为 0.967、0.931、0.974、0.937 和 0.933,以及 0.925、0.953、0.953、0.980 和 0.954。值得注意的是,LWPR 模型优于 ANN、MARS、M5 树和 SVM 模型,在训练期间的 MAE、MSE、RMSE 和 NSE 值分别为 0.0167、0.0005、0.0217 和 0.942,在测试期间的 MAE、MSE、RMSE 和 NSE 值分别为 0.0107、0.0002、0.0136 和 0.949。因此,LWPR 模型明显优于 M5 模型树。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Discharge coefficient estimation of modified semi-cylindrical weirs using machine learning approaches

Discharge coefficient estimation of modified semi-cylindrical weirs using machine learning approaches

Based on the principles design of hydrofoil weirs, Modified Semi-Cylindrical Weirs (MSCWs) incorporate an innovative tangential ramp along the downstream crest contour, thereby significantly enhancing their performance compared to conventional semi-cylindrical weirs. A pivotal parameter in the calculation of flow discharge over the weir is the discharge coefficient (Cd). This study involves a comprehensive comparative analysis of various Cd estimation methodologies for MSCWs, employing a range of machine learning-based models, notably including Artificial Neural Network (ANN), Multivariate Adaptive Regression Splines (MARS), M5 tree, Locally Weighted Polynomial Regression (LWPR), and Support Vector Machine (SVM) models. To begin, a feature selection analysis utilizing the Gamma Test (GT) method was conducted to identify the optimal input configuration for modeling the discharge of MSCWs. The results of the feature selection revealed that the Cd of the MSCWs is primarily influenced by the ratio of upstream flow depth (yup) to crest radius (R), while showing negligible sensitivity to the slope of the downstream ramp (θ). The dataset was partitioned into two segments: 70% were assigned to the training stage, while the remaining 30% were allocated to the testing stage. The precision of Cd predictions is evaluated through four key statistical metrics: Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Square Error (RMSE), Correlation Coefficient (R2), and Nash –Sutcliff Efficiency (NSE). The outcomes reveal that, for the training and testing phases, the R2 values for the ANN, MARS, M5 tree, LWPR and SVM models are respectively 0.967, 0.931, 0.974, 0.937, and 0.933, and 0.925, 0.953, 0.953, 0.980, and 0.954. Notably, the LWPR model outperforms the ANN, MARS, M5 tree, and SVM models, boasting MAE, MSE, RMSE, and NSE values of 0.0167, 0.0005, 0.0217, and 0.942 during training, and 0.0107, 0.0002, 0.0136, and 0.949 during testing. As a result, the LWPR model clearly emerges as the superior model, followed by the M5 model tree.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.10
自引率
9.50%
发文量
189
审稿时长
3.8 months
期刊介绍: Stochastic Environmental Research and Risk Assessment (SERRA) will publish research papers, reviews and technical notes on stochastic and probabilistic approaches to environmental sciences and engineering, including interactions of earth and atmospheric environments with people and ecosystems. The basic idea is to bring together research papers on stochastic modelling in various fields of environmental sciences and to provide an interdisciplinary forum for the exchange of ideas, for communicating on issues that cut across disciplinary barriers, and for the dissemination of stochastic techniques used in different fields to the community of interested researchers. Original contributions will be considered dealing with modelling (theoretical and computational), measurements and instrumentation in one or more of the following topical areas: - Spatiotemporal analysis and mapping of natural processes. - Enviroinformatics. - Environmental risk assessment, reliability analysis and decision making. - Surface and subsurface hydrology and hydraulics. - Multiphase porous media domains and contaminant transport modelling. - Hazardous waste site characterization. - Stochastic turbulence and random hydrodynamic fields. - Chaotic and fractal systems. - Random waves and seafloor morphology. - Stochastic atmospheric and climate processes. - Air pollution and quality assessment research. - Modern geostatistics. - Mechanisms of pollutant formation, emission, exposure and absorption. - Physical, chemical and biological analysis of human exposure from single and multiple media and routes; control and protection. - Bioinformatics. - Probabilistic methods in ecology and population biology. - Epidemiological investigations. - Models using stochastic differential equations stochastic or partial differential equations. - Hazardous waste site characterization.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信