预测粘性土干裂的概率机器学习

IF 3.7 2区 工程技术 Q3 ENGINEERING, ENVIRONMENTAL
Babak Jamhiri, Yongfu Xu, Mahdi Shadabfar, Susanga Costa
{"title":"预测粘性土干裂的概率机器学习","authors":"Babak Jamhiri,&nbsp;Yongfu Xu,&nbsp;Mahdi Shadabfar,&nbsp;Susanga Costa","doi":"10.1007/s10064-023-03366-2","DOIUrl":null,"url":null,"abstract":"<div><p>With frequent heatwaves and drought-downpour cycles, climate change gives rise to severe desiccation cracks. In this research, a probabilistic machine learning (ML) framework is developed to improve the deterministic models. Therefore, a complete set of data-driven soil and environment parameters, including initial water content (IWC), crack water content (CWC), final water content (FWC), soil layer thickness (SLT), temperature (Temp), and relative humidity (RH), is utilized as inputs to predict the crack surface ratio (CSR). Also, a comprehensive set of MLs, including an ensemble of regression trees (i.e., random forests [RF] and regression trees [RT]), gradient-boosted trees (viz. GBT and XGBT), support-vector machines (SVM), and artificial neural network-particle swarm optimization (ANN-PSO), is developed for predictions. Monte Carlo simulation (MCS) is then employed to insert uncertainties in the given models via shuffling and randomizing samples. Two sensitivity analyses, in particular input exclusion and partial dependence-individual conditional expectation plots, are further established to assess the prediction reliability. Results indicate that the performance ranking of developed MLs can be put as SVM &gt; GBT &gt; XGBT &gt; ANN-PSO &gt; RF &gt; RT. However, according to the probabilistic modeling based on the MCS, GBTs are highly capable for predictions with the lowest errors and uncertainties. The performance order of the models in terms of the higher coefficient of determination and lower standard deviation is GBT &gt; SVM &gt; XGBT &gt; RF &gt; ANN-PSO &gt; RT. The sensitivity analyses also categorized the parameter importance in the order of FWC &gt; CWC &gt; SLT &gt; IWC &gt; Temp &gt; RH. These findings demonstrate the immense capabilities of probabilistic MLs under uncertainties by measuring prediction error variances and hence improving performance precision.</p></div>","PeriodicalId":500,"journal":{"name":"Bulletin of Engineering Geology and the Environment","volume":"82 9","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2023-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Probabilistic machine learning for predicting desiccation cracks in clayey soils\",\"authors\":\"Babak Jamhiri,&nbsp;Yongfu Xu,&nbsp;Mahdi Shadabfar,&nbsp;Susanga Costa\",\"doi\":\"10.1007/s10064-023-03366-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>With frequent heatwaves and drought-downpour cycles, climate change gives rise to severe desiccation cracks. In this research, a probabilistic machine learning (ML) framework is developed to improve the deterministic models. Therefore, a complete set of data-driven soil and environment parameters, including initial water content (IWC), crack water content (CWC), final water content (FWC), soil layer thickness (SLT), temperature (Temp), and relative humidity (RH), is utilized as inputs to predict the crack surface ratio (CSR). Also, a comprehensive set of MLs, including an ensemble of regression trees (i.e., random forests [RF] and regression trees [RT]), gradient-boosted trees (viz. GBT and XGBT), support-vector machines (SVM), and artificial neural network-particle swarm optimization (ANN-PSO), is developed for predictions. Monte Carlo simulation (MCS) is then employed to insert uncertainties in the given models via shuffling and randomizing samples. Two sensitivity analyses, in particular input exclusion and partial dependence-individual conditional expectation plots, are further established to assess the prediction reliability. Results indicate that the performance ranking of developed MLs can be put as SVM &gt; GBT &gt; XGBT &gt; ANN-PSO &gt; RF &gt; RT. However, according to the probabilistic modeling based on the MCS, GBTs are highly capable for predictions with the lowest errors and uncertainties. The performance order of the models in terms of the higher coefficient of determination and lower standard deviation is GBT &gt; SVM &gt; XGBT &gt; RF &gt; ANN-PSO &gt; RT. The sensitivity analyses also categorized the parameter importance in the order of FWC &gt; CWC &gt; SLT &gt; IWC &gt; Temp &gt; RH. These findings demonstrate the immense capabilities of probabilistic MLs under uncertainties by measuring prediction error variances and hence improving performance precision.</p></div>\",\"PeriodicalId\":500,\"journal\":{\"name\":\"Bulletin of Engineering Geology and the Environment\",\"volume\":\"82 9\",\"pages\":\"\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2023-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bulletin of Engineering Geology and the Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10064-023-03366-2\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of Engineering Geology and the Environment","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10064-023-03366-2","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

随着频繁的热浪和旱雨循环,气候变化导致了严重的干旱裂缝。在本研究中,开发了一个概率机器学习(ML)框架来改进确定性模型。因此,将初始含水量(IWC)、裂缝含水量(CWC)、最终含水量(FWC)、土层厚度(SLT)、温度(Temp)、相对湿度(RH)等完整的数据驱动土壤和环境参数作为预测裂缝面比(CSR)的输入。此外,还开发了一套全面的机器学习,包括回归树(即随机森林[RF]和回归树[RT]),梯度增强树(即GBT和XGBT),支持向量机(SVM)和人工神经网络粒子群优化(ANN-PSO)的集合,用于预测。然后采用蒙特卡罗模拟(MCS)通过洗牌和随机化样本在给定模型中插入不确定性。进一步建立了两种敏感性分析,特别是输入排除和部分依赖-个体条件期望图,以评估预测的可靠性。结果表明,已开发ml的性能排序为SVM > GBT > XGBT > ANN-PSO > RF > RT。然而,根据基于MCS的概率建模,gbt具有较强的预测能力,其误差和不确定性最小。各模型在决定系数较高、标准差较低的表现顺序为:GBT > SVM > XGBT > RF > ANN-PSO > RT。敏感性分析还将参数重要性按FWC > CWC > SLT > IWC > Temp > RH排序。这些发现证明了概率机器学习在不确定性下的巨大能力,通过测量预测误差方差,从而提高性能精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Probabilistic machine learning for predicting desiccation cracks in clayey soils

With frequent heatwaves and drought-downpour cycles, climate change gives rise to severe desiccation cracks. In this research, a probabilistic machine learning (ML) framework is developed to improve the deterministic models. Therefore, a complete set of data-driven soil and environment parameters, including initial water content (IWC), crack water content (CWC), final water content (FWC), soil layer thickness (SLT), temperature (Temp), and relative humidity (RH), is utilized as inputs to predict the crack surface ratio (CSR). Also, a comprehensive set of MLs, including an ensemble of regression trees (i.e., random forests [RF] and regression trees [RT]), gradient-boosted trees (viz. GBT and XGBT), support-vector machines (SVM), and artificial neural network-particle swarm optimization (ANN-PSO), is developed for predictions. Monte Carlo simulation (MCS) is then employed to insert uncertainties in the given models via shuffling and randomizing samples. Two sensitivity analyses, in particular input exclusion and partial dependence-individual conditional expectation plots, are further established to assess the prediction reliability. Results indicate that the performance ranking of developed MLs can be put as SVM > GBT > XGBT > ANN-PSO > RF > RT. However, according to the probabilistic modeling based on the MCS, GBTs are highly capable for predictions with the lowest errors and uncertainties. The performance order of the models in terms of the higher coefficient of determination and lower standard deviation is GBT > SVM > XGBT > RF > ANN-PSO > RT. The sensitivity analyses also categorized the parameter importance in the order of FWC > CWC > SLT > IWC > Temp > RH. These findings demonstrate the immense capabilities of probabilistic MLs under uncertainties by measuring prediction error variances and hence improving performance precision.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Bulletin of Engineering Geology and the Environment
Bulletin of Engineering Geology and the Environment 工程技术-地球科学综合
CiteScore
7.10
自引率
11.90%
发文量
445
审稿时长
4.1 months
期刊介绍: Engineering geology is defined in the statutes of the IAEG as the science devoted to the investigation, study and solution of engineering and environmental problems which may arise as the result of the interaction between geology and the works or activities of man, as well as of the prediction of and development of measures for the prevention or remediation of geological hazards. Engineering geology embraces: • the applications/implications of the geomorphology, structural geology, and hydrogeological conditions of geological formations; • the characterisation of the mineralogical, physico-geomechanical, chemical and hydraulic properties of all earth materials involved in construction, resource recovery and environmental change; • the assessment of the mechanical and hydrological behaviour of soil and rock masses; • the prediction of changes to the above properties with time; • the determination of the parameters to be considered in the stability analysis of engineering works and earth masses.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信