Uncertainty quantification in data-driven modelling with application to soil properties prediction

IF 5.6 1区 工程技术 Q1 ENGINEERING, GEOLOGICAL
Geng-Fu He, Zhen-Yu Yin, Pin Zhang
{"title":"Uncertainty quantification in data-driven modelling with application to soil properties prediction","authors":"Geng-Fu He,&nbsp;Zhen-Yu Yin,&nbsp;Pin Zhang","doi":"10.1007/s11440-024-02484-9","DOIUrl":null,"url":null,"abstract":"<div><p>Accurate estimation of soil properties is crucial for reliability-based design in engineering practices. Conventional empirical equations and prevalent data-driven models rarely consider uncertainty quantification in both measurement and modelling processes. This study tailors three uncertainty quantification methods including Bayesian learning, Markov chain Monte Carlo and ensemble learning into data-driven modelling, in which support vector regression is selected as the baseline algorithm. The compression index of clay is adopted as an example for model training and testing. In this context, Bayesian learning and Markov chain quantify uncertainty by considering the distribution of function and hyper-parameters, respectively, while different sampled data are employed to explore model uncertainty. These models are evaluated in terms of accuracy, reliability and cost-effectiveness and also compared with Gaussian process regression, etc. The results reveal that based on built-in structural risk minimization, sparse solution and uncertainty quantification, developed models can capture more accurate and reliable correlations from actual measured data over other methods. Their practicability and generalization ability are also verified on a new creep index database. The proposed probabilistic methods are also compiled into a user-friendly platform, showing a significant potential to enrich the data-driven modelling framework and be applied in other geotechnical properties.</p></div>","PeriodicalId":49308,"journal":{"name":"Acta Geotechnica","volume":"20 2","pages":"843 - 859"},"PeriodicalIF":5.6000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Geotechnica","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s11440-024-02484-9","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, GEOLOGICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate estimation of soil properties is crucial for reliability-based design in engineering practices. Conventional empirical equations and prevalent data-driven models rarely consider uncertainty quantification in both measurement and modelling processes. This study tailors three uncertainty quantification methods including Bayesian learning, Markov chain Monte Carlo and ensemble learning into data-driven modelling, in which support vector regression is selected as the baseline algorithm. The compression index of clay is adopted as an example for model training and testing. In this context, Bayesian learning and Markov chain quantify uncertainty by considering the distribution of function and hyper-parameters, respectively, while different sampled data are employed to explore model uncertainty. These models are evaluated in terms of accuracy, reliability and cost-effectiveness and also compared with Gaussian process regression, etc. The results reveal that based on built-in structural risk minimization, sparse solution and uncertainty quantification, developed models can capture more accurate and reliable correlations from actual measured data over other methods. Their practicability and generalization ability are also verified on a new creep index database. The proposed probabilistic methods are also compiled into a user-friendly platform, showing a significant potential to enrich the data-driven modelling framework and be applied in other geotechnical properties.

Abstract Image

数据驱动建模中的不确定性量化,应用于土壤特性预测
在工程实践中,准确估计土的性质对基于可靠性的设计至关重要。传统的经验方程和流行的数据驱动模型在测量和建模过程中很少考虑不确定性的量化。本研究将贝叶斯学习、马尔可夫链蒙特卡罗和集成学习三种不确定性量化方法定制为数据驱动建模,其中选择支持向量回归作为基线算法。以粘土的压缩指数为例,进行了模型的训练和测试。在这种情况下,贝叶斯学习和马尔可夫链分别通过考虑函数和超参数的分布来量化不确定性,同时使用不同的采样数据来探索模型的不确定性。对这些模型的准确性、可靠性和成本效益进行了评价,并与高斯过程回归等进行了比较。结果表明,基于结构风险最小化、稀疏解和不确定性量化,所建立的模型比其他方法更能准确、可靠地捕获实际测量数据的相关性。并在一个新的蠕变指数数据库上验证了它们的实用性和泛化能力。所提出的概率方法也被编译成一个用户友好的平台,显示出丰富数据驱动的建模框架和应用于其他岩土力学性质的巨大潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Acta Geotechnica
Acta Geotechnica ENGINEERING, GEOLOGICAL-
CiteScore
9.90
自引率
17.50%
发文量
297
审稿时长
4 months
期刊介绍: Acta Geotechnica is an international journal devoted to the publication and dissemination of basic and applied research in geoengineering – an interdisciplinary field dealing with geomaterials such as soils and rocks. Coverage emphasizes the interplay between geomechanical models and their engineering applications. The journal presents original research papers on fundamental concepts in geomechanics and their novel applications in geoengineering based on experimental, analytical and/or numerical approaches. The main purpose of the journal is to foster understanding of the fundamental mechanisms behind the phenomena and processes in geomaterials, from kilometer-scale problems as they occur in geoscience, and down to the nano-scale, with their potential impact on geoengineering. The journal strives to report and archive progress in the field in a timely manner, presenting research papers, review articles, short notes and letters to the editors.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信