A stacking-based machine learning framework for predicting the unconfined compressive strength of frozen soil with missing data imputation

IF 5.5 2区 工程技术 Q1 ENGINEERING, CIVIL
Zhengyu Li , Guanya Lu , Xiyin Zhang , Bingzhe Zhang , Xuhao Lv
{"title":"A stacking-based machine learning framework for predicting the unconfined compressive strength of frozen soil with missing data imputation","authors":"Zhengyu Li ,&nbsp;Guanya Lu ,&nbsp;Xiyin Zhang ,&nbsp;Bingzhe Zhang ,&nbsp;Xuhao Lv","doi":"10.1016/j.trgeo.2025.101730","DOIUrl":null,"url":null,"abstract":"<div><div>In cold-region engineering and artificial ground freezing applications, the unconfined compressive strength (UCS, <em>σ<sub>m</sub></em>) and failure strain (<em>ε<sub>f</sub></em>) of frozen soil are critical mechanical parameters for design and analysis. However, precisely predicting frozen soil mechanical behavior under complex conditions remains a significant challenge. This study compiled a dataset of 1,346 unconfined compression test records for frozen soils and addressed missing data via multivariate imputation by chained equations (MICE) utilizing a Random Forest (RF) algorithm. Leveraging Bayesian optimization (BO) and 10‐fold cross‐validation, we developed a stacked machine learning model combining three eXtreme Gradient Boosting (XGBoost) predictors for integrated classification and regression tasks. Compared to conventional empirical formulations, the proposed model demonstrates significant improvements in predictive accuracy for stress–strain curve types, <em>σ<sub>m</sub></em>, and <em>ε<sub>f</sub></em>. To enhance the model’s interpretability, we employed the SHAP (Shapley Additive Explanations) method to explain the impact of each feature on predictions. Furthermore, for scenarios with constrained data availability, two stacking models requiring fewer input features were constructed. Collectively, the stacking ensemble framework provides a robust and interpretable methodology for the accurate prediction of frozen soil mechanical properties under diverse and complex conditions.</div></div>","PeriodicalId":56013,"journal":{"name":"Transportation Geotechnics","volume":"55 ","pages":"Article 101730"},"PeriodicalIF":5.5000,"publicationDate":"2025-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Geotechnics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214391225002491","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

Abstract

In cold-region engineering and artificial ground freezing applications, the unconfined compressive strength (UCS, σm) and failure strain (εf) of frozen soil are critical mechanical parameters for design and analysis. However, precisely predicting frozen soil mechanical behavior under complex conditions remains a significant challenge. This study compiled a dataset of 1,346 unconfined compression test records for frozen soils and addressed missing data via multivariate imputation by chained equations (MICE) utilizing a Random Forest (RF) algorithm. Leveraging Bayesian optimization (BO) and 10‐fold cross‐validation, we developed a stacked machine learning model combining three eXtreme Gradient Boosting (XGBoost) predictors for integrated classification and regression tasks. Compared to conventional empirical formulations, the proposed model demonstrates significant improvements in predictive accuracy for stress–strain curve types, σm, and εf. To enhance the model’s interpretability, we employed the SHAP (Shapley Additive Explanations) method to explain the impact of each feature on predictions. Furthermore, for scenarios with constrained data availability, two stacking models requiring fewer input features were constructed. Collectively, the stacking ensemble framework provides a robust and interpretable methodology for the accurate prediction of frozen soil mechanical properties under diverse and complex conditions.
基于缺失数据的无侧限冻土抗压强度预测机器学习框架
在寒区工程和人工冻结工程中,冻土的无侧限抗压强度(UCS, σm)和破坏应变(εf)是设计和分析的关键力学参数。然而,精确预测冻土在复杂条件下的力学行为仍然是一个重大挑战。本研究编制了1346个冻土无侧限压缩试验记录的数据集,并利用随机森林(RF)算法通过链式方程(MICE)多变量代入解决了缺失数据。利用贝叶斯优化(BO)和10倍交叉验证,我们开发了一个堆叠机器学习模型,该模型结合了三个极端梯度增强(XGBoost)预测因子,用于集成分类和回归任务。与传统的经验公式相比,该模型对应力-应变曲线类型、σm和εf的预测精度有显著提高。为了提高模型的可解释性,我们采用SHAP (Shapley Additive Explanations)方法来解释每个特征对预测的影响。此外,对于数据可用性受限的场景,构建了两个需要较少输入特征的叠加模型。总的来说,叠加系综框架为准确预测多种复杂条件下的冻土力学特性提供了一种可靠且可解释的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Transportation Geotechnics
Transportation Geotechnics Social Sciences-Transportation
CiteScore
8.10
自引率
11.30%
发文量
194
审稿时长
51 days
期刊介绍: Transportation Geotechnics is a journal dedicated to publishing high-quality, theoretical, and applied papers that cover all facets of geotechnics for transportation infrastructure such as roads, highways, railways, underground railways, airfields, and waterways. The journal places a special emphasis on case studies that present original work relevant to the sustainable construction of transportation infrastructure. The scope of topics it addresses includes the geotechnical properties of geomaterials for sustainable and rational design and construction, the behavior of compacted and stabilized geomaterials, the use of geosynthetics and reinforcement in constructed layers and interlayers, ground improvement and slope stability for transportation infrastructures, compaction technology and management, maintenance technology, the impact of climate, embankments for highways and high-speed trains, transition zones, dredging, underwater geotechnics for infrastructure purposes, and the modeling of multi-layered structures and supporting ground under dynamic and repeated loads.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信