基于XGBoost和可解释机器学习的TBM岩体分类

IF 9.9 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yaoqi Nie , Qian Zhang , Lili Hou , Lijie Du , Xiaolong Zhao , Yujie Xue , ZhiCheng Lin , Xiuxiu Cao , Zixin Wang
{"title":"基于XGBoost和可解释机器学习的TBM岩体分类","authors":"Yaoqi Nie ,&nbsp;Qian Zhang ,&nbsp;Lili Hou ,&nbsp;Lijie Du ,&nbsp;Xiaolong Zhao ,&nbsp;Yujie Xue ,&nbsp;ZhiCheng Lin ,&nbsp;Xiuxiu Cao ,&nbsp;Zixin Wang","doi":"10.1016/j.aei.2025.103459","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate classification of surrounding rock grades during Tunnel Boring Machine (TBM) operations is essential for optimizing excavation parameters and ensuring construction safety. However, traditional classification methods face challenges, particularly in adapting to complex geological conditions and achieving high efficiency in model optimization. To address these issues, this study proposes an identification methodology based on the COA-XGBoost framework. Initially, raw data is preprocessed to eliminate anomalous instances, such as those arising from equipment malfunctions or maintenance activities, using a state discrimination function. To better capture the interaction dynamics between the TBM and the surrounding rock, a novel feature parameter, specific energy, is introduced. The Relief algorithm is employed to evaluate feature importance and assign weights to TBM operational parameters, amplifying the role of critical features while minimizing interference from redundant ones. Subsequently, the Coati Optimization Algorithm (COA) is integrated with the XGBoost classifier, enabling global parameter optimization and enhancing the model’s performance. The proposed method was validated using data from the water conveyance tunnel section of the Yinchuojiliao Project. Results demonstrate that the Relief algorithm effectively enhances the model’s ability to identify critical parameters through feature weighting. The COA-optimized XGBoost model achieves a significant improvement in recognition accuracy compared to traditional approaches, with accuracy gains of approximately 8%, particularly excelling under complex geological conditions such as Class IV and Class V surrounding rocks. The final model achieves an accuracy of 91.5%, with recall and F1 scores reaching 91.5% and 91.4%, respectively. Additionally, five-fold cross-validation highlights the model’s robustness and generalization capabilities. Furthermore, a SHAP-based interpretability analysis clarifies the relative contributions of each feature to the classification outcome, offering valuable insights for practical TBM parameter optimization.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"66 ","pages":"Article 103459"},"PeriodicalIF":9.9000,"publicationDate":"2025-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"TBM rock mass classification using XGBoost and Interpretable Machine learning\",\"authors\":\"Yaoqi Nie ,&nbsp;Qian Zhang ,&nbsp;Lili Hou ,&nbsp;Lijie Du ,&nbsp;Xiaolong Zhao ,&nbsp;Yujie Xue ,&nbsp;ZhiCheng Lin ,&nbsp;Xiuxiu Cao ,&nbsp;Zixin Wang\",\"doi\":\"10.1016/j.aei.2025.103459\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate classification of surrounding rock grades during Tunnel Boring Machine (TBM) operations is essential for optimizing excavation parameters and ensuring construction safety. However, traditional classification methods face challenges, particularly in adapting to complex geological conditions and achieving high efficiency in model optimization. To address these issues, this study proposes an identification methodology based on the COA-XGBoost framework. Initially, raw data is preprocessed to eliminate anomalous instances, such as those arising from equipment malfunctions or maintenance activities, using a state discrimination function. To better capture the interaction dynamics between the TBM and the surrounding rock, a novel feature parameter, specific energy, is introduced. The Relief algorithm is employed to evaluate feature importance and assign weights to TBM operational parameters, amplifying the role of critical features while minimizing interference from redundant ones. Subsequently, the Coati Optimization Algorithm (COA) is integrated with the XGBoost classifier, enabling global parameter optimization and enhancing the model’s performance. The proposed method was validated using data from the water conveyance tunnel section of the Yinchuojiliao Project. Results demonstrate that the Relief algorithm effectively enhances the model’s ability to identify critical parameters through feature weighting. The COA-optimized XGBoost model achieves a significant improvement in recognition accuracy compared to traditional approaches, with accuracy gains of approximately 8%, particularly excelling under complex geological conditions such as Class IV and Class V surrounding rocks. The final model achieves an accuracy of 91.5%, with recall and F1 scores reaching 91.5% and 91.4%, respectively. Additionally, five-fold cross-validation highlights the model’s robustness and generalization capabilities. Furthermore, a SHAP-based interpretability analysis clarifies the relative contributions of each feature to the classification outcome, offering valuable insights for practical TBM parameter optimization.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"66 \",\"pages\":\"Article 103459\"},\"PeriodicalIF\":9.9000,\"publicationDate\":\"2025-05-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034625003520\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625003520","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

隧道掘进机施工中围岩等级的准确分类是优化开挖参数、保证施工安全的关键。然而,传统的分类方法面临着挑战,特别是在适应复杂的地质条件和实现模型优化的高效率方面。为了解决这些问题,本研究提出了一种基于COA-XGBoost框架的识别方法。最初,使用状态判别功能对原始数据进行预处理以消除异常情况,例如由设备故障或维护活动引起的异常情况。为了更好地捕捉隧道掘进机与围岩之间的相互作用动力学,引入了一个新的特征参数——比能量。采用Relief算法对特征重要性进行评估,并对TBM运行参数进行权重分配,放大关键特征的作用,同时最大限度地减少冗余特征的干扰。随后,将Coati优化算法(COA)与XGBoost分类器集成,实现全局参数优化,提高模型性能。以银厝际寮工程输水隧洞段为例,对该方法进行了验证。结果表明,Relief算法通过特征加权有效地增强了模型对关键参数的识别能力。与传统方法相比,经过coa优化的XGBoost模型在识别精度上有了显著提高,准确率提高了约8%,尤其在复杂地质条件下(如IV类和V类围岩)表现出色。最终模型的准确率达到91.5%,召回率和F1得分分别达到91.5%和91.4%。此外,五次交叉验证突出了模型的鲁棒性和泛化能力。此外,基于shap的可解释性分析阐明了每个特征对分类结果的相对贡献,为实际的TBM参数优化提供了有价值的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
TBM rock mass classification using XGBoost and Interpretable Machine learning
Accurate classification of surrounding rock grades during Tunnel Boring Machine (TBM) operations is essential for optimizing excavation parameters and ensuring construction safety. However, traditional classification methods face challenges, particularly in adapting to complex geological conditions and achieving high efficiency in model optimization. To address these issues, this study proposes an identification methodology based on the COA-XGBoost framework. Initially, raw data is preprocessed to eliminate anomalous instances, such as those arising from equipment malfunctions or maintenance activities, using a state discrimination function. To better capture the interaction dynamics between the TBM and the surrounding rock, a novel feature parameter, specific energy, is introduced. The Relief algorithm is employed to evaluate feature importance and assign weights to TBM operational parameters, amplifying the role of critical features while minimizing interference from redundant ones. Subsequently, the Coati Optimization Algorithm (COA) is integrated with the XGBoost classifier, enabling global parameter optimization and enhancing the model’s performance. The proposed method was validated using data from the water conveyance tunnel section of the Yinchuojiliao Project. Results demonstrate that the Relief algorithm effectively enhances the model’s ability to identify critical parameters through feature weighting. The COA-optimized XGBoost model achieves a significant improvement in recognition accuracy compared to traditional approaches, with accuracy gains of approximately 8%, particularly excelling under complex geological conditions such as Class IV and Class V surrounding rocks. The final model achieves an accuracy of 91.5%, with recall and F1 scores reaching 91.5% and 91.4%, respectively. Additionally, five-fold cross-validation highlights the model’s robustness and generalization capabilities. Furthermore, a SHAP-based interpretability analysis clarifies the relative contributions of each feature to the classification outcome, offering valuable insights for practical TBM parameter optimization.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advanced Engineering Informatics
Advanced Engineering Informatics 工程技术-工程:综合
CiteScore
12.40
自引率
18.20%
发文量
292
审稿时长
45 days
期刊介绍: Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信