Yaoqi Nie , Qian Zhang , Lili Hou , Lijie Du , Xiaolong Zhao , Yujie Xue , ZhiCheng Lin , Xiuxiu Cao , Zixin Wang
{"title":"基于XGBoost和可解释机器学习的TBM岩体分类","authors":"Yaoqi Nie , Qian Zhang , Lili Hou , Lijie Du , Xiaolong Zhao , Yujie Xue , ZhiCheng Lin , Xiuxiu Cao , Zixin Wang","doi":"10.1016/j.aei.2025.103459","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate classification of surrounding rock grades during Tunnel Boring Machine (TBM) operations is essential for optimizing excavation parameters and ensuring construction safety. However, traditional classification methods face challenges, particularly in adapting to complex geological conditions and achieving high efficiency in model optimization. To address these issues, this study proposes an identification methodology based on the COA-XGBoost framework. Initially, raw data is preprocessed to eliminate anomalous instances, such as those arising from equipment malfunctions or maintenance activities, using a state discrimination function. To better capture the interaction dynamics between the TBM and the surrounding rock, a novel feature parameter, specific energy, is introduced. The Relief algorithm is employed to evaluate feature importance and assign weights to TBM operational parameters, amplifying the role of critical features while minimizing interference from redundant ones. Subsequently, the Coati Optimization Algorithm (COA) is integrated with the XGBoost classifier, enabling global parameter optimization and enhancing the model’s performance. The proposed method was validated using data from the water conveyance tunnel section of the Yinchuojiliao Project. Results demonstrate that the Relief algorithm effectively enhances the model’s ability to identify critical parameters through feature weighting. The COA-optimized XGBoost model achieves a significant improvement in recognition accuracy compared to traditional approaches, with accuracy gains of approximately 8%, particularly excelling under complex geological conditions such as Class IV and Class V surrounding rocks. The final model achieves an accuracy of 91.5%, with recall and F1 scores reaching 91.5% and 91.4%, respectively. Additionally, five-fold cross-validation highlights the model’s robustness and generalization capabilities. Furthermore, a SHAP-based interpretability analysis clarifies the relative contributions of each feature to the classification outcome, offering valuable insights for practical TBM parameter optimization.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"66 ","pages":"Article 103459"},"PeriodicalIF":9.9000,"publicationDate":"2025-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"TBM rock mass classification using XGBoost and Interpretable Machine learning\",\"authors\":\"Yaoqi Nie , Qian Zhang , Lili Hou , Lijie Du , Xiaolong Zhao , Yujie Xue , ZhiCheng Lin , Xiuxiu Cao , Zixin Wang\",\"doi\":\"10.1016/j.aei.2025.103459\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate classification of surrounding rock grades during Tunnel Boring Machine (TBM) operations is essential for optimizing excavation parameters and ensuring construction safety. However, traditional classification methods face challenges, particularly in adapting to complex geological conditions and achieving high efficiency in model optimization. To address these issues, this study proposes an identification methodology based on the COA-XGBoost framework. Initially, raw data is preprocessed to eliminate anomalous instances, such as those arising from equipment malfunctions or maintenance activities, using a state discrimination function. To better capture the interaction dynamics between the TBM and the surrounding rock, a novel feature parameter, specific energy, is introduced. The Relief algorithm is employed to evaluate feature importance and assign weights to TBM operational parameters, amplifying the role of critical features while minimizing interference from redundant ones. Subsequently, the Coati Optimization Algorithm (COA) is integrated with the XGBoost classifier, enabling global parameter optimization and enhancing the model’s performance. The proposed method was validated using data from the water conveyance tunnel section of the Yinchuojiliao Project. Results demonstrate that the Relief algorithm effectively enhances the model’s ability to identify critical parameters through feature weighting. The COA-optimized XGBoost model achieves a significant improvement in recognition accuracy compared to traditional approaches, with accuracy gains of approximately 8%, particularly excelling under complex geological conditions such as Class IV and Class V surrounding rocks. The final model achieves an accuracy of 91.5%, with recall and F1 scores reaching 91.5% and 91.4%, respectively. Additionally, five-fold cross-validation highlights the model’s robustness and generalization capabilities. Furthermore, a SHAP-based interpretability analysis clarifies the relative contributions of each feature to the classification outcome, offering valuable insights for practical TBM parameter optimization.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"66 \",\"pages\":\"Article 103459\"},\"PeriodicalIF\":9.9000,\"publicationDate\":\"2025-05-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034625003520\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625003520","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
TBM rock mass classification using XGBoost and Interpretable Machine learning
Accurate classification of surrounding rock grades during Tunnel Boring Machine (TBM) operations is essential for optimizing excavation parameters and ensuring construction safety. However, traditional classification methods face challenges, particularly in adapting to complex geological conditions and achieving high efficiency in model optimization. To address these issues, this study proposes an identification methodology based on the COA-XGBoost framework. Initially, raw data is preprocessed to eliminate anomalous instances, such as those arising from equipment malfunctions or maintenance activities, using a state discrimination function. To better capture the interaction dynamics between the TBM and the surrounding rock, a novel feature parameter, specific energy, is introduced. The Relief algorithm is employed to evaluate feature importance and assign weights to TBM operational parameters, amplifying the role of critical features while minimizing interference from redundant ones. Subsequently, the Coati Optimization Algorithm (COA) is integrated with the XGBoost classifier, enabling global parameter optimization and enhancing the model’s performance. The proposed method was validated using data from the water conveyance tunnel section of the Yinchuojiliao Project. Results demonstrate that the Relief algorithm effectively enhances the model’s ability to identify critical parameters through feature weighting. The COA-optimized XGBoost model achieves a significant improvement in recognition accuracy compared to traditional approaches, with accuracy gains of approximately 8%, particularly excelling under complex geological conditions such as Class IV and Class V surrounding rocks. The final model achieves an accuracy of 91.5%, with recall and F1 scores reaching 91.5% and 91.4%, respectively. Additionally, five-fold cross-validation highlights the model’s robustness and generalization capabilities. Furthermore, a SHAP-based interpretability analysis clarifies the relative contributions of each feature to the classification outcome, offering valuable insights for practical TBM parameter optimization.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.