Semi-supervised ensemble model for TBM rock mass classification

IF 6.7 1区工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY

Tunnelling and Underground Space Technology Pub Date : 2025-04-13 DOI:10.1016/j.tust.2025.106632

Shaoxiang Zeng , Yuanqin Tao , Honglei Sun , Yu Wang

{"title":"Semi-supervised ensemble model for TBM rock mass classification","authors":"Shaoxiang Zeng , Yuanqin Tao , Honglei Sun , Yu Wang","doi":"10.1016/j.tust.2025.106632","DOIUrl":null,"url":null,"abstract":"<div><div>Rock mass classification is crucial for tunnel construction with tunnel boring machines (TBMs). Due to the limited measured rock data, existing machine learning-based classification methods often use empirically derived data to supplement the rock mass datasets for model training, which compromises the classification reliability. To address this issue, this study proposes a semi-supervised soft-voting ensemble (semi-supervised SVE) model for TBM rock mass classification, utilizing limited labeled data (i.e., data including measured rock mass grades) and extensive unlabeled data (i.e., data lacking measured rock mass grades). The model is initially pre-trained with limited labeled data, which include rock mass grades derived from a national code based on rock mass integrity and physical properties. The pre-trained model is then used to identify the rock mass grades of unlabeled data to produce pseudo labels. Multiple training iterations are conducted to incorporate the pseudo-labeled data, continuously expanding the training dataset of the proposed model. By integrating the boosting and bagging ensemble strategies through the soft voting method, the robustness of the proposed model in rock mass classification is enhanced. A high-quality dataset from the Yinchuo-Jiliao Project in China is used for illustration. The proposed model outperforms alternative supervised and unsupervised models on both labeled and unlabeled data. The superior performance on labeled data is directly evident from evaluation indices including precision, recall, and F1 score, whereas its effectiveness on unlabeled data is demonstrated indirectly through rock-breaking indices. A 5-fold random cross-validation shows that the soft-voting ensemble classifier is more robust than individual classifiers. In addition, the confidence level of the proposed model matches actual classification accuracy, providing useful insights into model uncertainty for decision-makers.</div></div>","PeriodicalId":49414,"journal":{"name":"Tunnelling and Underground Space Technology","volume":"162 ","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tunnelling and Underground Space Technology","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0886779825002706","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Rock mass classification is crucial for tunnel construction with tunnel boring machines (TBMs). Due to the limited measured rock data, existing machine learning-based classification methods often use empirically derived data to supplement the rock mass datasets for model training, which compromises the classification reliability. To address this issue, this study proposes a semi-supervised soft-voting ensemble (semi-supervised SVE) model for TBM rock mass classification, utilizing limited labeled data (i.e., data including measured rock mass grades) and extensive unlabeled data (i.e., data lacking measured rock mass grades). The model is initially pre-trained with limited labeled data, which include rock mass grades derived from a national code based on rock mass integrity and physical properties. The pre-trained model is then used to identify the rock mass grades of unlabeled data to produce pseudo labels. Multiple training iterations are conducted to incorporate the pseudo-labeled data, continuously expanding the training dataset of the proposed model. By integrating the boosting and bagging ensemble strategies through the soft voting method, the robustness of the proposed model in rock mass classification is enhanced. A high-quality dataset from the Yinchuo-Jiliao Project in China is used for illustration. The proposed model outperforms alternative supervised and unsupervised models on both labeled and unlabeled data. The superior performance on labeled data is directly evident from evaluation indices including precision, recall, and F1 score, whereas its effectiveness on unlabeled data is demonstrated indirectly through rock-breaking indices. A 5-fold random cross-validation shows that the soft-voting ensemble classifier is more robust than individual classifiers. In addition, the confidence level of the proposed model matches actual classification accuracy, providing useful insights into model uncertainty for decision-makers.

查看原文本刊更多论文

TBM岩体分类的半监督系综模型

岩体分类是隧道掘进机隧道施工的关键。由于岩石实测数据有限，现有的基于机器学习的分类方法往往使用经验导出的数据来补充岩体数据集进行模型训练，从而降低了分类的可靠性。为了解决这一问题，本研究提出了一种用于TBM岩体分类的半监督软投票集成（半监督SVE）模型，该模型利用有限的标记数据（即包括实测岩体等级的数据）和大量未标记数据（即缺乏实测岩体等级的数据）。该模型最初使用有限的标记数据进行预训练，其中包括基于岩体完整性和物理性质的国家代码得出的岩体等级。然后使用预训练模型识别未标记数据的岩体等级，生成伪标签。通过多次训练迭代来整合伪标记数据，不断扩展所提模型的训练数据集。通过软投票方法将助推和套袋策略相结合，增强了模型在岩体分类中的鲁棒性。本文使用了中国银丘-吉利奥项目的高质量数据集进行说明。所提出的模型在标记和未标记数据上都优于替代的监督和无监督模型。对标记数据的卓越性能直接体现在精度、召回率和F1分数等评价指标上，而对未标记数据的有效性则通过破岩指标间接体现。5倍随机交叉验证表明，软投票集成分类器比单个分类器更鲁棒。此外，所提出的模型的置信水平与实际分类精度相匹配，为决策者提供了对模型不确定性的有用见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Tunnelling and Underground Space Technology 工程技术-工程：土木

CiteScore

11.90

自引率

18.80%

发文量

454

审稿时长

10.8 months

期刊介绍： Tunnelling and Underground Space Technology is an international journal which publishes authoritative articles encompassing the development of innovative uses of underground space and the results of high quality research into improved, more cost-effective techniques for the planning, geo-investigation, design, construction, operation and maintenance of underground and earth-sheltered structures. The journal provides an effective vehicle for the improved worldwide exchange of information on developments in underground technology - and the experience gained from its use - and is strongly committed to publishing papers on the interdisciplinary aspects of creating, planning, and regulating underground space.