A confident learning-based support vector machine for robust ground classification in noisy label environments

IF 6.7 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY
Xin-Yue Zhang , Xiao-Ping Zhang , Hong-Gan Yu , Quan-Sheng Liu
{"title":"A confident learning-based support vector machine for robust ground classification in noisy label environments","authors":"Xin-Yue Zhang ,&nbsp;Xiao-Ping Zhang ,&nbsp;Hong-Gan Yu ,&nbsp;Quan-Sheng Liu","doi":"10.1016/j.tust.2024.106128","DOIUrl":null,"url":null,"abstract":"<div><div>Geological labels obtained from field exploration have potential errors due to technique limitations and subjective interference, leading to noisy labels when developing ground-machine interaction models for TBM tunneling. The present study proposes a novel confident learning-based support vector machine (CL-SVM) to eliminate label noise, thereby improving the accuracy and credibility of ground classification. The proposed model optimizes confidence values for each label and recognizes those with low confidence values as potential noise. Its effectiveness and superiority are confirmed through a noise test. The results indicate that the maximum acceptable noise ratio of the CL-SVM is 35%, while that of the conventional SVM is only 10%. In addition, the CL-SVM consistently emerges as a superior performer compared to the SVM in noisy label environments. The CL-SVM is further verified through its application on a class-imbalanced dataset collected from a metro tunnel project in Wuhan, China. Here, the accuracy metric <em>F1-score</em> for the most noise-interfered class is significantly improved from 0.7273 to 0.88. To enhance the model’s practical value, a confidence criterion is established to evaluate the credibility of individual predictions, which requires reliable predictions to have higher confidence values than specified thresholds. Without prior knowledge of true sample labels, this criterion distinguishes mispredictions from correct predictions with a remarkable precision of 99.08%. In summary, the proposed CL-SVM exhibits significantly better robustness to noisy labels than conventional models, demonstrating great potential for ground perception in tunnel projects.</div></div>","PeriodicalId":49414,"journal":{"name":"Tunnelling and Underground Space Technology","volume":null,"pages":null},"PeriodicalIF":6.7000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tunnelling and Underground Space Technology","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0886779824005467","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Geological labels obtained from field exploration have potential errors due to technique limitations and subjective interference, leading to noisy labels when developing ground-machine interaction models for TBM tunneling. The present study proposes a novel confident learning-based support vector machine (CL-SVM) to eliminate label noise, thereby improving the accuracy and credibility of ground classification. The proposed model optimizes confidence values for each label and recognizes those with low confidence values as potential noise. Its effectiveness and superiority are confirmed through a noise test. The results indicate that the maximum acceptable noise ratio of the CL-SVM is 35%, while that of the conventional SVM is only 10%. In addition, the CL-SVM consistently emerges as a superior performer compared to the SVM in noisy label environments. The CL-SVM is further verified through its application on a class-imbalanced dataset collected from a metro tunnel project in Wuhan, China. Here, the accuracy metric F1-score for the most noise-interfered class is significantly improved from 0.7273 to 0.88. To enhance the model’s practical value, a confidence criterion is established to evaluate the credibility of individual predictions, which requires reliable predictions to have higher confidence values than specified thresholds. Without prior knowledge of true sample labels, this criterion distinguishes mispredictions from correct predictions with a remarkable precision of 99.08%. In summary, the proposed CL-SVM exhibits significantly better robustness to noisy labels than conventional models, demonstrating great potential for ground perception in tunnel projects.
基于自信学习的支持向量机,用于在噪声标签环境中进行稳健的地面分类
由于技术限制和主观干扰,野外勘探获得的地质标签可能存在误差,从而导致在为 TBM 隧道开发地机相互作用模型时出现噪声标签。本研究提出了一种新颖的基于置信学习的支持向量机(CL-SVM)来消除标签噪声,从而提高地面分类的准确性和可信度。所提出的模型优化了每个标签的置信度值,并将置信度值低的标签识别为潜在噪声。通过噪声测试证实了该模型的有效性和优越性。结果表明,CL-SVM 可接受的最大噪声比为 35%,而传统 SVM 的噪声比仅为 10%。此外,与 SVM 相比,CL-SVM 在嘈杂的标签环境中始终表现出色。通过将 CL-SVM 应用于从中国武汉地铁隧道项目中收集的类不平衡数据集,我们进一步验证了 CL-SVM。在该数据集上,受噪声干扰最严重类别的准确度指标 F1 分数从 0.7273 显著提高到 0.88。为了提高模型的实用价值,建立了一个置信度标准来评估单个预测的可信度,该标准要求可靠的预测具有高于指定阈值的置信度值。在事先不知道真实样本标签的情况下,该标准可以区分错误预测和正确预测,精确度高达 99.08%。总之,所提出的 CL-SVM 对噪声标签的鲁棒性明显优于传统模型,在隧道工程的地面感知方面显示出巨大的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Tunnelling and Underground Space Technology
Tunnelling and Underground Space Technology 工程技术-工程:土木
CiteScore
11.90
自引率
18.80%
发文量
454
审稿时长
10.8 months
期刊介绍: Tunnelling and Underground Space Technology is an international journal which publishes authoritative articles encompassing the development of innovative uses of underground space and the results of high quality research into improved, more cost-effective techniques for the planning, geo-investigation, design, construction, operation and maintenance of underground and earth-sheltered structures. The journal provides an effective vehicle for the improved worldwide exchange of information on developments in underground technology - and the experience gained from its use - and is strongly committed to publishing papers on the interdisciplinary aspects of creating, planning, and regulating underground space.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信