Unsupervised machine learning for data-driven classification of rock mass using drilling data: How can a data-driven system handle limitations in existing rock mass classification systems?

arXiv - CS - Systems and Control Pub Date : 2024-05-04 DOI:arxiv-2405.02631

T. F. Hansen, A. Aarset

{"title":"Unsupervised machine learning for data-driven classification of rock mass using drilling data: How can a data-driven system handle limitations in existing rock mass classification systems?","authors":"T. F. Hansen, A. Aarset","doi":"arxiv-2405.02631","DOIUrl":null,"url":null,"abstract":"Rock mass classification systems are crucial for assessing stability and risk\nin underground construction globally and guiding support and excavation design.\nHowever, systems developed primarily in the 1970s lack access to modern\nhigh-resolution data and advanced statistical techniques, limiting their\neffectiveness as decision-support systems. Initially, we outline the\nlimitations observed in this context and later describe how a data-driven\nsystem, based on drilling data as detailed in this study, can overcome these\nlimitations. Using extracted statistical information from thousands of MWD-data\nvalues in one-meter sections of a full tunnel profile, thus working as a\nsignature of the rock mass, we have demonstrated that it is possible to form\nwell-defined clusters that can act as a foundational basis for various rock\nmass classification systems. We reduced the dimensionality of 48-value vectors\nusing nonlinear manifold learning techniques (UMAP) and linear principal\ncomponent analysis (PCA) to enhance clustering. Unsupervised machine learning\nmethods (HDBSCAN, Agglomerative Clustering, K-means) were employed to cluster\nthe data, with hyperparameters optimised through multi-objective Bayesian\noptimisation for effective clustering. Using domain knowledge, we experienced\nimproved clustering and system tuning opportunities in adding extra features to\ncore clusters of MWD-data. We structured and correlated these clusters with\nphysical rock mass properties, including labels of rock type and rock quality,\nand analysed cumulative distributions of key MWD-parameters for rock mass\nassessment to determine if clusters meaningfully differentiate rock masses. The\nability of MWD data to form distinct rock mass clusters suggests substantial\npotential for future classification systems grounded in this objective,\ndata-driven methodology, free from human bias.","PeriodicalId":501062,"journal":{"name":"arXiv - CS - Systems and Control","volume":"16 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Systems and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.02631","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Rock mass classification systems are crucial for assessing stability and risk in underground construction globally and guiding support and excavation design. However, systems developed primarily in the 1970s lack access to modern high-resolution data and advanced statistical techniques, limiting their effectiveness as decision-support systems. Initially, we outline the limitations observed in this context and later describe how a data-driven system, based on drilling data as detailed in this study, can overcome these limitations. Using extracted statistical information from thousands of MWD-data values in one-meter sections of a full tunnel profile, thus working as a signature of the rock mass, we have demonstrated that it is possible to form well-defined clusters that can act as a foundational basis for various rock mass classification systems. We reduced the dimensionality of 48-value vectors using nonlinear manifold learning techniques (UMAP) and linear principal component analysis (PCA) to enhance clustering. Unsupervised machine learning methods (HDBSCAN, Agglomerative Clustering, K-means) were employed to cluster the data, with hyperparameters optimised through multi-objective Bayesian optimisation for effective clustering. Using domain knowledge, we experienced improved clustering and system tuning opportunities in adding extra features to core clusters of MWD-data. We structured and correlated these clusters with physical rock mass properties, including labels of rock type and rock quality, and analysed cumulative distributions of key MWD-parameters for rock mass assessment to determine if clusters meaningfully differentiate rock masses. The ability of MWD data to form distinct rock mass clusters suggests substantial potential for future classification systems grounded in this objective, data-driven methodology, free from human bias.

查看原文本刊更多论文

利用钻探数据进行岩体数据驱动分类的无监督机器学习：数据驱动系统如何处理现有岩体分类系统的局限性？

岩体分类系统对于评估全球地下工程的稳定性和风险以及指导支护和挖掘设计至关重要。然而，主要在 20 世纪 70 年代开发的系统缺乏现代高分辨率数据和先进的统计技术，限制了其作为决策支持系统的有效性。首先，我们概述了在这种情况下观察到的局限性，随后介绍了本研究中详细介绍的基于钻探数据的数据驱动系统如何克服这些局限性。我们利用从整个隧道剖面一米断面的数千个 MWD 数据值中提取的统计信息作为岩体的特征，证明有可能形成定义明确的岩群，作为各种岩体分类系统的基础。我们利用非线性流形学习技术（UMAP）和线性主成分分析（PCA）降低了 48 值向量的维度，以增强聚类效果。我们采用无监督机器学习方法（HDBSCAN、聚合聚类、K-means）对数据进行聚类，并通过多目标贝叶斯优化法优化超参数，以实现有效聚类。利用领域知识，我们在为 MWD 数据的核心聚类添加额外特征时，体验到了聚类和系统调整的改进机会。我们将这些聚类与岩体物理属性（包括岩石类型和岩石质量标签）进行了结构化和关联，并分析了用于岩体评估的关键 MWD 参数的累积分布，以确定聚类是否能有效区分岩体。MWD数据能够形成独特的岩体聚类，这为未来基于这种客观、数据驱动、不受人为偏见影响的方法建立分类系统提供了巨大的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Systems and Control

自引率

0.00%

发文量