Unsupervised machine learning for data-driven classification of rock mass using drilling data: How can a data-driven system handle limitations in existing rock mass classification systems?
{"title":"Unsupervised machine learning for data-driven classification of rock mass using drilling data: How can a data-driven system handle limitations in existing rock mass classification systems?","authors":"T. F. Hansen, A. Aarset","doi":"arxiv-2405.02631","DOIUrl":null,"url":null,"abstract":"Rock mass classification systems are crucial for assessing stability and risk\nin underground construction globally and guiding support and excavation design.\nHowever, systems developed primarily in the 1970s lack access to modern\nhigh-resolution data and advanced statistical techniques, limiting their\neffectiveness as decision-support systems. Initially, we outline the\nlimitations observed in this context and later describe how a data-driven\nsystem, based on drilling data as detailed in this study, can overcome these\nlimitations. Using extracted statistical information from thousands of MWD-data\nvalues in one-meter sections of a full tunnel profile, thus working as a\nsignature of the rock mass, we have demonstrated that it is possible to form\nwell-defined clusters that can act as a foundational basis for various rock\nmass classification systems. We reduced the dimensionality of 48-value vectors\nusing nonlinear manifold learning techniques (UMAP) and linear principal\ncomponent analysis (PCA) to enhance clustering. Unsupervised machine learning\nmethods (HDBSCAN, Agglomerative Clustering, K-means) were employed to cluster\nthe data, with hyperparameters optimised through multi-objective Bayesian\noptimisation for effective clustering. Using domain knowledge, we experienced\nimproved clustering and system tuning opportunities in adding extra features to\ncore clusters of MWD-data. We structured and correlated these clusters with\nphysical rock mass properties, including labels of rock type and rock quality,\nand analysed cumulative distributions of key MWD-parameters for rock mass\nassessment to determine if clusters meaningfully differentiate rock masses. The\nability of MWD data to form distinct rock mass clusters suggests substantial\npotential for future classification systems grounded in this objective,\ndata-driven methodology, free from human bias.","PeriodicalId":501062,"journal":{"name":"arXiv - CS - Systems and Control","volume":"16 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Systems and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.02631","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Rock mass classification systems are crucial for assessing stability and risk
in underground construction globally and guiding support and excavation design.
However, systems developed primarily in the 1970s lack access to modern
high-resolution data and advanced statistical techniques, limiting their
effectiveness as decision-support systems. Initially, we outline the
limitations observed in this context and later describe how a data-driven
system, based on drilling data as detailed in this study, can overcome these
limitations. Using extracted statistical information from thousands of MWD-data
values in one-meter sections of a full tunnel profile, thus working as a
signature of the rock mass, we have demonstrated that it is possible to form
well-defined clusters that can act as a foundational basis for various rock
mass classification systems. We reduced the dimensionality of 48-value vectors
using nonlinear manifold learning techniques (UMAP) and linear principal
component analysis (PCA) to enhance clustering. Unsupervised machine learning
methods (HDBSCAN, Agglomerative Clustering, K-means) were employed to cluster
the data, with hyperparameters optimised through multi-objective Bayesian
optimisation for effective clustering. Using domain knowledge, we experienced
improved clustering and system tuning opportunities in adding extra features to
core clusters of MWD-data. We structured and correlated these clusters with
physical rock mass properties, including labels of rock type and rock quality,
and analysed cumulative distributions of key MWD-parameters for rock mass
assessment to determine if clusters meaningfully differentiate rock masses. The
ability of MWD data to form distinct rock mass clusters suggests substantial
potential for future classification systems grounded in this objective,
data-driven methodology, free from human bias.