Interpretable model for rockburst intensity prediction based on Shapley values-based Optuna-random forest

IF 8.2 1区 工程技术 Q1 ENGINEERING, CIVIL
Yaxi Shen , Shunchuan Wu , Yongbing Wang , Jiaxin Wang , Zhiquan Yang
{"title":"Interpretable model for rockburst intensity prediction based on Shapley values-based Optuna-random forest","authors":"Yaxi Shen ,&nbsp;Shunchuan Wu ,&nbsp;Yongbing Wang ,&nbsp;Jiaxin Wang ,&nbsp;Zhiquan Yang","doi":"10.1016/j.undsp.2024.09.002","DOIUrl":null,"url":null,"abstract":"<div><div>To address the limitation of traditional machine learning models in explaining the rockburst intensity prediction process, this study proposes an interpretable rockburst intensity prediction model. The model was developed using 350 sets of actual rockburst sample data to explore the impact of input metrics on the final rockburst intensity level. The collected data underwent pre-processing using the isolation forest algorithm and synthetic minority oversampling technique. The random forest model was optimized through 5-fold cross-validation and the Optuna framework, resulting in the establishment of an Optuna-random forest (Op-RF) model that generates decision rules through its internal decision tree, utilizing the properties of the random forest model. The model was further interpreted using the Shapley additive explanations algorithm, both locally and globally. The results demonstrate that the proposed model achieved an area under curve score of 0.984. In comparison to eight other machine learning models, the proposed Op-RF model demonstrated superior accuracy, precision, recall, and <em>F</em><sub>1</sub> score. The model provides a transparent explanation of the prediction process, linking impact characteristics to the final output. Additionally, a cloud deployment method for the rockburst intensity prediction model is provided and its effectiveness is demonstrated through engineering verification. The proposed model offers a new approach to the application of machine learning in rockburst intensity prediction.</div></div>","PeriodicalId":48505,"journal":{"name":"Underground Space","volume":"21 ","pages":"Pages 198-214"},"PeriodicalIF":8.2000,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Underground Space","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2467967424001089","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

Abstract

To address the limitation of traditional machine learning models in explaining the rockburst intensity prediction process, this study proposes an interpretable rockburst intensity prediction model. The model was developed using 350 sets of actual rockburst sample data to explore the impact of input metrics on the final rockburst intensity level. The collected data underwent pre-processing using the isolation forest algorithm and synthetic minority oversampling technique. The random forest model was optimized through 5-fold cross-validation and the Optuna framework, resulting in the establishment of an Optuna-random forest (Op-RF) model that generates decision rules through its internal decision tree, utilizing the properties of the random forest model. The model was further interpreted using the Shapley additive explanations algorithm, both locally and globally. The results demonstrate that the proposed model achieved an area under curve score of 0.984. In comparison to eight other machine learning models, the proposed Op-RF model demonstrated superior accuracy, precision, recall, and F1 score. The model provides a transparent explanation of the prediction process, linking impact characteristics to the final output. Additionally, a cloud deployment method for the rockburst intensity prediction model is provided and its effectiveness is demonstrated through engineering verification. The proposed model offers a new approach to the application of machine learning in rockburst intensity prediction.
基于 Shapley 值的 Optuna 随机森林的岩爆强度预测可解释模型
针对传统机器学习模型在解释岩爆强度预测过程中的局限性,本研究提出了一种可解释的岩爆强度预测模型。该模型使用 350 组实际岩爆样本数据进行开发,以探索输入指标对最终岩爆强度等级的影响。收集到的数据使用隔离林算法和合成少数超采样技术进行了预处理。通过 5 倍交叉验证和 Optuna 框架对随机森林模型进行了优化,最终建立了 Optuna-随机森林(Op-RF)模型,该模型利用随机森林模型的特性,通过内部决策树生成决策规则。利用沙普利加法解释算法对该模型进行了局部和全局的进一步解释。结果表明,所提出的模型的曲线下面积得分达到了 0.984。与其他 8 个机器学习模型相比,所提出的 Op-RF 模型在准确度、精确度、召回率和 F1 分数方面都表现出了优势。该模型透明地解释了预测过程,将影响特征与最终输出联系起来。此外,还提供了岩爆强度预测模型的云部署方法,并通过工程验证证明了其有效性。所提出的模型为机器学习在岩爆强度预测中的应用提供了一种新方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Underground Space
Underground Space ENGINEERING, CIVIL-
CiteScore
10.20
自引率
14.10%
发文量
71
审稿时长
63 days
期刊介绍: Underground Space is an open access international journal without article processing charges (APC) committed to serving as a scientific forum for researchers and practitioners in the field of underground engineering. The journal welcomes manuscripts that deal with original theories, methods, technologies, and important applications throughout the life-cycle of underground projects, including planning, design, operation and maintenance, disaster prevention, and demolition. The journal is particularly interested in manuscripts related to the latest development of smart underground engineering from the perspectives of resilience, resources saving, environmental friendliness, humanity, and artificial intelligence. The manuscripts are expected to have significant innovation and potential impact in the field of underground engineering, and should have clear association with or application in underground projects.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信