基于机器学习- shap的煤矿岩爆危险性预测可解释性分析

IF 2.1 3区 地球科学 Q2 GEOSCIENCES, MULTIDISCIPLINARY
Mei Hongjia , Wang Yanbing , Zhang Xiangliang , Wang Jianlong , Qi Gaowei , Han Yingying , Zheng Wenjing
{"title":"基于机器学习- shap的煤矿岩爆危险性预测可解释性分析","authors":"Mei Hongjia ,&nbsp;Wang Yanbing ,&nbsp;Zhang Xiangliang ,&nbsp;Wang Jianlong ,&nbsp;Qi Gaowei ,&nbsp;Han Yingying ,&nbsp;Zheng Wenjing","doi":"10.1016/j.jappgeo.2025.105982","DOIUrl":null,"url":null,"abstract":"<div><div>The traditional prediction model has significant limitations in the risk assessment of rockburst in deep coal mines, which is mainly reflected in its insufficient ability to analyze complex geological conditions and mining disturbance factors. These models are usually difficult to accurately capture the impact of various environmental factors on rockburst risk, lack the dynamic assessment ability of the importance of characteristics, and the traditional models are difficult to formulate targeted prevention measures, which can not meet the refined needs of deep coal mine safety production. This limitation not only reduces the reliability of prediction results, but also limits the effectiveness of control measures. Therefore, in view of the problems that the traditional prediction model lacks the corresponding interpretation and analysis of rockburst in deep coal mines, cannot distinguish the characteristic importance of various environmental factors, and is difficult to carry out targeted prevention and control measures, In view of the problems that the traditional prediction model lacks the corresponding interpretation and analysis of the rockburst in deep coal mines, cannot distinguish the characteristic importance of various environmental factors, and is difficult to carry out targeted prevention and control measures, this paper uses three algorithms in machine learning, namely random forest (Random Forest), support vector regression (Support Vector Regression), and extreme gradient lifting (Extreme Gradient Boosting), and takes the daily footage, the number of coal seam pressure relief holes, the number of drill cuttings detection holes, the number of bottom coal pressure relief holes, the number of bottom coal blasting holes, the total energy of microseismic, the frequency of microseismic, the maximum microseismic energy, the maximum stress, the amount of drill cuttings, and the depth of drill cuttings as 12 eigenvalues. After data processing, through cross validation and superparameter adjustment,the mean square error (MSE), root mean square error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE) and The algorithm with the best regression performance is comprehensively screened out under the conditions of R<sup>2</sup> and STD deviation. The corresponding interpreter is used to analyze the interpretability of the algorithm and SHAP. Finally, the model is applied in the field to verify its feasibility. The research results indicate that the XGBoost algorithm performs the best on this dataset, with a sample MSE of 0.0001, RMSE of 0.0024, MAE of 0.0011, MAPE of 1.3164 %, R<sup>2</sup> of 0.9930, and Std Deviation of 0.0021 the XGBoost-SHAP model, the feature importance sequence was obtained using global interpretation, and the feature value with the highest weight in this environment was selected; By combining local explanations to make the inference and prediction processes within the model transparent, the black box problem of machine learning has been solved; The model can better capture the real-time impact of environmental factors on impact risk, and can provide a scientific basis for dynamic assessment of impact risk in different environments. The dynamic assessment and targeted prevention and control decision support framework for rockburst risk can provide important guidance for the targeted prevention and control of rockbursts at the site.</div></div>","PeriodicalId":54882,"journal":{"name":"Journal of Applied Geophysics","volume":"243 ","pages":"Article 105982"},"PeriodicalIF":2.1000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interpretability analysis of rockburst risk prediction in coal mines based on machine learning-SHAP\",\"authors\":\"Mei Hongjia ,&nbsp;Wang Yanbing ,&nbsp;Zhang Xiangliang ,&nbsp;Wang Jianlong ,&nbsp;Qi Gaowei ,&nbsp;Han Yingying ,&nbsp;Zheng Wenjing\",\"doi\":\"10.1016/j.jappgeo.2025.105982\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The traditional prediction model has significant limitations in the risk assessment of rockburst in deep coal mines, which is mainly reflected in its insufficient ability to analyze complex geological conditions and mining disturbance factors. These models are usually difficult to accurately capture the impact of various environmental factors on rockburst risk, lack the dynamic assessment ability of the importance of characteristics, and the traditional models are difficult to formulate targeted prevention measures, which can not meet the refined needs of deep coal mine safety production. This limitation not only reduces the reliability of prediction results, but also limits the effectiveness of control measures. Therefore, in view of the problems that the traditional prediction model lacks the corresponding interpretation and analysis of rockburst in deep coal mines, cannot distinguish the characteristic importance of various environmental factors, and is difficult to carry out targeted prevention and control measures, In view of the problems that the traditional prediction model lacks the corresponding interpretation and analysis of the rockburst in deep coal mines, cannot distinguish the characteristic importance of various environmental factors, and is difficult to carry out targeted prevention and control measures, this paper uses three algorithms in machine learning, namely random forest (Random Forest), support vector regression (Support Vector Regression), and extreme gradient lifting (Extreme Gradient Boosting), and takes the daily footage, the number of coal seam pressure relief holes, the number of drill cuttings detection holes, the number of bottom coal pressure relief holes, the number of bottom coal blasting holes, the total energy of microseismic, the frequency of microseismic, the maximum microseismic energy, the maximum stress, the amount of drill cuttings, and the depth of drill cuttings as 12 eigenvalues. After data processing, through cross validation and superparameter adjustment,the mean square error (MSE), root mean square error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE) and The algorithm with the best regression performance is comprehensively screened out under the conditions of R<sup>2</sup> and STD deviation. The corresponding interpreter is used to analyze the interpretability of the algorithm and SHAP. Finally, the model is applied in the field to verify its feasibility. The research results indicate that the XGBoost algorithm performs the best on this dataset, with a sample MSE of 0.0001, RMSE of 0.0024, MAE of 0.0011, MAPE of 1.3164 %, R<sup>2</sup> of 0.9930, and Std Deviation of 0.0021 the XGBoost-SHAP model, the feature importance sequence was obtained using global interpretation, and the feature value with the highest weight in this environment was selected; By combining local explanations to make the inference and prediction processes within the model transparent, the black box problem of machine learning has been solved; The model can better capture the real-time impact of environmental factors on impact risk, and can provide a scientific basis for dynamic assessment of impact risk in different environments. The dynamic assessment and targeted prevention and control decision support framework for rockburst risk can provide important guidance for the targeted prevention and control of rockbursts at the site.</div></div>\",\"PeriodicalId\":54882,\"journal\":{\"name\":\"Journal of Applied Geophysics\",\"volume\":\"243 \",\"pages\":\"Article 105982\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Applied Geophysics\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0926985125003635\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Geophysics","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926985125003635","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

传统的预测模型在深部煤矿岩爆风险评估中存在明显的局限性,主要表现在对复杂地质条件和开采扰动因素的分析能力不足。这些模型通常难以准确捕捉各种环境因素对岩爆危险性的影响,缺乏重要特征的动态评价能力,传统模型难以制定有针对性的防治措施,不能满足深部煤矿安全生产的精细化需求。这种局限性不仅降低了预测结果的可靠性,也限制了控制措施的有效性。因此,针对传统预测模型缺乏对深部煤矿岩爆的相应解释与分析、不能区分各种环境因素的特征重要性、难以开展有针对性的防治措施等问题,针对传统预测模型缺乏对深部煤矿岩爆的相应解释与分析的问题,无法区分各种环境因素的特征重要性,难以开展有针对性的防控措施,本文采用机器学习中的随机森林(random forest)、支持向量回归(support vector regression)和极端梯度提升(extreme gradient Boosting)三种算法,取日进尺、煤层卸压孔数、钻屑探测孔数、将底煤卸压孔数、底煤爆破孔数、微震总能量、微震频率、最大微震能量、最大应力、钻屑量、钻屑深度作为12个特征值。数据处理后,通过交叉验证和超参数调整,综合筛选出R2和STD偏差条件下均方误差(MSE)、均方根误差(RMSE)、平均绝对误差(MAE)、平均绝对百分比误差(MAPE)和回归性能最好的算法。使用相应的解释器来分析算法和SHAP的可解释性。最后,将该模型应用于现场,验证了其可行性。研究结果表明,XGBoost算法在该数据集上表现最好,XGBoost- shap模型的样本MSE为0.0001,RMSE为0.0024,MAE为0.0011,MAPE为1.3164%,R2为0.9930,Std Deviation为0.0021,采用全局解释获得特征重要性序列,并选择该环境下权重最高的特征值;通过结合局部解释,使模型内的推理和预测过程透明化,解决了机器学习的黑箱问题;该模型能较好地捕捉环境因素对影响风险的实时影响,可为不同环境下的影响风险动态评估提供科学依据。岩爆风险动态评价与针对性防治决策支持框架可为现场针对性防治岩爆提供重要指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Interpretability analysis of rockburst risk prediction in coal mines based on machine learning-SHAP
The traditional prediction model has significant limitations in the risk assessment of rockburst in deep coal mines, which is mainly reflected in its insufficient ability to analyze complex geological conditions and mining disturbance factors. These models are usually difficult to accurately capture the impact of various environmental factors on rockburst risk, lack the dynamic assessment ability of the importance of characteristics, and the traditional models are difficult to formulate targeted prevention measures, which can not meet the refined needs of deep coal mine safety production. This limitation not only reduces the reliability of prediction results, but also limits the effectiveness of control measures. Therefore, in view of the problems that the traditional prediction model lacks the corresponding interpretation and analysis of rockburst in deep coal mines, cannot distinguish the characteristic importance of various environmental factors, and is difficult to carry out targeted prevention and control measures, In view of the problems that the traditional prediction model lacks the corresponding interpretation and analysis of the rockburst in deep coal mines, cannot distinguish the characteristic importance of various environmental factors, and is difficult to carry out targeted prevention and control measures, this paper uses three algorithms in machine learning, namely random forest (Random Forest), support vector regression (Support Vector Regression), and extreme gradient lifting (Extreme Gradient Boosting), and takes the daily footage, the number of coal seam pressure relief holes, the number of drill cuttings detection holes, the number of bottom coal pressure relief holes, the number of bottom coal blasting holes, the total energy of microseismic, the frequency of microseismic, the maximum microseismic energy, the maximum stress, the amount of drill cuttings, and the depth of drill cuttings as 12 eigenvalues. After data processing, through cross validation and superparameter adjustment,the mean square error (MSE), root mean square error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE) and The algorithm with the best regression performance is comprehensively screened out under the conditions of R2 and STD deviation. The corresponding interpreter is used to analyze the interpretability of the algorithm and SHAP. Finally, the model is applied in the field to verify its feasibility. The research results indicate that the XGBoost algorithm performs the best on this dataset, with a sample MSE of 0.0001, RMSE of 0.0024, MAE of 0.0011, MAPE of 1.3164 %, R2 of 0.9930, and Std Deviation of 0.0021 the XGBoost-SHAP model, the feature importance sequence was obtained using global interpretation, and the feature value with the highest weight in this environment was selected; By combining local explanations to make the inference and prediction processes within the model transparent, the black box problem of machine learning has been solved; The model can better capture the real-time impact of environmental factors on impact risk, and can provide a scientific basis for dynamic assessment of impact risk in different environments. The dynamic assessment and targeted prevention and control decision support framework for rockburst risk can provide important guidance for the targeted prevention and control of rockbursts at the site.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Applied Geophysics
Journal of Applied Geophysics 地学-地球科学综合
CiteScore
3.60
自引率
10.00%
发文量
274
审稿时长
4 months
期刊介绍: The Journal of Applied Geophysics with its key objective of responding to pertinent and timely needs, places particular emphasis on methodological developments and innovative applications of geophysical techniques for addressing environmental, engineering, and hydrological problems. Related topical research in exploration geophysics and in soil and rock physics is also covered by the Journal of Applied Geophysics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信