An interpretable machine learning model for optimization of prediction index gases in coal spontaneous combustion

IF 6.2 2区工程技术 Q1 ENGINEERING, MULTIDISCIPLINARY

alexandria engineering journal Pub Date : 2025-03-14 DOI:10.1016/j.aej.2025.02.104

Jiuling Zhang , Xu Zhou , Jinpeng Su , Yilong Xiao

{"title":"An interpretable machine learning model for optimization of prediction index gases in coal spontaneous combustion","authors":"Jiuling Zhang , Xu Zhou , Jinpeng Su , Yilong Xiao","doi":"10.1016/j.aej.2025.02.104","DOIUrl":null,"url":null,"abstract":"<div><div>Early warnings of coal spontaneous combustion (CSC) have become urgent problems for coal enterprises. Existing approaches are designed to enhance the accuracy of CSC prediction. Improving the interpretability of the model is another important issue besides improving the prediction accuracy. Therefore, an interpretable machine learning framework based on RF (Random Forest) and SHAP (SHapley Additive exPlanations) is proposed to optimize prediction index gases. The data obtained from temperature-programmed experiments using coal samples from #5, #7, #8, #9, and #12 coal seams in Fangezhuang Mine are implemented to verify the proposed framework. CO, O2/CO, CO/CO2, CO/O2, <math><mrow><mi>C</mi><mi>O</mi><mo>/</mo><mi>Δ</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math>, <math><mrow><mi>Δ</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math>, <math><mrow><mi>Δ</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub><mo>/</mo><mi>Δ</mi><mi>C</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math>, <math><mrow><msub><mrow><mi>C</mi></mrow><mrow><mn>2</mn></mrow></msub><msub><mrow><mi>H</mi></mrow><mrow><mn>6</mn></mrow></msub><mo>/</mo><mi>C</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math>, C2H4, CO2/O2 are selected, which is explained the rationality of the selected indicators using SHAP, practical experience, and related theories. Comparison of results using different machine learning models and different parameter optimization approaches showed the accuracy of the model affects the interpretation of the results. Finally, through the ablation experiment, the R² of RF, XGBoost, and Linear Regression model before feature removal was 0.98, 0.95 and 0.9, the model accuracy decreased significantly after the deletion, which showed the optimal prediction performance of RF, and the importance and validity of the selected indicators were verified using SHAP interpretation.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"122 ","pages":"Pages 268-278"},"PeriodicalIF":6.2000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"alexandria engineering journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110016825002819","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Early warnings of coal spontaneous combustion (CSC) have become urgent problems for coal enterprises. Existing approaches are designed to enhance the accuracy of CSC prediction. Improving the interpretability of the model is another important issue besides improving the prediction accuracy. Therefore, an interpretable machine learning framework based on RF (Random Forest) and SHAP (SHapley Additive exPlanations) is proposed to optimize prediction index gases. The data obtained from temperature-programmed experiments using coal samples from #5, #7, #8, #9, and #12 coal seams in Fangezhuang Mine are implemented to verify the proposed framework. CO, O₂/CO, CO/CO₂, CO/O₂,

C O / Δ O_{2}

Δ O_{2}

Δ O_{2} / Δ C O_{2}

C_{2} H_{6} / C O_{2}

, C₂H₄, CO₂/O₂ are selected, which is explained the rationality of the selected indicators using SHAP, practical experience, and related theories. Comparison of results using different machine learning models and different parameter optimization approaches showed the accuracy of the model affects the interpretation of the results. Finally, through the ablation experiment, the R² of RF, XGBoost, and Linear Regression model before feature removal was 0.98, 0.95 and 0.9, the model accuracy decreased significantly after the deletion, which showed the optimal prediction performance of RF, and the importance and validity of the selected indicators were verified using SHAP interpretation.

查看原文本刊更多论文

用于优化煤炭自燃预测指标气体的可解释机器学习模型

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

alexandria engineering journal Engineering-General Engineering

CiteScore

11.20

自引率

4.40%

发文量

1015

审稿时长

43 days

期刊介绍： Alexandria Engineering Journal is an international journal devoted to publishing high quality papers in the field of engineering and applied science. Alexandria Engineering Journal is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). The papers published in Alexandria Engineering Journal are grouped into five sections, according to the following classification: • Mechanical, Production, Marine and Textile Engineering • Electrical Engineering, Computer Science and Nuclear Engineering • Civil and Architecture Engineering • Chemical Engineering and Applied Sciences • Environmental Engineering