{"title":"An interpretable machine learning model for optimization of prediction index gases in coal spontaneous combustion","authors":"Jiuling Zhang , Xu Zhou , Jinpeng Su , Yilong Xiao","doi":"10.1016/j.aej.2025.02.104","DOIUrl":null,"url":null,"abstract":"<div><div>Early warnings of coal spontaneous combustion (CSC) have become urgent problems for coal enterprises. Existing approaches are designed to enhance the accuracy of CSC prediction. Improving the interpretability of the model is another important issue besides improving the prediction accuracy. Therefore, an interpretable machine learning framework based on RF (Random Forest) and SHAP (SHapley Additive exPlanations) is proposed to optimize prediction index gases. The data obtained from temperature-programmed experiments using coal samples from #5, #7, #8, #9, and #12 coal seams in Fangezhuang Mine are implemented to verify the proposed framework. <em>CO</em>, <em>O</em><sub><em>2</em></sub><em>/CO</em>, <em>CO/CO</em><sub><em>2</em></sub>, <em>CO/O</em><sub><em>2</em></sub>, <span><math><mrow><mi>C</mi><mi>O</mi><mo>/</mo><mi>Δ</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math></span>, <span><math><mrow><mi>Δ</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math></span>, <span><math><mrow><mi>Δ</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub><mo>/</mo><mi>Δ</mi><mi>C</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math></span>, <span><math><mrow><msub><mrow><mi>C</mi></mrow><mrow><mn>2</mn></mrow></msub><msub><mrow><mi>H</mi></mrow><mrow><mn>6</mn></mrow></msub><mo>/</mo><mi>C</mi><msub><mrow><mi>O</mi></mrow><mrow><mn>2</mn></mrow></msub></mrow></math></span>, <em>C</em><sub><em>2</em></sub><em>H</em><sub><em>4</em></sub>, <em>CO</em><sub><em>2</em></sub><em>/O</em><sub><em>2</em></sub> are selected, which is explained the rationality of the selected indicators using SHAP, practical experience, and related theories. Comparison of results using different machine learning models and different parameter optimization approaches showed the accuracy of the model affects the interpretation of the results. Finally, through the ablation experiment, the R² of RF, XGBoost, and Linear Regression model before feature removal was 0.98, 0.95 and 0.9, the model accuracy decreased significantly after the deletion, which showed the optimal prediction performance of RF, and the importance and validity of the selected indicators were verified using SHAP interpretation.</div></div>","PeriodicalId":7484,"journal":{"name":"alexandria engineering journal","volume":"122 ","pages":"Pages 268-278"},"PeriodicalIF":6.2000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"alexandria engineering journal","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110016825002819","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Early warnings of coal spontaneous combustion (CSC) have become urgent problems for coal enterprises. Existing approaches are designed to enhance the accuracy of CSC prediction. Improving the interpretability of the model is another important issue besides improving the prediction accuracy. Therefore, an interpretable machine learning framework based on RF (Random Forest) and SHAP (SHapley Additive exPlanations) is proposed to optimize prediction index gases. The data obtained from temperature-programmed experiments using coal samples from #5, #7, #8, #9, and #12 coal seams in Fangezhuang Mine are implemented to verify the proposed framework. CO, O2/CO, CO/CO2, CO/O2, , , , , C2H4, CO2/O2 are selected, which is explained the rationality of the selected indicators using SHAP, practical experience, and related theories. Comparison of results using different machine learning models and different parameter optimization approaches showed the accuracy of the model affects the interpretation of the results. Finally, through the ablation experiment, the R² of RF, XGBoost, and Linear Regression model before feature removal was 0.98, 0.95 and 0.9, the model accuracy decreased significantly after the deletion, which showed the optimal prediction performance of RF, and the importance and validity of the selected indicators were verified using SHAP interpretation.
期刊介绍:
Alexandria Engineering Journal is an international journal devoted to publishing high quality papers in the field of engineering and applied science. Alexandria Engineering Journal is cited in the Engineering Information Services (EIS) and the Chemical Abstracts (CA). The papers published in Alexandria Engineering Journal are grouped into five sections, according to the following classification:
• Mechanical, Production, Marine and Textile Engineering
• Electrical Engineering, Computer Science and Nuclear Engineering
• Civil and Architecture Engineering
• Chemical Engineering and Applied Sciences
• Environmental Engineering