Application research of SSA-RF model in predicting the height of water-conducting fracture zone in deep and thick coal seams

IF 4.2
Li Wang , Jiming Zhu , Zhongchang Wang
{"title":"Application research of SSA-RF model in predicting the height of water-conducting fracture zone in deep and thick coal seams","authors":"Li Wang ,&nbsp;Jiming Zhu ,&nbsp;Zhongchang Wang","doi":"10.1016/j.aiig.2025.100154","DOIUrl":null,"url":null,"abstract":"<div><div>The 91 measured values of the development height of the water-conducting fracture zone (WCFZ) in deep and thick coal seam mining faces under thick loose layer conditions were collected. Five key characteristic variables influencing the WCFZ height were identified. After removing outliers from the dataset, a Random Forest (RF) regression model optimized by the Sparrow Search Algorithm (SSA) was constructed. The hyperparameters of the RF model were iteratively optimized by minimizing the Out-of-Bag (OOB) error, resulting in the rapid determination of optimal parameters. Specifically, the SSA-RF model achieved an OOB error of 0.148, with 20 decision trees, a maximum depth of 8, a minimum split sample size of 2, and a minimum leaf node sample size of 1. Cross-validation experiments were performed using the trained optimal model and compared against other prediction methods. The results showed that the mining height had the most significant correlation with the development height of the WCFZ. The SSA-RF model outperformed all other models, with R<sup>2</sup> values exceeding 0.9 across the training, validation, and test datasets. Compared to other models, the SSA-RF model demonstrates a simpler structure, stronger fitting capacity, higher predictive accuracy, and superior stability and generalization ability. It also exhibits the smallest variation in relative error across datasets, indicating excellent adaptability to different data conditions.Furthermore, a numerical model was developed using the hydrogeological data from the 1305 working face at Wanfukou Coal Mine, Shandong Province, China, to simulate the dynamic development of the WCFZ during mining. The SSA-RF model predicted the WCFZ height to be 69.7 m, closely aligning with the PFC2D simulation result of 65 m, with an error of less than 5 %. Compared to traditional methods and numerical simulations, the SSA-RF model provides more accurate predictions, showing only a 7.23 % deviation from the PFC2D simulation, while traditional empirical formulas yield deviations as large as 19.97 %. These results demonstrate the SSA-RF model's superior predictive capability, reinforcing its reliability and engineering applicability for real-world mining operations. This model holds significant potential for enhancing mining safety and optimizing planning processes, offering a more accurate and efficient approach for WCFZ height prediction.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100154"},"PeriodicalIF":4.2000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666544125000504","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The 91 measured values of the development height of the water-conducting fracture zone (WCFZ) in deep and thick coal seam mining faces under thick loose layer conditions were collected. Five key characteristic variables influencing the WCFZ height were identified. After removing outliers from the dataset, a Random Forest (RF) regression model optimized by the Sparrow Search Algorithm (SSA) was constructed. The hyperparameters of the RF model were iteratively optimized by minimizing the Out-of-Bag (OOB) error, resulting in the rapid determination of optimal parameters. Specifically, the SSA-RF model achieved an OOB error of 0.148, with 20 decision trees, a maximum depth of 8, a minimum split sample size of 2, and a minimum leaf node sample size of 1. Cross-validation experiments were performed using the trained optimal model and compared against other prediction methods. The results showed that the mining height had the most significant correlation with the development height of the WCFZ. The SSA-RF model outperformed all other models, with R2 values exceeding 0.9 across the training, validation, and test datasets. Compared to other models, the SSA-RF model demonstrates a simpler structure, stronger fitting capacity, higher predictive accuracy, and superior stability and generalization ability. It also exhibits the smallest variation in relative error across datasets, indicating excellent adaptability to different data conditions.Furthermore, a numerical model was developed using the hydrogeological data from the 1305 working face at Wanfukou Coal Mine, Shandong Province, China, to simulate the dynamic development of the WCFZ during mining. The SSA-RF model predicted the WCFZ height to be 69.7 m, closely aligning with the PFC2D simulation result of 65 m, with an error of less than 5 %. Compared to traditional methods and numerical simulations, the SSA-RF model provides more accurate predictions, showing only a 7.23 % deviation from the PFC2D simulation, while traditional empirical formulas yield deviations as large as 19.97 %. These results demonstrate the SSA-RF model's superior predictive capability, reinforcing its reliability and engineering applicability for real-world mining operations. This model holds significant potential for enhancing mining safety and optimizing planning processes, offering a more accurate and efficient approach for WCFZ height prediction.
SSA-RF模型在深厚煤层导水裂隙带高度预测中的应用研究
收集了厚松散层条件下深厚煤层采煤工作面导水裂隙带发育高度的91个实测值。确定了影响WCFZ高度的5个关键特征变量。在剔除异常值后,构建了基于麻雀搜索算法(SSA)优化的随机森林(RF)回归模型。通过对模型的超参数进行迭代优化,最大限度地减少出袋误差,实现了最优参数的快速确定。具体而言,SSA-RF模型的OOB误差为0.148,决策树为20棵,最大深度为8,最小分裂样本量为2,最小叶节点样本量为1。利用训练好的最优模型进行交叉验证实验,并与其他预测方法进行对比。结果表明:采掘高度与西部断裂带发育高度的相关性最为显著;SSA-RF模型优于所有其他模型,在训练、验证和测试数据集上的R2值超过0.9。与其他模型相比,SSA-RF模型结构更简单,拟合能力更强,预测精度更高,稳定性和泛化能力更强。它还显示出数据集之间相对误差的最小变化,表明对不同数据条件的良好适应性。利用山东万福口煤矿1305工作面水文地质资料,建立了数值模型,模拟了开采过程中WCFZ的动态发展。SSA-RF模型预测WCFZ高度为69.7 m,与PFC2D模拟结果65 m基本吻合,误差小于5%。与传统方法和数值模拟相比,SSA-RF模型提供了更准确的预测,与PFC2D模拟的偏差仅为7.23%,而传统经验公式的偏差高达19.97%。这些结果表明,SSA-RF模型具有优越的预测能力,增强了其可靠性和对实际采矿作业的工程适用性。该模型在提高开采安全性和优化规划流程方面具有重要的潜力,为WCFZ高度预测提供了更准确、更有效的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.20
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信