用于预测和评估煤超临界水气化制氢的可解释机器学习

IF 7.5 1区 工程技术 Q2 ENERGY & FUELS
Fuel Pub Date : 2025-07-05 DOI:10.1016/j.fuel.2025.136173
Jianghua Tian , Runqiu Dong , Hanbing Jia , Zhiyong Peng , Zhigang Liu , Le Wang , Lei Yi , Jialing Xu , Hui Jin , Bin Chen , Liejin Guo
{"title":"用于预测和评估煤超临界水气化制氢的可解释机器学习","authors":"Jianghua Tian ,&nbsp;Runqiu Dong ,&nbsp;Hanbing Jia ,&nbsp;Zhiyong Peng ,&nbsp;Zhigang Liu ,&nbsp;Le Wang ,&nbsp;Lei Yi ,&nbsp;Jialing Xu ,&nbsp;Hui Jin ,&nbsp;Bin Chen ,&nbsp;Liejin Guo","doi":"10.1016/j.fuel.2025.136173","DOIUrl":null,"url":null,"abstract":"<div><div>Coal Supercritical Water Gasification (SCWG) process optimization by Machine Learning (ML) models is a promising strategy to conserve experimental resources. However, the lack of diversity in ML models and the neglect of their interpretability in existing works may limit the development of coal SCWG technology. This paper systematically collected 233 experimental results (1631 data points) to develop five ML models to analyze coal SCWG: Support Vector Regression (SVR), AdaBoost Regression (ABR), Decision Tree (DT), Random Forest (RF) Regression and Gradient Boosting Regression (GBR). The DT and GBR were found to have more robust predictive ability among the five models due to their superior performance in Mean Square Error (MSE), coefficient of determination (R<sup>2</sup>) and Mean Absolute Error (MAE). Temperature (TEMP) and Residence Time (RT) are the main controlling factors in determining gas production by analyzing the results based on SHapley Additive exPlanations (SHAP) values. There is a significant positive correlation between TEMP and RT and gas production. The SHAP values of the GBR model can well interpret the mechanism of the influence of coal SCWG parameters, especially the Concentration (CR) is negatively correlated with the gasification yields of H<sub>2</sub>, CO, and CO<sub>2</sub>, while it is positively correlated with the gas yield of CH<sub>4</sub>. Combining with the model predictive ability (MSE of 0.54, R<sup>2</sup> of 0.97, MAE of 0.19) of the model and the interpretability of the mechanism, the GBR model may be a superior tool to assist the coal SCWG technology. The error analysis and catalyst were input into the GBR model as characteristic parameters to further enhance its robustness. Compared to the kinetic model, the GBR model improved the accuracy and generalization ability of the four-gas yield prediction by expanding the input parameters (TEMP, RT, CR, error, catalyst type and concentration). This work would be of great value in the prediction and optimization of the coal SCWG process.</div></div>","PeriodicalId":325,"journal":{"name":"Fuel","volume":"404 ","pages":"Article 136173"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interpretable machine learning for predicting and evaluating hydrogen production from supercritical water gasification of coal\",\"authors\":\"Jianghua Tian ,&nbsp;Runqiu Dong ,&nbsp;Hanbing Jia ,&nbsp;Zhiyong Peng ,&nbsp;Zhigang Liu ,&nbsp;Le Wang ,&nbsp;Lei Yi ,&nbsp;Jialing Xu ,&nbsp;Hui Jin ,&nbsp;Bin Chen ,&nbsp;Liejin Guo\",\"doi\":\"10.1016/j.fuel.2025.136173\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Coal Supercritical Water Gasification (SCWG) process optimization by Machine Learning (ML) models is a promising strategy to conserve experimental resources. However, the lack of diversity in ML models and the neglect of their interpretability in existing works may limit the development of coal SCWG technology. This paper systematically collected 233 experimental results (1631 data points) to develop five ML models to analyze coal SCWG: Support Vector Regression (SVR), AdaBoost Regression (ABR), Decision Tree (DT), Random Forest (RF) Regression and Gradient Boosting Regression (GBR). The DT and GBR were found to have more robust predictive ability among the five models due to their superior performance in Mean Square Error (MSE), coefficient of determination (R<sup>2</sup>) and Mean Absolute Error (MAE). Temperature (TEMP) and Residence Time (RT) are the main controlling factors in determining gas production by analyzing the results based on SHapley Additive exPlanations (SHAP) values. There is a significant positive correlation between TEMP and RT and gas production. The SHAP values of the GBR model can well interpret the mechanism of the influence of coal SCWG parameters, especially the Concentration (CR) is negatively correlated with the gasification yields of H<sub>2</sub>, CO, and CO<sub>2</sub>, while it is positively correlated with the gas yield of CH<sub>4</sub>. Combining with the model predictive ability (MSE of 0.54, R<sup>2</sup> of 0.97, MAE of 0.19) of the model and the interpretability of the mechanism, the GBR model may be a superior tool to assist the coal SCWG technology. The error analysis and catalyst were input into the GBR model as characteristic parameters to further enhance its robustness. Compared to the kinetic model, the GBR model improved the accuracy and generalization ability of the four-gas yield prediction by expanding the input parameters (TEMP, RT, CR, error, catalyst type and concentration). This work would be of great value in the prediction and optimization of the coal SCWG process.</div></div>\",\"PeriodicalId\":325,\"journal\":{\"name\":\"Fuel\",\"volume\":\"404 \",\"pages\":\"Article 136173\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-07-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fuel\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0016236125018988\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuel","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016236125018988","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

摘要

利用机器学习模型优化煤超临界水气化(SCWG)过程是节约实验资源的一种有前途的策略。然而,ML模型缺乏多样性以及在现有工作中忽视其可解释性可能会限制煤炭SCWG技术的发展。本文系统地收集了233个实验结果(1631个数据点),建立了支持向量回归(SVR)、AdaBoost回归(ABR)、决策树(DT)、随机森林(RF)回归和梯度增强回归(GBR) 5种机器学习模型来分析煤炭SCWG。DT模型和GBR模型在均方误差(MSE)、决定系数(R2)和平均绝对误差(MAE)方面表现优异,在5个模型中具有较强的预测能力。根据SHapley加性解释(SHAP)值分析结果,温度(TEMP)和停留时间(RT)是决定产气量的主要控制因素。温度、RT与产气量呈显著正相关。GBR模型的SHAP值可以很好地解释煤SCWG参数的影响机理,特别是浓度(CR)与H2、CO和CO2的气化产率呈负相关,而与CH4的产率呈正相关。结合模型的模型预测能力(MSE为0.54,R2为0.97,MAE为0.19)和机理的可解释性,GBR模型可能是辅助煤超临界水处理技术的优越工具。将误差分析和催化剂作为特征参数输入到GBR模型中,进一步增强了模型的鲁棒性。与动力学模型相比,GBR模型通过扩大输入参数(TEMP、RT、CR、误差、催化剂类型和浓度),提高了四气产率预测的准确性和泛化能力。本研究对煤超临界转炉过程的预测和优化具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Interpretable machine learning for predicting and evaluating hydrogen production from supercritical water gasification of coal

Interpretable machine learning for predicting and evaluating hydrogen production from supercritical water gasification of coal
Coal Supercritical Water Gasification (SCWG) process optimization by Machine Learning (ML) models is a promising strategy to conserve experimental resources. However, the lack of diversity in ML models and the neglect of their interpretability in existing works may limit the development of coal SCWG technology. This paper systematically collected 233 experimental results (1631 data points) to develop five ML models to analyze coal SCWG: Support Vector Regression (SVR), AdaBoost Regression (ABR), Decision Tree (DT), Random Forest (RF) Regression and Gradient Boosting Regression (GBR). The DT and GBR were found to have more robust predictive ability among the five models due to their superior performance in Mean Square Error (MSE), coefficient of determination (R2) and Mean Absolute Error (MAE). Temperature (TEMP) and Residence Time (RT) are the main controlling factors in determining gas production by analyzing the results based on SHapley Additive exPlanations (SHAP) values. There is a significant positive correlation between TEMP and RT and gas production. The SHAP values of the GBR model can well interpret the mechanism of the influence of coal SCWG parameters, especially the Concentration (CR) is negatively correlated with the gasification yields of H2, CO, and CO2, while it is positively correlated with the gas yield of CH4. Combining with the model predictive ability (MSE of 0.54, R2 of 0.97, MAE of 0.19) of the model and the interpretability of the mechanism, the GBR model may be a superior tool to assist the coal SCWG technology. The error analysis and catalyst were input into the GBR model as characteristic parameters to further enhance its robustness. Compared to the kinetic model, the GBR model improved the accuracy and generalization ability of the four-gas yield prediction by expanding the input parameters (TEMP, RT, CR, error, catalyst type and concentration). This work would be of great value in the prediction and optimization of the coal SCWG process.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Fuel
Fuel 工程技术-工程:化工
CiteScore
12.80
自引率
20.30%
发文量
3506
审稿时长
64 days
期刊介绍: The exploration of energy sources remains a critical matter of study. For the past nine decades, fuel has consistently held the forefront in primary research efforts within the field of energy science. This area of investigation encompasses a wide range of subjects, with a particular emphasis on emerging concerns like environmental factors and pollution.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信