利用混合模型估算生物质生物油产量

IF 6.2 1区 农林科学 Q1 AGRICULTURAL ENGINEERING
Huanqiang Guo , Farag M.A. Altalbawy , Hardik Doshi , Anupam Yadav , B. Jayaprakash , Abhayveer Singh , B. Bharathi , Prabhat Kumar Sahu , Shaxnoza Saydaxmetova , Ahmad Alkhayyat , Samim Sherzod , Khursheed Muzammil
{"title":"利用混合模型估算生物质生物油产量","authors":"Huanqiang Guo ,&nbsp;Farag M.A. Altalbawy ,&nbsp;Hardik Doshi ,&nbsp;Anupam Yadav ,&nbsp;B. Jayaprakash ,&nbsp;Abhayveer Singh ,&nbsp;B. Bharathi ,&nbsp;Prabhat Kumar Sahu ,&nbsp;Shaxnoza Saydaxmetova ,&nbsp;Ahmad Alkhayyat ,&nbsp;Samim Sherzod ,&nbsp;Khursheed Muzammil","doi":"10.1016/j.indcrop.2025.121711","DOIUrl":null,"url":null,"abstract":"<div><div>The bio-oil yield from biomass during pyrolysis, influenced by multiple chemical and process parameters, is particular for gaining sustainable production of energy, necessitating accurate predictive models. This study employs a Gradient Boosting Machine (GBM) framework, augmented by four sophisticated optimization algorithms: Batch Bayesian Optimization (BBO), Evolution Strategies (ES), Bayesian Probability Improvement (BPI), and Gaussian Processes Optimization (GPO). The model leverages a dataset comprising 400 experimental samples, with 90 % allocated for training and 10 % for testing, using input variables such as content of carbon, content of nitrogen, content of hydrogen, content of ash, content of oxygen, crystallinity index, BET surface area, catalyst-to-biomass ratio, residence time, temperature and to predict bio-oil yield. To mitigate overfitting, k-fold cross-validation is useful during training. The performance of every optimization algorithm is assessed through computational runtime and metrics such as R-squared (R²), mean squared error (MSE), and average absolute relative error (AARE%). Correlation analysis reveals varied relationships, with BET surface area (0.18) and oxygen content (0.14) showing positive associations with bio-oil yield, while temperature (-0.26) and ash content (-0.22) exhibit notable negative correlations. Among the optimization approaches, GBM-BBO achieves the highest accuracy, with an R² of 0.99 for training set and 0.94 for test set, surpassing other methods. Regarding computational efficiency, GPO is the fastest, requiring 171.9 s, whereas BBO is the slowest at 298.2 s. SHAP analysis identifies BET surface area, ash content, and temperature as powerful factors affecting bio-oil yield, underscoring the efficacy of data-driven methodologies in addressing intricate systems. These models offer reliable tools for estimating bio-oil yield, reducing reliance on expensive, time-consuming, and resource-intensive experimental processes.</div></div>","PeriodicalId":13581,"journal":{"name":"Industrial Crops and Products","volume":"235 ","pages":"Article 121711"},"PeriodicalIF":6.2000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Estimation of bio-oil yield for biomass using hybrid models\",\"authors\":\"Huanqiang Guo ,&nbsp;Farag M.A. Altalbawy ,&nbsp;Hardik Doshi ,&nbsp;Anupam Yadav ,&nbsp;B. Jayaprakash ,&nbsp;Abhayveer Singh ,&nbsp;B. Bharathi ,&nbsp;Prabhat Kumar Sahu ,&nbsp;Shaxnoza Saydaxmetova ,&nbsp;Ahmad Alkhayyat ,&nbsp;Samim Sherzod ,&nbsp;Khursheed Muzammil\",\"doi\":\"10.1016/j.indcrop.2025.121711\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The bio-oil yield from biomass during pyrolysis, influenced by multiple chemical and process parameters, is particular for gaining sustainable production of energy, necessitating accurate predictive models. This study employs a Gradient Boosting Machine (GBM) framework, augmented by four sophisticated optimization algorithms: Batch Bayesian Optimization (BBO), Evolution Strategies (ES), Bayesian Probability Improvement (BPI), and Gaussian Processes Optimization (GPO). The model leverages a dataset comprising 400 experimental samples, with 90 % allocated for training and 10 % for testing, using input variables such as content of carbon, content of nitrogen, content of hydrogen, content of ash, content of oxygen, crystallinity index, BET surface area, catalyst-to-biomass ratio, residence time, temperature and to predict bio-oil yield. To mitigate overfitting, k-fold cross-validation is useful during training. The performance of every optimization algorithm is assessed through computational runtime and metrics such as R-squared (R²), mean squared error (MSE), and average absolute relative error (AARE%). Correlation analysis reveals varied relationships, with BET surface area (0.18) and oxygen content (0.14) showing positive associations with bio-oil yield, while temperature (-0.26) and ash content (-0.22) exhibit notable negative correlations. Among the optimization approaches, GBM-BBO achieves the highest accuracy, with an R² of 0.99 for training set and 0.94 for test set, surpassing other methods. Regarding computational efficiency, GPO is the fastest, requiring 171.9 s, whereas BBO is the slowest at 298.2 s. SHAP analysis identifies BET surface area, ash content, and temperature as powerful factors affecting bio-oil yield, underscoring the efficacy of data-driven methodologies in addressing intricate systems. These models offer reliable tools for estimating bio-oil yield, reducing reliance on expensive, time-consuming, and resource-intensive experimental processes.</div></div>\",\"PeriodicalId\":13581,\"journal\":{\"name\":\"Industrial Crops and Products\",\"volume\":\"235 \",\"pages\":\"Article 121711\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Industrial Crops and Products\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0926669025012579\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURAL ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Industrial Crops and Products","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926669025012579","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

生物质热解过程中的生物油产量受多种化学和工艺参数的影响,对于获得可持续的能源生产尤为重要,因此需要精确的预测模型。本研究采用梯度增强机(GBM)框架,并采用四种复杂的优化算法:批处理贝叶斯优化(BBO)、进化策略(ES)、贝叶斯概率改进(BPI)和高斯过程优化(GPO)。该模型利用了一个包含400个实验样本的数据集,其中90% %用于训练,10% %用于测试,使用输入变量,如碳含量、氮含量、氢含量、灰分含量、氧含量、结晶度指数、BET表面积、催化剂与生物质比、停留时间、温度,并预测生物油产量。为了减轻过拟合,k-fold交叉验证在训练期间是有用的。每个优化算法的性能都通过计算运行时间和诸如R平方(R²)、均方误差(MSE)和平均绝对相对误差(AARE%)等指标来评估。相关分析显示,BET表面积(0.18)和氧含量(0.14)与生物油产量呈正相关,而温度(-0.26)和灰分含量(-0.22)与生物油产量呈显著负相关。在这些优化方法中,GBM-BBO的准确率最高,训练集的R²为0.99,测试集的R²为0.94,优于其他方法。在计算效率方面,GPO最快,需要171.9 s, BBO最慢,需要298.2 s。SHAP分析确定BET表面积、灰分含量和温度是影响生物油产量的重要因素,强调了数据驱动方法在解决复杂系统中的有效性。这些模型为估计生物油产量提供了可靠的工具,减少了对昂贵、耗时和资源密集型实验过程的依赖。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Estimation of bio-oil yield for biomass using hybrid models
The bio-oil yield from biomass during pyrolysis, influenced by multiple chemical and process parameters, is particular for gaining sustainable production of energy, necessitating accurate predictive models. This study employs a Gradient Boosting Machine (GBM) framework, augmented by four sophisticated optimization algorithms: Batch Bayesian Optimization (BBO), Evolution Strategies (ES), Bayesian Probability Improvement (BPI), and Gaussian Processes Optimization (GPO). The model leverages a dataset comprising 400 experimental samples, with 90 % allocated for training and 10 % for testing, using input variables such as content of carbon, content of nitrogen, content of hydrogen, content of ash, content of oxygen, crystallinity index, BET surface area, catalyst-to-biomass ratio, residence time, temperature and to predict bio-oil yield. To mitigate overfitting, k-fold cross-validation is useful during training. The performance of every optimization algorithm is assessed through computational runtime and metrics such as R-squared (R²), mean squared error (MSE), and average absolute relative error (AARE%). Correlation analysis reveals varied relationships, with BET surface area (0.18) and oxygen content (0.14) showing positive associations with bio-oil yield, while temperature (-0.26) and ash content (-0.22) exhibit notable negative correlations. Among the optimization approaches, GBM-BBO achieves the highest accuracy, with an R² of 0.99 for training set and 0.94 for test set, surpassing other methods. Regarding computational efficiency, GPO is the fastest, requiring 171.9 s, whereas BBO is the slowest at 298.2 s. SHAP analysis identifies BET surface area, ash content, and temperature as powerful factors affecting bio-oil yield, underscoring the efficacy of data-driven methodologies in addressing intricate systems. These models offer reliable tools for estimating bio-oil yield, reducing reliance on expensive, time-consuming, and resource-intensive experimental processes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Industrial Crops and Products
Industrial Crops and Products 农林科学-农业工程
CiteScore
9.50
自引率
8.50%
发文量
1518
审稿时长
43 days
期刊介绍: Industrial Crops and Products is an International Journal publishing academic and industrial research on industrial (defined as non-food/non-feed) crops and products. Papers concern both crop-oriented and bio-based materials from crops-oriented research, and should be of interest to an international audience, hypothesis driven, and where comparisons are made statistics performed.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信