Huanqiang Guo , Farag M.A. Altalbawy , Hardik Doshi , Anupam Yadav , B. Jayaprakash , Abhayveer Singh , B. Bharathi , Prabhat Kumar Sahu , Shaxnoza Saydaxmetova , Ahmad Alkhayyat , Samim Sherzod , Khursheed Muzammil
{"title":"利用混合模型估算生物质生物油产量","authors":"Huanqiang Guo , Farag M.A. Altalbawy , Hardik Doshi , Anupam Yadav , B. Jayaprakash , Abhayveer Singh , B. Bharathi , Prabhat Kumar Sahu , Shaxnoza Saydaxmetova , Ahmad Alkhayyat , Samim Sherzod , Khursheed Muzammil","doi":"10.1016/j.indcrop.2025.121711","DOIUrl":null,"url":null,"abstract":"<div><div>The bio-oil yield from biomass during pyrolysis, influenced by multiple chemical and process parameters, is particular for gaining sustainable production of energy, necessitating accurate predictive models. This study employs a Gradient Boosting Machine (GBM) framework, augmented by four sophisticated optimization algorithms: Batch Bayesian Optimization (BBO), Evolution Strategies (ES), Bayesian Probability Improvement (BPI), and Gaussian Processes Optimization (GPO). The model leverages a dataset comprising 400 experimental samples, with 90 % allocated for training and 10 % for testing, using input variables such as content of carbon, content of nitrogen, content of hydrogen, content of ash, content of oxygen, crystallinity index, BET surface area, catalyst-to-biomass ratio, residence time, temperature and to predict bio-oil yield. To mitigate overfitting, k-fold cross-validation is useful during training. The performance of every optimization algorithm is assessed through computational runtime and metrics such as R-squared (R²), mean squared error (MSE), and average absolute relative error (AARE%). Correlation analysis reveals varied relationships, with BET surface area (0.18) and oxygen content (0.14) showing positive associations with bio-oil yield, while temperature (-0.26) and ash content (-0.22) exhibit notable negative correlations. Among the optimization approaches, GBM-BBO achieves the highest accuracy, with an R² of 0.99 for training set and 0.94 for test set, surpassing other methods. Regarding computational efficiency, GPO is the fastest, requiring 171.9 s, whereas BBO is the slowest at 298.2 s. SHAP analysis identifies BET surface area, ash content, and temperature as powerful factors affecting bio-oil yield, underscoring the efficacy of data-driven methodologies in addressing intricate systems. These models offer reliable tools for estimating bio-oil yield, reducing reliance on expensive, time-consuming, and resource-intensive experimental processes.</div></div>","PeriodicalId":13581,"journal":{"name":"Industrial Crops and Products","volume":"235 ","pages":"Article 121711"},"PeriodicalIF":6.2000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Estimation of bio-oil yield for biomass using hybrid models\",\"authors\":\"Huanqiang Guo , Farag M.A. Altalbawy , Hardik Doshi , Anupam Yadav , B. Jayaprakash , Abhayveer Singh , B. Bharathi , Prabhat Kumar Sahu , Shaxnoza Saydaxmetova , Ahmad Alkhayyat , Samim Sherzod , Khursheed Muzammil\",\"doi\":\"10.1016/j.indcrop.2025.121711\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The bio-oil yield from biomass during pyrolysis, influenced by multiple chemical and process parameters, is particular for gaining sustainable production of energy, necessitating accurate predictive models. This study employs a Gradient Boosting Machine (GBM) framework, augmented by four sophisticated optimization algorithms: Batch Bayesian Optimization (BBO), Evolution Strategies (ES), Bayesian Probability Improvement (BPI), and Gaussian Processes Optimization (GPO). The model leverages a dataset comprising 400 experimental samples, with 90 % allocated for training and 10 % for testing, using input variables such as content of carbon, content of nitrogen, content of hydrogen, content of ash, content of oxygen, crystallinity index, BET surface area, catalyst-to-biomass ratio, residence time, temperature and to predict bio-oil yield. To mitigate overfitting, k-fold cross-validation is useful during training. The performance of every optimization algorithm is assessed through computational runtime and metrics such as R-squared (R²), mean squared error (MSE), and average absolute relative error (AARE%). Correlation analysis reveals varied relationships, with BET surface area (0.18) and oxygen content (0.14) showing positive associations with bio-oil yield, while temperature (-0.26) and ash content (-0.22) exhibit notable negative correlations. Among the optimization approaches, GBM-BBO achieves the highest accuracy, with an R² of 0.99 for training set and 0.94 for test set, surpassing other methods. Regarding computational efficiency, GPO is the fastest, requiring 171.9 s, whereas BBO is the slowest at 298.2 s. SHAP analysis identifies BET surface area, ash content, and temperature as powerful factors affecting bio-oil yield, underscoring the efficacy of data-driven methodologies in addressing intricate systems. These models offer reliable tools for estimating bio-oil yield, reducing reliance on expensive, time-consuming, and resource-intensive experimental processes.</div></div>\",\"PeriodicalId\":13581,\"journal\":{\"name\":\"Industrial Crops and Products\",\"volume\":\"235 \",\"pages\":\"Article 121711\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Industrial Crops and Products\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0926669025012579\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURAL ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Industrial Crops and Products","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0926669025012579","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
Estimation of bio-oil yield for biomass using hybrid models
The bio-oil yield from biomass during pyrolysis, influenced by multiple chemical and process parameters, is particular for gaining sustainable production of energy, necessitating accurate predictive models. This study employs a Gradient Boosting Machine (GBM) framework, augmented by four sophisticated optimization algorithms: Batch Bayesian Optimization (BBO), Evolution Strategies (ES), Bayesian Probability Improvement (BPI), and Gaussian Processes Optimization (GPO). The model leverages a dataset comprising 400 experimental samples, with 90 % allocated for training and 10 % for testing, using input variables such as content of carbon, content of nitrogen, content of hydrogen, content of ash, content of oxygen, crystallinity index, BET surface area, catalyst-to-biomass ratio, residence time, temperature and to predict bio-oil yield. To mitigate overfitting, k-fold cross-validation is useful during training. The performance of every optimization algorithm is assessed through computational runtime and metrics such as R-squared (R²), mean squared error (MSE), and average absolute relative error (AARE%). Correlation analysis reveals varied relationships, with BET surface area (0.18) and oxygen content (0.14) showing positive associations with bio-oil yield, while temperature (-0.26) and ash content (-0.22) exhibit notable negative correlations. Among the optimization approaches, GBM-BBO achieves the highest accuracy, with an R² of 0.99 for training set and 0.94 for test set, surpassing other methods. Regarding computational efficiency, GPO is the fastest, requiring 171.9 s, whereas BBO is the slowest at 298.2 s. SHAP analysis identifies BET surface area, ash content, and temperature as powerful factors affecting bio-oil yield, underscoring the efficacy of data-driven methodologies in addressing intricate systems. These models offer reliable tools for estimating bio-oil yield, reducing reliance on expensive, time-consuming, and resource-intensive experimental processes.
期刊介绍:
Industrial Crops and Products is an International Journal publishing academic and industrial research on industrial (defined as non-food/non-feed) crops and products. Papers concern both crop-oriented and bio-based materials from crops-oriented research, and should be of interest to an international audience, hypothesis driven, and where comparisons are made statistics performed.