使用机器学习增强纤维素转化为5-羟甲基糠醛的数据驱动见解

IF 9.7 1区 环境科学与生态学 Q1 AGRICULTURAL ENGINEERING
Yanming Qiao , Ehsan Kargaran , Hao Ji , Meysam Madadi , Saeed Rafieyan , Dan Liu
{"title":"使用机器学习增强纤维素转化为5-羟甲基糠醛的数据驱动见解","authors":"Yanming Qiao ,&nbsp;Ehsan Kargaran ,&nbsp;Hao Ji ,&nbsp;Meysam Madadi ,&nbsp;Saeed Rafieyan ,&nbsp;Dan Liu","doi":"10.1016/j.biortech.2025.132582","DOIUrl":null,"url":null,"abstract":"<div><div>Converting cellulose into 5-Hydroxymethylfurfural (HMF) provides a promising strategy for creating bio-based chemicals, offering sustainable alternatives to petroleum-based materials in polymers, biofuels, and pharmaceuticals. However, the efficient production of HMF from cellulose is challenged by the complex interplay of numerous operational variables. This study develops a machine learning (ML) model to optimize HMF production and conducts a feature importance analysis to identify the key factors affecting HMF yield. Additionally, a Bayesian optimization is employed for multi-objective optimization aimed at maximizing HMF yield. A comprehensive dataset, sourced from existing literature, was subjected to statistical analysis to elucidate the influence of each factor on HMF production. Among the eight models evaluated, the CatBoost Regressor emerged as the most effective, delivering robust predictive performance with R<sup>2</sup> of 0.76 during testing and exhibiting low RMSE (4.72) and MAE (5.2) values. Feature importance analysis revealed that operational conditions, particularly time and temperature, were the most significant, accounting for 41.0% of the variability, followed by catalyst properties at 33.0% and solvent properties at 26.0%. The ML-based optimization achieved an HMF yield of 48.1%, with relative errors of −1% and 2.5% in the first (47.6%) and second (49.3%) runs of experimental validation, respectively. This research showcases ML’s ability to address challenges in cellulose-to-HMF conversion, offering insights for optimizing production and advancing sustainable manufacturing.</div></div>","PeriodicalId":258,"journal":{"name":"Bioresource Technology","volume":"430 ","pages":"Article 132582"},"PeriodicalIF":9.7000,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data-driven insights for enhanced cellulose conversion to 5-hydroxymethylfurfural using machine learning\",\"authors\":\"Yanming Qiao ,&nbsp;Ehsan Kargaran ,&nbsp;Hao Ji ,&nbsp;Meysam Madadi ,&nbsp;Saeed Rafieyan ,&nbsp;Dan Liu\",\"doi\":\"10.1016/j.biortech.2025.132582\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Converting cellulose into 5-Hydroxymethylfurfural (HMF) provides a promising strategy for creating bio-based chemicals, offering sustainable alternatives to petroleum-based materials in polymers, biofuels, and pharmaceuticals. However, the efficient production of HMF from cellulose is challenged by the complex interplay of numerous operational variables. This study develops a machine learning (ML) model to optimize HMF production and conducts a feature importance analysis to identify the key factors affecting HMF yield. Additionally, a Bayesian optimization is employed for multi-objective optimization aimed at maximizing HMF yield. A comprehensive dataset, sourced from existing literature, was subjected to statistical analysis to elucidate the influence of each factor on HMF production. Among the eight models evaluated, the CatBoost Regressor emerged as the most effective, delivering robust predictive performance with R<sup>2</sup> of 0.76 during testing and exhibiting low RMSE (4.72) and MAE (5.2) values. Feature importance analysis revealed that operational conditions, particularly time and temperature, were the most significant, accounting for 41.0% of the variability, followed by catalyst properties at 33.0% and solvent properties at 26.0%. The ML-based optimization achieved an HMF yield of 48.1%, with relative errors of −1% and 2.5% in the first (47.6%) and second (49.3%) runs of experimental validation, respectively. This research showcases ML’s ability to address challenges in cellulose-to-HMF conversion, offering insights for optimizing production and advancing sustainable manufacturing.</div></div>\",\"PeriodicalId\":258,\"journal\":{\"name\":\"Bioresource Technology\",\"volume\":\"430 \",\"pages\":\"Article 132582\"},\"PeriodicalIF\":9.7000,\"publicationDate\":\"2025-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioresource Technology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0960852425005486\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURAL ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioresource Technology","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0960852425005486","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

将纤维素转化为5-羟甲基糠醛(HMF)为制造生物基化学品提供了一种很有前途的策略,为聚合物、生物燃料和药品中的石油基材料提供了可持续的替代品。然而,从纤维素中高效生产HMF受到众多操作变量复杂相互作用的挑战。本研究开发了优化HMF生产的机器学习(ML)模型,并进行了特征重要性分析,以确定影响HMF产量的关键因素。此外,采用贝叶斯优化方法进行多目标优化,以最大限度地提高HMF产量。从现有文献中获取一个全面的数据集,对其进行统计分析,以阐明每个因素对HMF产生的影响。在评估的8个模型中,CatBoost回归因子是最有效的,在测试期间提供了稳健的预测性能,R2为0.76,RMSE(4.72)和MAE(5.2)值较低。特征重要性分析显示,操作条件,特别是时间和温度,是最重要的,占41.0%的变异性,其次是催化剂性质(33.0%)和溶剂性质(26.0%)。基于ml的优化得到的HMF产率为48.1%,第一次(47.6%)和第二次(49.3%)实验验证的相对误差分别为- 1%和2.5%。这项研究展示了ML解决纤维素到hmf转化挑战的能力,为优化生产和推进可持续制造提供了见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Data-driven insights for enhanced cellulose conversion to 5-hydroxymethylfurfural using machine learning

Data-driven insights for enhanced cellulose conversion to 5-hydroxymethylfurfural using machine learning
Converting cellulose into 5-Hydroxymethylfurfural (HMF) provides a promising strategy for creating bio-based chemicals, offering sustainable alternatives to petroleum-based materials in polymers, biofuels, and pharmaceuticals. However, the efficient production of HMF from cellulose is challenged by the complex interplay of numerous operational variables. This study develops a machine learning (ML) model to optimize HMF production and conducts a feature importance analysis to identify the key factors affecting HMF yield. Additionally, a Bayesian optimization is employed for multi-objective optimization aimed at maximizing HMF yield. A comprehensive dataset, sourced from existing literature, was subjected to statistical analysis to elucidate the influence of each factor on HMF production. Among the eight models evaluated, the CatBoost Regressor emerged as the most effective, delivering robust predictive performance with R2 of 0.76 during testing and exhibiting low RMSE (4.72) and MAE (5.2) values. Feature importance analysis revealed that operational conditions, particularly time and temperature, were the most significant, accounting for 41.0% of the variability, followed by catalyst properties at 33.0% and solvent properties at 26.0%. The ML-based optimization achieved an HMF yield of 48.1%, with relative errors of −1% and 2.5% in the first (47.6%) and second (49.3%) runs of experimental validation, respectively. This research showcases ML’s ability to address challenges in cellulose-to-HMF conversion, offering insights for optimizing production and advancing sustainable manufacturing.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Bioresource Technology
Bioresource Technology 工程技术-能源与燃料
CiteScore
20.80
自引率
19.30%
发文量
2013
审稿时长
12 days
期刊介绍: Bioresource Technology publishes original articles, review articles, case studies, and short communications covering the fundamentals, applications, and management of bioresource technology. The journal seeks to advance and disseminate knowledge across various areas related to biomass, biological waste treatment, bioenergy, biotransformations, bioresource systems analysis, and associated conversion or production technologies. Topics include: • Biofuels: liquid and gaseous biofuels production, modeling and economics • Bioprocesses and bioproducts: biocatalysis and fermentations • Biomass and feedstocks utilization: bioconversion of agro-industrial residues • Environmental protection: biological waste treatment • Thermochemical conversion of biomass: combustion, pyrolysis, gasification, catalysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信