Yingju Chang , Wei Wang , Jo-Shu Chang , Duu-Jong Lee
{"title":"生物质热化学制氢的机器学习建模","authors":"Yingju Chang , Wei Wang , Jo-Shu Chang , Duu-Jong Lee","doi":"10.1016/j.nxener.2025.100377","DOIUrl":null,"url":null,"abstract":"<div><div>This paper outlines the steps for applying machine learning (ML) models to predict biohydrogen yields from biomass using thermochemical treatments. Input features include elemental compositions and thermochemical process parameters, while outputs are biohydrogen yields reported in existing studies. Procedures and software for performing ML modeling on biohydrogen yield predictions are provided. Input features were analyzed using Random Forest (RF) and eXtreme Gradient Boosting (XGB) models, interpreted through SHapley Additive exPlanations (SHAP) and Partial Dependence Plot analyses. XGB demonstrated superior performance over RF in predicting hydrogen yields, as measured by mean squared error values. Fixed carbon content, moisture, and volatile matter content significantly influenced the yields. Process temperature and fixed carbon content showed an increase in yield when temperatures were below 600<!--> <!-->°C and carbon content was below 20%. The provided programs are adaptable for ML modeling and help users efficiently organize datasets to develop their models.</div></div>","PeriodicalId":100957,"journal":{"name":"Next Energy","volume":"9 ","pages":"Article 100377"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning modeling for thermochemical biohydrogen production from biomass\",\"authors\":\"Yingju Chang , Wei Wang , Jo-Shu Chang , Duu-Jong Lee\",\"doi\":\"10.1016/j.nxener.2025.100377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper outlines the steps for applying machine learning (ML) models to predict biohydrogen yields from biomass using thermochemical treatments. Input features include elemental compositions and thermochemical process parameters, while outputs are biohydrogen yields reported in existing studies. Procedures and software for performing ML modeling on biohydrogen yield predictions are provided. Input features were analyzed using Random Forest (RF) and eXtreme Gradient Boosting (XGB) models, interpreted through SHapley Additive exPlanations (SHAP) and Partial Dependence Plot analyses. XGB demonstrated superior performance over RF in predicting hydrogen yields, as measured by mean squared error values. Fixed carbon content, moisture, and volatile matter content significantly influenced the yields. Process temperature and fixed carbon content showed an increase in yield when temperatures were below 600<!--> <!-->°C and carbon content was below 20%. The provided programs are adaptable for ML modeling and help users efficiently organize datasets to develop their models.</div></div>\",\"PeriodicalId\":100957,\"journal\":{\"name\":\"Next Energy\",\"volume\":\"9 \",\"pages\":\"Article 100377\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Next Energy\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2949821X25001401\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Next Energy","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949821X25001401","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Machine learning modeling for thermochemical biohydrogen production from biomass
This paper outlines the steps for applying machine learning (ML) models to predict biohydrogen yields from biomass using thermochemical treatments. Input features include elemental compositions and thermochemical process parameters, while outputs are biohydrogen yields reported in existing studies. Procedures and software for performing ML modeling on biohydrogen yield predictions are provided. Input features were analyzed using Random Forest (RF) and eXtreme Gradient Boosting (XGB) models, interpreted through SHapley Additive exPlanations (SHAP) and Partial Dependence Plot analyses. XGB demonstrated superior performance over RF in predicting hydrogen yields, as measured by mean squared error values. Fixed carbon content, moisture, and volatile matter content significantly influenced the yields. Process temperature and fixed carbon content showed an increase in yield when temperatures were below 600 °C and carbon content was below 20%. The provided programs are adaptable for ML modeling and help users efficiently organize datasets to develop their models.