Yang Shi , Xinnan Zheng , Xiaoxia Wang , Xiangyi Shi , Cheng Hou , Shizhuo Wang , Zipeng Huang , Zheng Shen
{"title":"A hybrid XGB-ANN model integrated with Lion Optimization Algorithm for improved prediction accuracy and lactic acid yield optimization","authors":"Yang Shi , Xinnan Zheng , Xiaoxia Wang , Xiangyi Shi , Cheng Hou , Shizhuo Wang , Zipeng Huang , Zheng Shen","doi":"10.1016/j.procbio.2025.09.005","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning (ML) has been widely applied in industrial fields in recent years. However, ML faces challenges in fermentation production due to complex features and limited data. To overcome the limitations of single models and improve the prediction accuracy and stability of lactic acid fermentation, we combined six classical ML models with Artificial Neural Networks (ANN) separately in a serial manner to construct the Classical-ANN (C-ANN) hybrid models. Meanwhile, implemented the Lion Optimization Algorithm (LOA) to systematically optimize critical fermentation parameters and conducted experimental validation. While single models showed moderate performance (Extreme Gradient Boosting (XGB) R²= 0.9197; ANN R²= 0.9295), C-ANN hybrids demonstrated superior accuracy across all metrics. The XGB-ANN model achieved exceptional performance (R²=0.9850), benefiting from XGB's robust feature selection and ANN's high-dimensional processing capabilities. SHAP analysis revealed fermentation time as the most critical yield determinant, followed by total volume and pH. Lion Optimization Algorithm parameters enabled a predicted yield of 81.21 %, with experimental validation showing only 4.72 % deviation from predictions. This study not only provides valuable assistance for optimization of fermentation production conditions in the short term but also offers an innovative and effective modeling approach for future research in this field.</div></div>","PeriodicalId":20811,"journal":{"name":"Process Biochemistry","volume":"159 ","pages":"Pages 1-14"},"PeriodicalIF":4.0000,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Process Biochemistry","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1359511325002508","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning (ML) has been widely applied in industrial fields in recent years. However, ML faces challenges in fermentation production due to complex features and limited data. To overcome the limitations of single models and improve the prediction accuracy and stability of lactic acid fermentation, we combined six classical ML models with Artificial Neural Networks (ANN) separately in a serial manner to construct the Classical-ANN (C-ANN) hybrid models. Meanwhile, implemented the Lion Optimization Algorithm (LOA) to systematically optimize critical fermentation parameters and conducted experimental validation. While single models showed moderate performance (Extreme Gradient Boosting (XGB) R²= 0.9197; ANN R²= 0.9295), C-ANN hybrids demonstrated superior accuracy across all metrics. The XGB-ANN model achieved exceptional performance (R²=0.9850), benefiting from XGB's robust feature selection and ANN's high-dimensional processing capabilities. SHAP analysis revealed fermentation time as the most critical yield determinant, followed by total volume and pH. Lion Optimization Algorithm parameters enabled a predicted yield of 81.21 %, with experimental validation showing only 4.72 % deviation from predictions. This study not only provides valuable assistance for optimization of fermentation production conditions in the short term but also offers an innovative and effective modeling approach for future research in this field.
期刊介绍:
Process Biochemistry is an application-orientated research journal devoted to reporting advances with originality and novelty, in the science and technology of the processes involving bioactive molecules and living organisms. These processes concern the production of useful metabolites or materials, or the removal of toxic compounds using tools and methods of current biology and engineering. Its main areas of interest include novel bioprocesses and enabling technologies (such as nanobiotechnology, tissue engineering, directed evolution, metabolic engineering, systems biology, and synthetic biology) applicable in food (nutraceutical), healthcare (medical, pharmaceutical, cosmetic), energy (biofuels), environmental, and biorefinery industries and their underlying biological and engineering principles.