Machine learning for predicting and optimizing the performance of a commercial-scale anaerobic digester with diverse feedstocks and operating conditions
{"title":"Machine learning for predicting and optimizing the performance of a commercial-scale anaerobic digester with diverse feedstocks and operating conditions","authors":"Fei Long , Meicai Xu , Wei Liao , Hong Liu","doi":"10.1016/j.biortech.2025.132940","DOIUrl":null,"url":null,"abstract":"<div><div>The improper disposal of food waste and livestock manure poses significant environmental risks, including nutrient pollution, water contamination, and greenhouse gas emissions. Anaerobic digestion (AD) provides a sustainable pathway for converting organic waste into biogas while reducing environmental impacts. However, optimizing AD performance at a commercial scale remains challenging due to feedstock variability, operational complexity, and time-dependent dynamics. In this study, we analyzed six years of data from a commercial-scale AD system processing 18 types of food and manure waste to develop machine learning (ML) models for predictive analysis and process optimization. Three key outputs, total gas production, methane percentage, and H<sub>2</sub>S content, were predicted using Random Forest (RF), Artificial Neural Networks (ANN), and XGBoost. RF consistently yielded the highest performance with accuracy of 0.91 (gas production), 0.93 (methane), and 0.91 (H<sub>2</sub>S). Feature importance analysis revealed that time-series factors (e.g., rolling averages of previous days), pH, temperature, and hydraulic retention time (HRT) significantly influenced model accuracy. Notably, feedstocks such as dairy manure and pineapple waste exhibited strong correlations with both gas yield and H<sub>2</sub>S fluctuations. Optimization using Particle Swarm Optimization and Simulated Annealing demonstrated the potential to improve biogas production by up to 12 % and reduce H<sub>2</sub>S levels by as much as 65 % through adjusted operating conditions. These findings highlight the value of ML in not only forecasting AD performance with high accuracy but also in identifying operational strategies to enhance system efficiency and stability. This work provides actionable insights for the data-driven management of commercial-scale AD systems.</div></div>","PeriodicalId":258,"journal":{"name":"Bioresource Technology","volume":"435 ","pages":"Article 132940"},"PeriodicalIF":9.0000,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioresource Technology","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S096085242500906X","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
The improper disposal of food waste and livestock manure poses significant environmental risks, including nutrient pollution, water contamination, and greenhouse gas emissions. Anaerobic digestion (AD) provides a sustainable pathway for converting organic waste into biogas while reducing environmental impacts. However, optimizing AD performance at a commercial scale remains challenging due to feedstock variability, operational complexity, and time-dependent dynamics. In this study, we analyzed six years of data from a commercial-scale AD system processing 18 types of food and manure waste to develop machine learning (ML) models for predictive analysis and process optimization. Three key outputs, total gas production, methane percentage, and H2S content, were predicted using Random Forest (RF), Artificial Neural Networks (ANN), and XGBoost. RF consistently yielded the highest performance with accuracy of 0.91 (gas production), 0.93 (methane), and 0.91 (H2S). Feature importance analysis revealed that time-series factors (e.g., rolling averages of previous days), pH, temperature, and hydraulic retention time (HRT) significantly influenced model accuracy. Notably, feedstocks such as dairy manure and pineapple waste exhibited strong correlations with both gas yield and H2S fluctuations. Optimization using Particle Swarm Optimization and Simulated Annealing demonstrated the potential to improve biogas production by up to 12 % and reduce H2S levels by as much as 65 % through adjusted operating conditions. These findings highlight the value of ML in not only forecasting AD performance with high accuracy but also in identifying operational strategies to enhance system efficiency and stability. This work provides actionable insights for the data-driven management of commercial-scale AD systems.
期刊介绍:
Bioresource Technology publishes original articles, review articles, case studies, and short communications covering the fundamentals, applications, and management of bioresource technology. The journal seeks to advance and disseminate knowledge across various areas related to biomass, biological waste treatment, bioenergy, biotransformations, bioresource systems analysis, and associated conversion or production technologies.
Topics include:
• Biofuels: liquid and gaseous biofuels production, modeling and economics
• Bioprocesses and bioproducts: biocatalysis and fermentations
• Biomass and feedstocks utilization: bioconversion of agro-industrial residues
• Environmental protection: biological waste treatment
• Thermochemical conversion of biomass: combustion, pyrolysis, gasification, catalysis.