Bartosz Szeląg , Krzysztof Barbusiński , Michał Stachura , Przemysław Kowal , Adam Kiczko , Eldon R. Rene
{"title":"Advanced evaluation of performance of machine learning models for soapstock splitting optimisation under uncertainty","authors":"Bartosz Szeląg , Krzysztof Barbusiński , Michał Stachura , Przemysław Kowal , Adam Kiczko , Eldon R. Rene","doi":"10.1016/j.wri.2025.100294","DOIUrl":null,"url":null,"abstract":"<div><div>This study proposes a computational framework for the prediction and optimisation of soapstock splitting under conditions of limited measurement data and input uncertainty. The objective was to evaluate and select the modeling approaches based on (i) data availability, (ii) model complexity, (iii) predictive accuracy, and (iv) sensitivity to input uncertainty. Machine learning algorithms—Extreme Gradient Boosting (XGBoost) and Support Vector Machines (SVM)—were assessed in comparison with Response Surface Methodology (RSM). XGBoost provided the most accurate predictions for chemical oxygen demand (COD) and organic phosphorus (P<sub>org</sub>), while SVM performed best for acid number (AN). K-means clustering identified specific input domains where RSM models could effectively substitute for XGBoost, offering a balance between simplicity and performance. GSA showed that the key influence on P<sub>org</sub> (organic phosphorus), COD (chemical oxygen demand) and AN (acid number) was the phosphorus content of the oil, and less important were the operational parameters of the soapstock splitting system. Multi-criteria optimisation under uncertainty using a genetic algorithm (NSGA II) showed a significant influence of phosphorus content uncertainty on the choice of soapstock splitting operating conditions. These findings underscore the importance of accurate phosphorus quantification and support the development of robust, data-efficient computational tools for the monitoring, prediction, and optimisation of complex industrial processes such as soapstock splitting.</div></div>","PeriodicalId":23714,"journal":{"name":"Water Resources and Industry","volume":"33 ","pages":"Article 100294"},"PeriodicalIF":4.5000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Resources and Industry","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212371725000186","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"WATER RESOURCES","Score":null,"Total":0}
引用次数: 0
Abstract
This study proposes a computational framework for the prediction and optimisation of soapstock splitting under conditions of limited measurement data and input uncertainty. The objective was to evaluate and select the modeling approaches based on (i) data availability, (ii) model complexity, (iii) predictive accuracy, and (iv) sensitivity to input uncertainty. Machine learning algorithms—Extreme Gradient Boosting (XGBoost) and Support Vector Machines (SVM)—were assessed in comparison with Response Surface Methodology (RSM). XGBoost provided the most accurate predictions for chemical oxygen demand (COD) and organic phosphorus (Porg), while SVM performed best for acid number (AN). K-means clustering identified specific input domains where RSM models could effectively substitute for XGBoost, offering a balance between simplicity and performance. GSA showed that the key influence on Porg (organic phosphorus), COD (chemical oxygen demand) and AN (acid number) was the phosphorus content of the oil, and less important were the operational parameters of the soapstock splitting system. Multi-criteria optimisation under uncertainty using a genetic algorithm (NSGA II) showed a significant influence of phosphorus content uncertainty on the choice of soapstock splitting operating conditions. These findings underscore the importance of accurate phosphorus quantification and support the development of robust, data-efficient computational tools for the monitoring, prediction, and optimisation of complex industrial processes such as soapstock splitting.
期刊介绍:
Water Resources and Industry moves research to innovation by focusing on the role industry plays in the exploitation, management and treatment of water resources. Different industries use radically different water resources in their production processes, while they produce, treat and dispose a wide variety of wastewater qualities. Depending on the geographical location of the facilities, the impact on the local resources will vary, pre-empting the applicability of one single approach. The aims and scope of the journal include: -Industrial water footprint assessment - an evaluation of tools and methodologies -What constitutes good corporate governance and policy and how to evaluate water-related risk -What constitutes good stakeholder collaboration and engagement -New technologies enabling companies to better manage water resources -Integration of water and energy and of water treatment and production processes in industry