Advanced evaluation of performance of machine learning models for soapstock splitting optimisation under uncertainty

IF 4.5 3区 工程技术 Q1 WATER RESOURCES
Bartosz Szeląg , Krzysztof Barbusiński , Michał Stachura , Przemysław Kowal , Adam Kiczko , Eldon R. Rene
{"title":"Advanced evaluation of performance of machine learning models for soapstock splitting optimisation under uncertainty","authors":"Bartosz Szeląg ,&nbsp;Krzysztof Barbusiński ,&nbsp;Michał Stachura ,&nbsp;Przemysław Kowal ,&nbsp;Adam Kiczko ,&nbsp;Eldon R. Rene","doi":"10.1016/j.wri.2025.100294","DOIUrl":null,"url":null,"abstract":"<div><div>This study proposes a computational framework for the prediction and optimisation of soapstock splitting under conditions of limited measurement data and input uncertainty. The objective was to evaluate and select the modeling approaches based on (i) data availability, (ii) model complexity, (iii) predictive accuracy, and (iv) sensitivity to input uncertainty. Machine learning algorithms—Extreme Gradient Boosting (XGBoost) and Support Vector Machines (SVM)—were assessed in comparison with Response Surface Methodology (RSM). XGBoost provided the most accurate predictions for chemical oxygen demand (COD) and organic phosphorus (P<sub>org</sub>), while SVM performed best for acid number (AN). K-means clustering identified specific input domains where RSM models could effectively substitute for XGBoost, offering a balance between simplicity and performance. GSA showed that the key influence on P<sub>org</sub> (organic phosphorus), COD (chemical oxygen demand) and AN (acid number) was the phosphorus content of the oil, and less important were the operational parameters of the soapstock splitting system. Multi-criteria optimisation under uncertainty using a genetic algorithm (NSGA II) showed a significant influence of phosphorus content uncertainty on the choice of soapstock splitting operating conditions. These findings underscore the importance of accurate phosphorus quantification and support the development of robust, data-efficient computational tools for the monitoring, prediction, and optimisation of complex industrial processes such as soapstock splitting.</div></div>","PeriodicalId":23714,"journal":{"name":"Water Resources and Industry","volume":"33 ","pages":"Article 100294"},"PeriodicalIF":4.5000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Resources and Industry","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212371725000186","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"WATER RESOURCES","Score":null,"Total":0}
引用次数: 0

Abstract

This study proposes a computational framework for the prediction and optimisation of soapstock splitting under conditions of limited measurement data and input uncertainty. The objective was to evaluate and select the modeling approaches based on (i) data availability, (ii) model complexity, (iii) predictive accuracy, and (iv) sensitivity to input uncertainty. Machine learning algorithms—Extreme Gradient Boosting (XGBoost) and Support Vector Machines (SVM)—were assessed in comparison with Response Surface Methodology (RSM). XGBoost provided the most accurate predictions for chemical oxygen demand (COD) and organic phosphorus (Porg), while SVM performed best for acid number (AN). K-means clustering identified specific input domains where RSM models could effectively substitute for XGBoost, offering a balance between simplicity and performance. GSA showed that the key influence on Porg (organic phosphorus), COD (chemical oxygen demand) and AN (acid number) was the phosphorus content of the oil, and less important were the operational parameters of the soapstock splitting system. Multi-criteria optimisation under uncertainty using a genetic algorithm (NSGA II) showed a significant influence of phosphorus content uncertainty on the choice of soapstock splitting operating conditions. These findings underscore the importance of accurate phosphorus quantification and support the development of robust, data-efficient computational tools for the monitoring, prediction, and optimisation of complex industrial processes such as soapstock splitting.

Abstract Image

不确定条件下皂料拆分优化机器学习模型性能的高级评价
本研究提出了一个在有限测量数据和输入不确定性条件下预测和优化皂料分裂的计算框架。目的是根据(i)数据可用性、(ii)模型复杂性、(iii)预测准确性和(iv)对输入不确定性的敏感性来评估和选择建模方法。机器学习算法——极端梯度增强(XGBoost)和支持向量机(SVM)——与响应面方法(RSM)进行了比较评估。XGBoost对化学需氧量(COD)和有机磷(Porg)的预测最准确,而SVM对酸值(AN)的预测效果最好。K-means聚类确定了RSM模型可以有效替代XGBoost的特定输入域,在简单性和性能之间提供了平衡。GSA结果表明,油中磷的含量是影响Porg(有机磷)、COD(化学需氧量)和AN(酸值)的主要因素,皂料裂解系统的操作参数影响较小。利用遗传算法(NSGA II)进行了不确定条件下的多准则优化,结果表明磷含量的不确定度对皂料裂解操作条件的选择有显著影响。这些发现强调了准确的磷定量的重要性,并支持开发强大的、数据高效的计算工具,用于监测、预测和优化复杂的工业过程,如皂料分裂。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Water Resources and Industry
Water Resources and Industry Social Sciences-Geography, Planning and Development
CiteScore
8.10
自引率
5.90%
发文量
23
审稿时长
75 days
期刊介绍: Water Resources and Industry moves research to innovation by focusing on the role industry plays in the exploitation, management and treatment of water resources. Different industries use radically different water resources in their production processes, while they produce, treat and dispose a wide variety of wastewater qualities. Depending on the geographical location of the facilities, the impact on the local resources will vary, pre-empting the applicability of one single approach. The aims and scope of the journal include: -Industrial water footprint assessment - an evaluation of tools and methodologies -What constitutes good corporate governance and policy and how to evaluate water-related risk -What constitutes good stakeholder collaboration and engagement -New technologies enabling companies to better manage water resources -Integration of water and energy and of water treatment and production processes in industry
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信