Fernando Arrais R.D. Lima , Marcellus G.F. de Moraes , Carine Menezes Rebello , Amaro G. Barreto Jr. , Argimiro R. Secchi , Maurício B. de Souza Jr. , Idelfonso B.R. Nogueira
{"title":"Interpretable and uncertainty-aware machine learning for trustworthy prediction in batch crystallization","authors":"Fernando Arrais R.D. Lima , Marcellus G.F. de Moraes , Carine Menezes Rebello , Amaro G. Barreto Jr. , Argimiro R. Secchi , Maurício B. de Souza Jr. , Idelfonso B.R. Nogueira","doi":"10.1016/j.cep.2025.110350","DOIUrl":null,"url":null,"abstract":"<div><div>Symbolic regression was applied to model the nucleation and crystal growth in potassium sulfate batch crystallization. A methodology combining multivariable analysis, symbolic regression, meta-heuristic optimization, and statistical analysis was developed and compared with an alternative neural network-based approach. Experimental data were used to identify models via symbolic regression for predicting the first four moments of the crystal size distribution (CSD) and to train neural networks for the same purpose. Both methods efficiently modeled the crystallization process, achieving similar mean squared error (MSE) values. Additionally, solute concentration predictions were successful. The models were tested on simulated batches with supersaturation conditions not present in the experimental dataset. Both approaches performed comparably to the population balance model (PBM). However, symbolic regression required fewer parameters than neural networks. Parameter estimation for symbolic regression was done using particle swarm optimization (PSO), and confidence regions were constructed via Fisher’s test, revealing ellipsoidal shapes for most variables. For the zero-order moment, reparameterization achieved well-defined confidence regions. Symbolic regression provided interpretable and generalizable models that describe key phenomena, such as nucleation and crystal growth, aligning with population balance theory. These models offer advantages for modeling, controlling, and optimizing crystallization processes over traditional ODE systems.</div></div>","PeriodicalId":9929,"journal":{"name":"Chemical Engineering and Processing - Process Intensification","volume":"215 ","pages":"Article 110350"},"PeriodicalIF":3.8000,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical Engineering and Processing - Process Intensification","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0255270125001990","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Symbolic regression was applied to model the nucleation and crystal growth in potassium sulfate batch crystallization. A methodology combining multivariable analysis, symbolic regression, meta-heuristic optimization, and statistical analysis was developed and compared with an alternative neural network-based approach. Experimental data were used to identify models via symbolic regression for predicting the first four moments of the crystal size distribution (CSD) and to train neural networks for the same purpose. Both methods efficiently modeled the crystallization process, achieving similar mean squared error (MSE) values. Additionally, solute concentration predictions were successful. The models were tested on simulated batches with supersaturation conditions not present in the experimental dataset. Both approaches performed comparably to the population balance model (PBM). However, symbolic regression required fewer parameters than neural networks. Parameter estimation for symbolic regression was done using particle swarm optimization (PSO), and confidence regions were constructed via Fisher’s test, revealing ellipsoidal shapes for most variables. For the zero-order moment, reparameterization achieved well-defined confidence regions. Symbolic regression provided interpretable and generalizable models that describe key phenomena, such as nucleation and crystal growth, aligning with population balance theory. These models offer advantages for modeling, controlling, and optimizing crystallization processes over traditional ODE systems.
期刊介绍:
Chemical Engineering and Processing: Process Intensification is intended for practicing researchers in industry and academia, working in the field of Process Engineering and related to the subject of Process Intensification.Articles published in the Journal demonstrate how novel discoveries, developments and theories in the field of Process Engineering and in particular Process Intensification may be used for analysis and design of innovative equipment and processing methods with substantially improved sustainability, efficiency and environmental performance.