{"title":"Prediction of the density of aqueous electrolyte solutions with matrix completion methods","authors":"Maximilian Kohns, Pascal Zittlau, Fabian Jirasek","doi":"10.1016/j.fluid.2025.114454","DOIUrl":null,"url":null,"abstract":"<div><div>Information on the density of electrolyte solutions is important for many processes in chemistry and chemical engineering. However, experimental data are scarce, and broadly applicable prediction methods that can extrapolate to unstudied electrolytes have been unavailable until now. In the present work, we introduce a novel approach for predicting the densities of aqueous solutions of 720 single electrolytes at 298.15 K based on the machine-learning concept of matrix completion. The studied electrolytes belong to the valency classes 1:1, 2:1, 1:2, 3:1, 2:2, and 3:2; individual ion concentrations up to 0.1 mol/mol are considered. We arrange the available density data for these electrolytes composed of 40 cations and 18 anions in a matrix, where the columns and rows denote the cations and anions, respectively. In the literature, experimental data are available for only 181 of all 720 electrolytes. This makes the prediction for the other electrolytes a matrix completion problem, which we address using probabilistic matrix factorization. To account for the concentration dependence of the density, a dimensionality reduction is carried out by representing the density as a linear function of the mole fraction-based ionic strength, a correlation found to be very accurate for all considered electrolytes. As a result, a sparse matrix containing the scalar slope of that linear function is obtained. Two matrix completion methods (MCMs) are introduced: a purely data-driven one trained only on the available density data and a hierarchical model that includes the ions’ valencies as side information. The performance of both models is evaluated on unseen test data, with the hierarchical MCM providing very accurate predictions: When averaging the relative deviations for all density data points for a certain electrolyte, an average deviation of 0.96 % is obtained. Moreover, we show that the MCM parameters learned during training are physically interpretable, as their values align with descriptors such as an ion’s charge density.</div></div>","PeriodicalId":12170,"journal":{"name":"Fluid Phase Equilibria","volume":"597 ","pages":"Article 114454"},"PeriodicalIF":2.8000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fluid Phase Equilibria","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378381225001244","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Information on the density of electrolyte solutions is important for many processes in chemistry and chemical engineering. However, experimental data are scarce, and broadly applicable prediction methods that can extrapolate to unstudied electrolytes have been unavailable until now. In the present work, we introduce a novel approach for predicting the densities of aqueous solutions of 720 single electrolytes at 298.15 K based on the machine-learning concept of matrix completion. The studied electrolytes belong to the valency classes 1:1, 2:1, 1:2, 3:1, 2:2, and 3:2; individual ion concentrations up to 0.1 mol/mol are considered. We arrange the available density data for these electrolytes composed of 40 cations and 18 anions in a matrix, where the columns and rows denote the cations and anions, respectively. In the literature, experimental data are available for only 181 of all 720 electrolytes. This makes the prediction for the other electrolytes a matrix completion problem, which we address using probabilistic matrix factorization. To account for the concentration dependence of the density, a dimensionality reduction is carried out by representing the density as a linear function of the mole fraction-based ionic strength, a correlation found to be very accurate for all considered electrolytes. As a result, a sparse matrix containing the scalar slope of that linear function is obtained. Two matrix completion methods (MCMs) are introduced: a purely data-driven one trained only on the available density data and a hierarchical model that includes the ions’ valencies as side information. The performance of both models is evaluated on unseen test data, with the hierarchical MCM providing very accurate predictions: When averaging the relative deviations for all density data points for a certain electrolyte, an average deviation of 0.96 % is obtained. Moreover, we show that the MCM parameters learned during training are physically interpretable, as their values align with descriptors such as an ion’s charge density.
期刊介绍:
Fluid Phase Equilibria publishes high-quality papers dealing with experimental, theoretical, and applied research related to equilibrium and transport properties of fluids, solids, and interfaces. Subjects of interest include physical/phase and chemical equilibria; equilibrium and nonequilibrium thermophysical properties; fundamental thermodynamic relations; and stability. The systems central to the journal include pure substances and mixtures of organic and inorganic materials, including polymers, biochemicals, and surfactants with sufficient characterization of composition and purity for the results to be reproduced. Alloys are of interest only when thermodynamic studies are included, purely material studies will not be considered. In all cases, authors are expected to provide physical or chemical interpretations of the results.
Experimental research can include measurements under all conditions of temperature, pressure, and composition, including critical and supercritical. Measurements are to be associated with systems and conditions of fundamental or applied interest, and may not be only a collection of routine data, such as physical property or solubility measurements at limited pressures and temperatures close to ambient, or surfactant studies focussed strictly on micellisation or micelle structure. Papers reporting common data must be accompanied by new physical insights and/or contemporary or new theory or techniques.