Machine learning for determination of activity of water and activity coefficients of electrolytes in binary solutions

Guillaume Zante
{"title":"Machine learning for determination of activity of water and activity coefficients of electrolytes in binary solutions","authors":"Guillaume Zante","doi":"10.1016/j.aichem.2024.100069","DOIUrl":null,"url":null,"abstract":"<div><p>Activity of water and electrolytes in aqueous solutions is of utmost importance for multiple industrial applications. However, experimental determination of such values is time-consuming, while calculation of activity coefficients using numerical methods is challenging. By training neural networks models on literature data, one could predict activity of water and electrolytes easily, without requiring any experiment. In this paper, multiple descriptors (or features) are compared to predict activity coefficients of electrolytes and activity of water in electrolyte solutions. A neural network based on the Levenberg-Marquardt algorithm (LM-NN) showed high accuracy to calculate values, despite the small size of the training datasets. Both activity coefficients of electrolytes and activity of water in electrolyte solutions can be predicted accurately even on unseen data, using simple descriptors such as electrolyte concentration, ion sizes and charges. However, some discrepancies were observed due to the lack of representativeness of the training dataset. This could be solved by selecting training data sets that are similar (<em>e.g.</em> same group of the periodic table) to the unknown values, or by including available experimental data for the salt considered. The ability of the LM-NN to solve non-linear least square curve fitting problems makes it a good candidate to fit experimental activity coefficient data, with the advantage of simplicity as compared to e-NRTL or UNIQUAC methods. This method paves the way for accurate and quick determination of thermodynamic data for electrolyte solutions (and beyond) using machine learning, without necessitating large training datasets.</p></div>","PeriodicalId":72302,"journal":{"name":"Artificial intelligence chemistry","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949747724000277/pdfft?md5=e459d6b44dc65480007e2791c28b76ea&pid=1-s2.0-S2949747724000277-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence chemistry","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949747724000277","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Activity of water and electrolytes in aqueous solutions is of utmost importance for multiple industrial applications. However, experimental determination of such values is time-consuming, while calculation of activity coefficients using numerical methods is challenging. By training neural networks models on literature data, one could predict activity of water and electrolytes easily, without requiring any experiment. In this paper, multiple descriptors (or features) are compared to predict activity coefficients of electrolytes and activity of water in electrolyte solutions. A neural network based on the Levenberg-Marquardt algorithm (LM-NN) showed high accuracy to calculate values, despite the small size of the training datasets. Both activity coefficients of electrolytes and activity of water in electrolyte solutions can be predicted accurately even on unseen data, using simple descriptors such as electrolyte concentration, ion sizes and charges. However, some discrepancies were observed due to the lack of representativeness of the training dataset. This could be solved by selecting training data sets that are similar (e.g. same group of the periodic table) to the unknown values, or by including available experimental data for the salt considered. The ability of the LM-NN to solve non-linear least square curve fitting problems makes it a good candidate to fit experimental activity coefficient data, with the advantage of simplicity as compared to e-NRTL or UNIQUAC methods. This method paves the way for accurate and quick determination of thermodynamic data for electrolyte solutions (and beyond) using machine learning, without necessitating large training datasets.

通过机器学习确定二元溶液中水的活度和电解质的活度系数
水和电解质在水溶液中的活度对多种工业应用至关重要。然而,通过实验确定这些值非常耗时,而使用数值方法计算活度系数又极具挑战性。通过对文献数据进行神经网络模型训练,人们可以轻松预测水和电解质的活性,而无需进行任何实验。本文比较了多种描述符(或特征)来预测电解质的活度系数和电解质溶液中水的活度。基于 Levenberg-Marquardt 算法的神经网络(LM-NN)显示,尽管训练数据集的规模较小,但计算值的准确性很高。使用简单的描述符(如电解质浓度、离子大小和电荷),电解质的活度系数和电解质溶液中水的活度即使在未见过的数据上也能准确预测。不过,由于训练数据集缺乏代表性,也出现了一些差异。要解决这个问题,可以选择与未知值相似(如元素周期表中的同一组)的训练数据集,或加入所考虑盐类的可用实验数据。LM-NN 解决非线性最小平方曲线拟合问题的能力使其成为拟合实验活性系数数据的良好候选方法,与 e-NRTL 或 UNIQUAC 方法相比,它具有简单的优势。这种方法为利用机器学习准确、快速地确定电解质溶液(及其他)的热力学数据铺平了道路,而无需大量的训练数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Artificial intelligence chemistry
Artificial intelligence chemistry Chemistry (General)
自引率
0.00%
发文量
0
审稿时长
21 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信