Predicting Phase Behavior of Linear Polymers in Solution Using Machine Learning

IF 5.2 1区 化学 Q1 POLYMER SCIENCE
Jeffrey G. Ethier, Rohan K. Casukhela, Joshua J. Latimer, Matthew D. Jacobsen, Boris Rasin, Maneesh K. Gupta, Luke A. Baldwin and Richard A. Vaia*, 
{"title":"Predicting Phase Behavior of Linear Polymers in Solution Using Machine Learning","authors":"Jeffrey G. Ethier,&nbsp;Rohan K. Casukhela,&nbsp;Joshua J. Latimer,&nbsp;Matthew D. Jacobsen,&nbsp;Boris Rasin,&nbsp;Maneesh K. Gupta,&nbsp;Luke A. Baldwin and Richard A. Vaia*,&nbsp;","doi":"10.1021/acs.macromol.2c00245","DOIUrl":null,"url":null,"abstract":"<p >The phase behavior of polymers in solution is crucial to many applications in polymer processing, synthesis, self-assembly, and purification. Quantitative prediction of polymer solubility space for an arbitrary polymer–solvent pair and across a large composition range is challenging. Qualitative agreement is provided by many current theoretical models, but only a portion of the phase space is quantitatively predicted. Here, we utilize a curated database for binary polymer solutions comprised of 21 linear polymers, 61 solvents, and 97 unique polymer–solvent combinations (6524 cloud point temperatures) to construct phase diagrams from machine learning predictions. A generalizable feature vector is developed that includes component descriptors concatenated with state variables and an experimental data descriptor (phase direction). The impact of several types of descriptors (Morgan fingerprints, molecular descriptors, and Hansen solubility parameters) to encode polymer–solvent interactions is assessed. Hansen solubility parameters are also introduced as a means to understand the general breadth of the linear polymer–solvent space as well as the density and distribution of curated data. Two common regression algorithms (XGBoost and neural networks) establish the generality of the descriptors; provide a root mean squared error (RMSE) within 3 °C for predicted cloud points in the test set; and offer excellent agreement with upper and lower critical solubility curves, isopleths, and closed-loop phase behavior by a single model. The ability to extrapolate to polymers that are very dissimilar from the curated data is poor, but with as little as 20 cloud points or a single phase boundary, RMSE error of predictions are within 5 °C. This implies that the current model captures aspects of the underlying physics and can readily exploit correlations to reduce required data for additional polymer–solvent pairs. Finally, the model and data are accessible via the Polymer Property Predictor and Database (3PDb).</p>","PeriodicalId":51,"journal":{"name":"Macromolecules","volume":"55 7","pages":"2691–2702"},"PeriodicalIF":5.2000,"publicationDate":"2022-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Macromolecules","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.macromol.2c00245","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"POLYMER SCIENCE","Score":null,"Total":0}
引用次数: 13

Abstract

The phase behavior of polymers in solution is crucial to many applications in polymer processing, synthesis, self-assembly, and purification. Quantitative prediction of polymer solubility space for an arbitrary polymer–solvent pair and across a large composition range is challenging. Qualitative agreement is provided by many current theoretical models, but only a portion of the phase space is quantitatively predicted. Here, we utilize a curated database for binary polymer solutions comprised of 21 linear polymers, 61 solvents, and 97 unique polymer–solvent combinations (6524 cloud point temperatures) to construct phase diagrams from machine learning predictions. A generalizable feature vector is developed that includes component descriptors concatenated with state variables and an experimental data descriptor (phase direction). The impact of several types of descriptors (Morgan fingerprints, molecular descriptors, and Hansen solubility parameters) to encode polymer–solvent interactions is assessed. Hansen solubility parameters are also introduced as a means to understand the general breadth of the linear polymer–solvent space as well as the density and distribution of curated data. Two common regression algorithms (XGBoost and neural networks) establish the generality of the descriptors; provide a root mean squared error (RMSE) within 3 °C for predicted cloud points in the test set; and offer excellent agreement with upper and lower critical solubility curves, isopleths, and closed-loop phase behavior by a single model. The ability to extrapolate to polymers that are very dissimilar from the curated data is poor, but with as little as 20 cloud points or a single phase boundary, RMSE error of predictions are within 5 °C. This implies that the current model captures aspects of the underlying physics and can readily exploit correlations to reduce required data for additional polymer–solvent pairs. Finally, the model and data are accessible via the Polymer Property Predictor and Database (3PDb).

Abstract Image

利用机器学习预测线性聚合物在溶液中的相行为
聚合物在溶液中的相行为对聚合物加工、合成、自组装和纯化等领域的许多应用至关重要。定量预测任意聚合物-溶剂对和大组成范围内的聚合物溶解度空间是具有挑战性的。许多现有的理论模型都提供了定性的一致性,但只有一部分相空间是定量预测的。在这里,我们利用一个由21种线性聚合物、61种溶剂和97种独特的聚合物溶剂组合(6524个云点温度)组成的二元聚合物溶液的策划数据库,从机器学习预测中构建相图。提出了一种广义特征向量,该特征向量包括连接状态变量的分量描述符和一个实验数据描述符(相位方向)。评估了几种类型的描述符(摩根指纹、分子描述符和汉森溶解度参数)对聚合物-溶剂相互作用编码的影响。汉森溶解度参数也被引入,作为一种手段,了解线性聚合物-溶剂空间的一般宽度,以及密度和分布的策划数据。两种常见的回归算法(XGBoost和神经网络)建立了描述符的通用性;为测试集中的预测云点提供3°C内的均方根误差(RMSE);并且通过单一模型与上下限临界溶解度曲线、等容线和闭环相行为具有很好的一致性。外推到与整理数据非常不同的聚合物的能力很差,但只有20个云点或单相边界,预测的RMSE误差在5°C以内。这意味着当前的模型捕获了基础物理的各个方面,并且可以很容易地利用相关性来减少额外的聚合物-溶剂对所需的数据。最后,模型和数据可通过聚合物属性预测和数据库(3PDb)访问。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Macromolecules
Macromolecules 工程技术-高分子科学
CiteScore
9.30
自引率
16.40%
发文量
942
审稿时长
2 months
期刊介绍: Macromolecules publishes original, fundamental, and impactful research on all aspects of polymer science. Topics of interest include synthesis (e.g., controlled polymerizations, polymerization catalysis, post polymerization modification, new monomer structures and polymer architectures, and polymerization mechanisms/kinetics analysis); phase behavior, thermodynamics, dynamic, and ordering/disordering phenomena (e.g., self-assembly, gelation, crystallization, solution/melt/solid-state characteristics); structure and properties (e.g., mechanical and rheological properties, surface/interfacial characteristics, electronic and transport properties); new state of the art characterization (e.g., spectroscopy, scattering, microscopy, rheology), simulation (e.g., Monte Carlo, molecular dynamics, multi-scale/coarse-grained modeling), and theoretical methods. Renewable/sustainable polymers, polymer networks, responsive polymers, electro-, magneto- and opto-active macromolecules, inorganic polymers, charge-transporting polymers (ion-containing, semiconducting, and conducting), nanostructured polymers, and polymer composites are also of interest. Typical papers published in Macromolecules showcase important and innovative concepts, experimental methods/observations, and theoretical/computational approaches that demonstrate a fundamental advance in the understanding of polymers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信