Jeffrey G. Ethier, Rohan K. Casukhela, Joshua J. Latimer, Matthew D. Jacobsen, Boris Rasin, Maneesh K. Gupta, Luke A. Baldwin and Richard A. Vaia*,
{"title":"Predicting Phase Behavior of Linear Polymers in Solution Using Machine Learning","authors":"Jeffrey G. Ethier, Rohan K. Casukhela, Joshua J. Latimer, Matthew D. Jacobsen, Boris Rasin, Maneesh K. Gupta, Luke A. Baldwin and Richard A. Vaia*, ","doi":"10.1021/acs.macromol.2c00245","DOIUrl":null,"url":null,"abstract":"<p >The phase behavior of polymers in solution is crucial to many applications in polymer processing, synthesis, self-assembly, and purification. Quantitative prediction of polymer solubility space for an arbitrary polymer–solvent pair and across a large composition range is challenging. Qualitative agreement is provided by many current theoretical models, but only a portion of the phase space is quantitatively predicted. Here, we utilize a curated database for binary polymer solutions comprised of 21 linear polymers, 61 solvents, and 97 unique polymer–solvent combinations (6524 cloud point temperatures) to construct phase diagrams from machine learning predictions. A generalizable feature vector is developed that includes component descriptors concatenated with state variables and an experimental data descriptor (phase direction). The impact of several types of descriptors (Morgan fingerprints, molecular descriptors, and Hansen solubility parameters) to encode polymer–solvent interactions is assessed. Hansen solubility parameters are also introduced as a means to understand the general breadth of the linear polymer–solvent space as well as the density and distribution of curated data. Two common regression algorithms (XGBoost and neural networks) establish the generality of the descriptors; provide a root mean squared error (RMSE) within 3 °C for predicted cloud points in the test set; and offer excellent agreement with upper and lower critical solubility curves, isopleths, and closed-loop phase behavior by a single model. The ability to extrapolate to polymers that are very dissimilar from the curated data is poor, but with as little as 20 cloud points or a single phase boundary, RMSE error of predictions are within 5 °C. This implies that the current model captures aspects of the underlying physics and can readily exploit correlations to reduce required data for additional polymer–solvent pairs. Finally, the model and data are accessible via the Polymer Property Predictor and Database (3PDb).</p>","PeriodicalId":51,"journal":{"name":"Macromolecules","volume":"55 7","pages":"2691–2702"},"PeriodicalIF":5.2000,"publicationDate":"2022-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Macromolecules","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.macromol.2c00245","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"POLYMER SCIENCE","Score":null,"Total":0}
引用次数: 13
Abstract
The phase behavior of polymers in solution is crucial to many applications in polymer processing, synthesis, self-assembly, and purification. Quantitative prediction of polymer solubility space for an arbitrary polymer–solvent pair and across a large composition range is challenging. Qualitative agreement is provided by many current theoretical models, but only a portion of the phase space is quantitatively predicted. Here, we utilize a curated database for binary polymer solutions comprised of 21 linear polymers, 61 solvents, and 97 unique polymer–solvent combinations (6524 cloud point temperatures) to construct phase diagrams from machine learning predictions. A generalizable feature vector is developed that includes component descriptors concatenated with state variables and an experimental data descriptor (phase direction). The impact of several types of descriptors (Morgan fingerprints, molecular descriptors, and Hansen solubility parameters) to encode polymer–solvent interactions is assessed. Hansen solubility parameters are also introduced as a means to understand the general breadth of the linear polymer–solvent space as well as the density and distribution of curated data. Two common regression algorithms (XGBoost and neural networks) establish the generality of the descriptors; provide a root mean squared error (RMSE) within 3 °C for predicted cloud points in the test set; and offer excellent agreement with upper and lower critical solubility curves, isopleths, and closed-loop phase behavior by a single model. The ability to extrapolate to polymers that are very dissimilar from the curated data is poor, but with as little as 20 cloud points or a single phase boundary, RMSE error of predictions are within 5 °C. This implies that the current model captures aspects of the underlying physics and can readily exploit correlations to reduce required data for additional polymer–solvent pairs. Finally, the model and data are accessible via the Polymer Property Predictor and Database (3PDb).
期刊介绍:
Macromolecules publishes original, fundamental, and impactful research on all aspects of polymer science. Topics of interest include synthesis (e.g., controlled polymerizations, polymerization catalysis, post polymerization modification, new monomer structures and polymer architectures, and polymerization mechanisms/kinetics analysis); phase behavior, thermodynamics, dynamic, and ordering/disordering phenomena (e.g., self-assembly, gelation, crystallization, solution/melt/solid-state characteristics); structure and properties (e.g., mechanical and rheological properties, surface/interfacial characteristics, electronic and transport properties); new state of the art characterization (e.g., spectroscopy, scattering, microscopy, rheology), simulation (e.g., Monte Carlo, molecular dynamics, multi-scale/coarse-grained modeling), and theoretical methods. Renewable/sustainable polymers, polymer networks, responsive polymers, electro-, magneto- and opto-active macromolecules, inorganic polymers, charge-transporting polymers (ion-containing, semiconducting, and conducting), nanostructured polymers, and polymer composites are also of interest. Typical papers published in Macromolecules showcase important and innovative concepts, experimental methods/observations, and theoretical/computational approaches that demonstrate a fundamental advance in the understanding of polymers.