预测部分水解聚丙烯酰胺衍生物粘度的机器学习方法比较

Kelly Cristine Da Silveira, Matheus Henrique Silva Siqueira, João Matheus Ramos Gama, Jonathan Nogueira Gois, Cláudio Fabiano Motta Toledo, A. S. Silva Neto
{"title":"预测部分水解聚丙烯酰胺衍生物粘度的机器学习方法比较","authors":"Kelly Cristine Da Silveira, Matheus Henrique Silva Siqueira, João Matheus Ramos Gama, Jonathan Nogueira Gois, Cláudio Fabiano Motta Toledo, A. S. Silva Neto","doi":"10.14295/vetor.v33i1.15157","DOIUrl":null,"url":null,"abstract":"Partially hydrolyzed polyacrylamides (HPAM) are widely used to modulate the viscosity of formulations. The appropriate application of a viscosity model can facilitate the idealization of new macromolecules and contribute to a better understanding of the structure-property relationship. In the present study, machine learning approaches, Multiple Linear Regression (MLR) and Random Forest (RF), were compared to model the viscosifying effect of HPAM derivatives, based on their chemical composition and concentration in an aqueous solution. The evaluated data come from a previous experimental study, which explores a post-synthetic polymer modification methodology. The relative importance of the variables was investigated, determining the features with the greatest influence on viscosity, including variations in chemical composition, with emphasis on the more hydrophobic groups (C7 and C12). The accuracy of the models was evaluated using statistical criteria, the coefficient of determination (R2) and the Root Mean Square Error (RMSE). The Random Forest approach outperformed Multiple Linear Regression, with values of 0.97 and 0.30 for R2 and RMSE, respectively, compared to 0.83 and 0.67 for Multiple Linear Regression. Applying the Random Forest model, it was possible to generate a set of hypothetical macromolecules, with potential viscosifying effects. These macromolecules were idealized focusing on mixed compositions of C7 and C12 with a maximum structural variation of 10 mol%. Additionally, this structural mapping provided insights for designing promising polymers by inserting cyclic structures, such as CYCLOPROP, which could overcome the solubility limitation observed in the literature.","PeriodicalId":258655,"journal":{"name":"VETOR - Revista de Ciências Exatas e Engenharias","volume":"47 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Comparison of Machine Learning Approaches in Predicting Viscosity for Partially Hydrolyzed Polyacrylamide Derivatives\",\"authors\":\"Kelly Cristine Da Silveira, Matheus Henrique Silva Siqueira, João Matheus Ramos Gama, Jonathan Nogueira Gois, Cláudio Fabiano Motta Toledo, A. S. Silva Neto\",\"doi\":\"10.14295/vetor.v33i1.15157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Partially hydrolyzed polyacrylamides (HPAM) are widely used to modulate the viscosity of formulations. The appropriate application of a viscosity model can facilitate the idealization of new macromolecules and contribute to a better understanding of the structure-property relationship. In the present study, machine learning approaches, Multiple Linear Regression (MLR) and Random Forest (RF), were compared to model the viscosifying effect of HPAM derivatives, based on their chemical composition and concentration in an aqueous solution. The evaluated data come from a previous experimental study, which explores a post-synthetic polymer modification methodology. The relative importance of the variables was investigated, determining the features with the greatest influence on viscosity, including variations in chemical composition, with emphasis on the more hydrophobic groups (C7 and C12). The accuracy of the models was evaluated using statistical criteria, the coefficient of determination (R2) and the Root Mean Square Error (RMSE). The Random Forest approach outperformed Multiple Linear Regression, with values of 0.97 and 0.30 for R2 and RMSE, respectively, compared to 0.83 and 0.67 for Multiple Linear Regression. Applying the Random Forest model, it was possible to generate a set of hypothetical macromolecules, with potential viscosifying effects. These macromolecules were idealized focusing on mixed compositions of C7 and C12 with a maximum structural variation of 10 mol%. Additionally, this structural mapping provided insights for designing promising polymers by inserting cyclic structures, such as CYCLOPROP, which could overcome the solubility limitation observed in the literature.\",\"PeriodicalId\":258655,\"journal\":{\"name\":\"VETOR - Revista de Ciências Exatas e Engenharias\",\"volume\":\"47 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"VETOR - Revista de Ciências Exatas e Engenharias\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.14295/vetor.v33i1.15157\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"VETOR - Revista de Ciências Exatas e Engenharias","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14295/vetor.v33i1.15157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

部分水解聚丙烯酰胺(HPAM)被广泛用于调节配方粘度。粘度模型的适当应用可促进新大分子的理想化,并有助于更好地理解结构与性能之间的关系。在本研究中,比较了多重线性回归(MLR)和随机森林(RF)这两种机器学习方法,根据 HPAM 衍生物在水溶液中的化学成分和浓度,为其增粘效果建立模型。所评估的数据来自之前的一项实验研究,该研究探索了一种后合成聚合物改性方法。研究调查了变量的相对重要性,确定了对粘度影响最大的特征,包括化学成分的变化,重点是疏水基团(C7 和 C12)。使用统计标准、决定系数(R2)和均方根误差(RMSE)对模型的准确性进行了评估。随机森林方法优于多元线性回归方法,R2 和 RMSE 值分别为 0.97 和 0.30,而多元线性回归方法分别为 0.83 和 0.67。应用随机森林模型,可以生成一组具有潜在增粘效果的假定大分子。这些大分子是理想化的,侧重于 C7 和 C12 的混合成分,最大结构变化为 10 摩尔%。此外,该结构图还为通过插入环状结构(如 CYCLOPROP)设计有前景的聚合物提供了启示,这种结构可以克服文献中观察到的溶解度限制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Comparison of Machine Learning Approaches in Predicting Viscosity for Partially Hydrolyzed Polyacrylamide Derivatives
Partially hydrolyzed polyacrylamides (HPAM) are widely used to modulate the viscosity of formulations. The appropriate application of a viscosity model can facilitate the idealization of new macromolecules and contribute to a better understanding of the structure-property relationship. In the present study, machine learning approaches, Multiple Linear Regression (MLR) and Random Forest (RF), were compared to model the viscosifying effect of HPAM derivatives, based on their chemical composition and concentration in an aqueous solution. The evaluated data come from a previous experimental study, which explores a post-synthetic polymer modification methodology. The relative importance of the variables was investigated, determining the features with the greatest influence on viscosity, including variations in chemical composition, with emphasis on the more hydrophobic groups (C7 and C12). The accuracy of the models was evaluated using statistical criteria, the coefficient of determination (R2) and the Root Mean Square Error (RMSE). The Random Forest approach outperformed Multiple Linear Regression, with values of 0.97 and 0.30 for R2 and RMSE, respectively, compared to 0.83 and 0.67 for Multiple Linear Regression. Applying the Random Forest model, it was possible to generate a set of hypothetical macromolecules, with potential viscosifying effects. These macromolecules were idealized focusing on mixed compositions of C7 and C12 with a maximum structural variation of 10 mol%. Additionally, this structural mapping provided insights for designing promising polymers by inserting cyclic structures, such as CYCLOPROP, which could overcome the solubility limitation observed in the literature.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信