{"title":"基于COSMO筛选电荷密度的表面活性剂表征用于物理信息神经网络(PINN)吸附等温线预测","authors":"Achmad Anggawirya Alimin, Kattariya Srasamran, Wanutchaya Yuenyong, Ampira Charoensaeng, Bor-Jier Shiau, Uthaiporn Suriyapraphadilok","doi":"10.1186/s13321-025-01027-y","DOIUrl":null,"url":null,"abstract":"<div><p>Predicting surfactant adsorption using the currently available isotherm model is limited to one or two independent variables: equilibrium concentration and temperature. This study aims to develop an adsorption model that includes molecular features, testing conditions, and solid properties in the model. A Physics-Informed Neural Network (PINN) was structured by integrating adsorption isotherm into artificial neural networks (ANN). The model was trained using a dataset containing 56 adsorption isotherms and 20 types of anionic and nonionic surfactants under various conditions with sand and silica oxide as their solids. The surfactants were quantified using sets of descriptors generated from molecular counting, charge distribution, and Conductor-like Screening Model (COSMO) screened charge density. The COSMO-screened charge density descriptors provide the highest accuracy in representing the surfactant molecule. The interpretation of molecular structure effect and surfactant-solid interaction described using COSMO-screened charge density showed that adsorption between the surfactant and solid media involves hydrogen bonding and hydrophobic interaction. The PINN model achieves high accuracy with 93% training and 85% validation with fivefold cross-validation. Later, the model was evaluated and used to generate an adsorption isotherm and predict unseen surfactant adsorption. Adsorption prediction with unseen surfactants showed high accuracy with the surfactant for familiar structure (RMSE 0.07 mg/g) and promising profile for the whole new structure (RMSE 2.95 mg/g). <b>Scientific contribution</b> This study advances the field by integrating COSMO-screened charge density descriptors into a physics-informed deep learning model to predict surfactant adsorption isotherms, accounting for molecular features, testing conditions, and solid properties. The incorporation of COSMO-screened charge density offers a novel approach to accurately represent surfactant molecules, enabling accurate prediction of their adsorption behavior. This approach extends conventional models, which are often limited to empirical parameters or fewer variables. This physics-informed framework significantly enhances the understanding of surfactant-solid interactions and offers a robust predictive tool for optimizing surfactant formulations, aiming to minimize adsorption losses in chemical enhanced oil recovery and environmental remediation.</p></div>","PeriodicalId":617,"journal":{"name":"Journal of Cheminformatics","volume":"17 1","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://jcheminf.biomedcentral.com/counter/pdf/10.1186/s13321-025-01027-y","citationCount":"0","resultStr":"{\"title\":\"Surfactant representation using COSMO screened charge density for adsorption isotherm prediction using Physics-Informed Neural Network (PINN)\",\"authors\":\"Achmad Anggawirya Alimin, Kattariya Srasamran, Wanutchaya Yuenyong, Ampira Charoensaeng, Bor-Jier Shiau, Uthaiporn Suriyapraphadilok\",\"doi\":\"10.1186/s13321-025-01027-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Predicting surfactant adsorption using the currently available isotherm model is limited to one or two independent variables: equilibrium concentration and temperature. This study aims to develop an adsorption model that includes molecular features, testing conditions, and solid properties in the model. A Physics-Informed Neural Network (PINN) was structured by integrating adsorption isotherm into artificial neural networks (ANN). The model was trained using a dataset containing 56 adsorption isotherms and 20 types of anionic and nonionic surfactants under various conditions with sand and silica oxide as their solids. The surfactants were quantified using sets of descriptors generated from molecular counting, charge distribution, and Conductor-like Screening Model (COSMO) screened charge density. The COSMO-screened charge density descriptors provide the highest accuracy in representing the surfactant molecule. The interpretation of molecular structure effect and surfactant-solid interaction described using COSMO-screened charge density showed that adsorption between the surfactant and solid media involves hydrogen bonding and hydrophobic interaction. The PINN model achieves high accuracy with 93% training and 85% validation with fivefold cross-validation. Later, the model was evaluated and used to generate an adsorption isotherm and predict unseen surfactant adsorption. Adsorption prediction with unseen surfactants showed high accuracy with the surfactant for familiar structure (RMSE 0.07 mg/g) and promising profile for the whole new structure (RMSE 2.95 mg/g). <b>Scientific contribution</b> This study advances the field by integrating COSMO-screened charge density descriptors into a physics-informed deep learning model to predict surfactant adsorption isotherms, accounting for molecular features, testing conditions, and solid properties. The incorporation of COSMO-screened charge density offers a novel approach to accurately represent surfactant molecules, enabling accurate prediction of their adsorption behavior. This approach extends conventional models, which are often limited to empirical parameters or fewer variables. This physics-informed framework significantly enhances the understanding of surfactant-solid interactions and offers a robust predictive tool for optimizing surfactant formulations, aiming to minimize adsorption losses in chemical enhanced oil recovery and environmental remediation.</p></div>\",\"PeriodicalId\":617,\"journal\":{\"name\":\"Journal of Cheminformatics\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2025-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://jcheminf.biomedcentral.com/counter/pdf/10.1186/s13321-025-01027-y\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Cheminformatics\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://link.springer.com/article/10.1186/s13321-025-01027-y\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cheminformatics","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1186/s13321-025-01027-y","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Surfactant representation using COSMO screened charge density for adsorption isotherm prediction using Physics-Informed Neural Network (PINN)
Predicting surfactant adsorption using the currently available isotherm model is limited to one or two independent variables: equilibrium concentration and temperature. This study aims to develop an adsorption model that includes molecular features, testing conditions, and solid properties in the model. A Physics-Informed Neural Network (PINN) was structured by integrating adsorption isotherm into artificial neural networks (ANN). The model was trained using a dataset containing 56 adsorption isotherms and 20 types of anionic and nonionic surfactants under various conditions with sand and silica oxide as their solids. The surfactants were quantified using sets of descriptors generated from molecular counting, charge distribution, and Conductor-like Screening Model (COSMO) screened charge density. The COSMO-screened charge density descriptors provide the highest accuracy in representing the surfactant molecule. The interpretation of molecular structure effect and surfactant-solid interaction described using COSMO-screened charge density showed that adsorption between the surfactant and solid media involves hydrogen bonding and hydrophobic interaction. The PINN model achieves high accuracy with 93% training and 85% validation with fivefold cross-validation. Later, the model was evaluated and used to generate an adsorption isotherm and predict unseen surfactant adsorption. Adsorption prediction with unseen surfactants showed high accuracy with the surfactant for familiar structure (RMSE 0.07 mg/g) and promising profile for the whole new structure (RMSE 2.95 mg/g). Scientific contribution This study advances the field by integrating COSMO-screened charge density descriptors into a physics-informed deep learning model to predict surfactant adsorption isotherms, accounting for molecular features, testing conditions, and solid properties. The incorporation of COSMO-screened charge density offers a novel approach to accurately represent surfactant molecules, enabling accurate prediction of their adsorption behavior. This approach extends conventional models, which are often limited to empirical parameters or fewer variables. This physics-informed framework significantly enhances the understanding of surfactant-solid interactions and offers a robust predictive tool for optimizing surfactant formulations, aiming to minimize adsorption losses in chemical enhanced oil recovery and environmental remediation.
期刊介绍:
Journal of Cheminformatics is an open access journal publishing original peer-reviewed research in all aspects of cheminformatics and molecular modelling.
Coverage includes, but is not limited to:
chemical information systems, software and databases, and molecular modelling,
chemical structure representations and their use in structure, substructure, and similarity searching of chemical substance and chemical reaction databases,
computer and molecular graphics, computer-aided molecular design, expert systems, QSAR, and data mining techniques.