Jia-Xi Chang, Jian-Wei Zou, Chao-Yuan Lou, Jia-Xin Ye, Rui Feng, Zi-Yuan Li, Gui-Xiang Hu
{"title":"Gas-to-ionic liquid partition: QSPR modeling and mechanistic interpretation.","authors":"Jia-Xi Chang, Jian-Wei Zou, Chao-Yuan Lou, Jia-Xin Ye, Rui Feng, Zi-Yuan Li, Gui-Xiang Hu","doi":"10.1002/minf.202200223","DOIUrl":null,"url":null,"abstract":"<p><p>The present work was devoted to explore the quantitative structure-property relationships for gas-to-ionic liquid partition coefficients (log K<sub>ILA</sub> ). A series of linear models were first established for the representative dataset (IL01). The optimal model was a four-parameter equation (1Ed) consisting of two electrostatic potential-based descriptors ( <math> <semantics><mrow><mi>Σ</mi> <msubsup><mi>V</mi> <mrow><mi>s</mi> <mo>,</mo> <mi>i</mi> <mi>n</mi> <mi>d</mi></mrow> <mo>-</mo></msubsup> </mrow> <annotation>${{\\rm { \\Sigma }}{V}_{s,ind}^{-}}$</annotation> </semantics> </math> and V<sub>s,max</sub> ), one 2D matrix-based descriptor (J_D/Dt) and dipole moment (μ). All of the four descriptors introduced in the model can find the corresponding parameters, directly or indirectly, from Abraham's linear solvation energy relationship (LSER) or its theoretical alternatives, which endows the model good interpretability. Gaussian process was utilized to build the nonlinear model. Systematical validations, including 5-fold cross-validation for the training set, the validation for test set, as well as a more rigorous Monte Carlo cross-validation were performed to verify the reliability of the constructed models. Applicability domain of the model was evaluated, and the Williams plot revealed that the model can be used to predict the log K<sub>ILA</sub> values of structurally diverse solutes. The other 13 datasets were also processed in the same way, and all of the linear models with expressions similar to equation 1Ed were obtained. These models, whether linear of nonlinear, represent satisfactory statistical results, which confirms the universality of the method adopted in this study in QSPR modeling of gas-to-IL partition.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":2.8000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/minf.202200223","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0
Abstract
The present work was devoted to explore the quantitative structure-property relationships for gas-to-ionic liquid partition coefficients (log KILA ). A series of linear models were first established for the representative dataset (IL01). The optimal model was a four-parameter equation (1Ed) consisting of two electrostatic potential-based descriptors ( and Vs,max ), one 2D matrix-based descriptor (J_D/Dt) and dipole moment (μ). All of the four descriptors introduced in the model can find the corresponding parameters, directly or indirectly, from Abraham's linear solvation energy relationship (LSER) or its theoretical alternatives, which endows the model good interpretability. Gaussian process was utilized to build the nonlinear model. Systematical validations, including 5-fold cross-validation for the training set, the validation for test set, as well as a more rigorous Monte Carlo cross-validation were performed to verify the reliability of the constructed models. Applicability domain of the model was evaluated, and the Williams plot revealed that the model can be used to predict the log KILA values of structurally diverse solutes. The other 13 datasets were also processed in the same way, and all of the linear models with expressions similar to equation 1Ed were obtained. These models, whether linear of nonlinear, represent satisfactory statistical results, which confirms the universality of the method adopted in this study in QSPR modeling of gas-to-IL partition.
期刊介绍:
Molecular Informatics is a peer-reviewed, international forum for publication of high-quality, interdisciplinary research on all molecular aspects of bio/cheminformatics and computer-assisted molecular design. Molecular Informatics succeeded QSAR & Combinatorial Science in 2010.
Molecular Informatics presents methodological innovations that will lead to a deeper understanding of ligand-receptor interactions, macromolecular complexes, molecular networks, design concepts and processes that demonstrate how ideas and design concepts lead to molecules with a desired structure or function, preferably including experimental validation.
The journal''s scope includes but is not limited to the fields of drug discovery and chemical biology, protein and nucleic acid engineering and design, the design of nanomolecular structures, strategies for modeling of macromolecular assemblies, molecular networks and systems, pharmaco- and chemogenomics, computer-assisted screening strategies, as well as novel technologies for the de novo design of biologically active molecules. As a unique feature Molecular Informatics publishes so-called "Methods Corner" review-type articles which feature important technological concepts and advances within the scope of the journal.