精选活性药物成分在胆碱和甜菜碱基深共晶溶剂中的溶解度超空间探索:机器学习建模与实验验证。

IF 4.2 2区 化学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
Piotr Cysewski, Tomasz Jeliński, Maciej Przybyłek
{"title":"精选活性药物成分在胆碱和甜菜碱基深共晶溶剂中的溶解度超空间探索:机器学习建模与实验验证。","authors":"Piotr Cysewski, Tomasz Jeliński, Maciej Przybyłek","doi":"10.3390/molecules29204894","DOIUrl":null,"url":null,"abstract":"<p><p>Deep eutectic solvents (DESs) are popular green media used for various industrial, pharmaceutical, and biomedical applications. However, the possible compositions of eutectic systems are so numerous that it is impossible to study all of them experimentally. To remedy this limitation, the solubility landscape of selected active pharmaceutical ingredients (APIs) in choline chloride- and betaine-based deep eutectic solvents was explored using theoretical models based on machine learning. The available solubility data for the selected APIs, comprising a total of 8014 data points, were collected for the available neat solvents, binary solvent mixtures, and DESs. This set was augmented with new measurements for the popular sulfa drugs in dry DESs. The descriptors used in the machine learning protocol were obtained from the σ-profiles of the considered molecules computed within the COSMO-RS framework. A combination of six sets of descriptors and 36 regressors were tested. Taking into account both accuracy and generalization, it was concluded that the best regressor is nuSVR regressor-based predictive models trained using the relative intermolecular interactions and a twelve-step averaged simplification of the relative σ-profiles.</p>","PeriodicalId":19041,"journal":{"name":"Molecules","volume":null,"pages":null},"PeriodicalIF":4.2000,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11510433/pdf/","citationCount":"0","resultStr":"{\"title\":\"Exploration of the Solubility Hyperspace of Selected Active Pharmaceutical Ingredients in Choline- and Betaine-Based Deep Eutectic Solvents: Machine Learning Modeling and Experimental Validation.\",\"authors\":\"Piotr Cysewski, Tomasz Jeliński, Maciej Przybyłek\",\"doi\":\"10.3390/molecules29204894\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Deep eutectic solvents (DESs) are popular green media used for various industrial, pharmaceutical, and biomedical applications. However, the possible compositions of eutectic systems are so numerous that it is impossible to study all of them experimentally. To remedy this limitation, the solubility landscape of selected active pharmaceutical ingredients (APIs) in choline chloride- and betaine-based deep eutectic solvents was explored using theoretical models based on machine learning. The available solubility data for the selected APIs, comprising a total of 8014 data points, were collected for the available neat solvents, binary solvent mixtures, and DESs. This set was augmented with new measurements for the popular sulfa drugs in dry DESs. The descriptors used in the machine learning protocol were obtained from the σ-profiles of the considered molecules computed within the COSMO-RS framework. A combination of six sets of descriptors and 36 regressors were tested. Taking into account both accuracy and generalization, it was concluded that the best regressor is nuSVR regressor-based predictive models trained using the relative intermolecular interactions and a twelve-step averaged simplification of the relative σ-profiles.</p>\",\"PeriodicalId\":19041,\"journal\":{\"name\":\"Molecules\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2024-10-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11510433/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecules\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.3390/molecules29204894\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecules","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.3390/molecules29204894","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

深共晶溶剂(DES)是一种常用的绿色介质,可用于各种工业、制药和生物医学领域。然而,共晶体系的可能组成非常多,不可能对所有组成进行实验研究。为了弥补这一局限性,我们利用基于机器学习的理论模型探索了选定活性药物成分(API)在氯化胆碱和甜菜碱基深共晶溶剂中的溶解度情况。针对现有的纯溶剂、二元溶剂混合物和 DES,收集了所选原料药的可用溶解度数据,共计 8014 个数据点。在此基础上,又增加了对常用磺胺类药物在干燥 DESs 中的溶解度进行的新测量。机器学习协议中使用的描述符来自在 COSMO-RS 框架内计算的所考虑分子的 σ 配置文件。测试了六组描述符和 36 个回归因子的组合。考虑到准确性和普适性,得出的结论是最佳回归因子是基于 nuSVR 回归因子的预测模型,该模型使用相对分子间相互作用和相对 σ-profile的十二步平均简化进行训练。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Exploration of the Solubility Hyperspace of Selected Active Pharmaceutical Ingredients in Choline- and Betaine-Based Deep Eutectic Solvents: Machine Learning Modeling and Experimental Validation.

Deep eutectic solvents (DESs) are popular green media used for various industrial, pharmaceutical, and biomedical applications. However, the possible compositions of eutectic systems are so numerous that it is impossible to study all of them experimentally. To remedy this limitation, the solubility landscape of selected active pharmaceutical ingredients (APIs) in choline chloride- and betaine-based deep eutectic solvents was explored using theoretical models based on machine learning. The available solubility data for the selected APIs, comprising a total of 8014 data points, were collected for the available neat solvents, binary solvent mixtures, and DESs. This set was augmented with new measurements for the popular sulfa drugs in dry DESs. The descriptors used in the machine learning protocol were obtained from the σ-profiles of the considered molecules computed within the COSMO-RS framework. A combination of six sets of descriptors and 36 regressors were tested. Taking into account both accuracy and generalization, it was concluded that the best regressor is nuSVR regressor-based predictive models trained using the relative intermolecular interactions and a twelve-step averaged simplification of the relative σ-profiles.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Molecules
Molecules 化学-有机化学
CiteScore
7.40
自引率
8.70%
发文量
7524
审稿时长
1.4 months
期刊介绍: Molecules (ISSN 1420-3049, CODEN: MOLEFW) is an open access journal of synthetic organic chemistry and natural product chemistry. All articles are peer-reviewed and published continously upon acceptance. Molecules is published by MDPI, Basel, Switzerland. Our aim is to encourage chemists to publish as much as possible their experimental detail, particularly synthetic procedures and characterization information. There is no restriction on the length of the experimental section. In addition, availability of compound samples is published and considered as important information. Authors are encouraged to register or deposit their chemical samples through the non-profit international organization Molecular Diversity Preservation International (MDPI). Molecules has been launched in 1996 to preserve and exploit molecular diversity of both, chemical information and chemical substances.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信