共晶形成预测:混合GIN-Mordred模型优于基于dft的方法

IF 3.2 2区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY
Mohammad Amin Ghanavati,  and , Sohrab Rohani*, 
{"title":"共晶形成预测:混合GIN-Mordred模型优于基于dft的方法","authors":"Mohammad Amin Ghanavati,&nbsp; and ,&nbsp;Sohrab Rohani*,&nbsp;","doi":"10.1021/acs.cgd.5c0034710.1021/acs.cgd.5c00347","DOIUrl":null,"url":null,"abstract":"<p >Cocrystals offer significant potential across various industries, especially pharmaceuticals, by addressing the poor solubility of new drug candidates. However, traditional experimental screening for cocrystal formation is expensive and time-consuming, highlighting the need for predictive models. In this study, we compared four cocrystal prediction approaches: two deep learning (DL) models based on DFT-driven data (PointNet for electrostatic potential (ESP) maps and a novel LSTM for sequential hydrogen bond parameters), a novel hybrid model combining graph isomorphism networks (GIN) with Mordred descriptors, and the empirical Hydrogen Bond Energy (HBE) method. To perform this comparison, we compiled and carried out DFT calculations for 14,790 molecules (7395 pairs of successful and unsuccessful cocrystals). Notably, the GIN-Mordred model outperformed all other methods, achieving the highest balanced accuracy (BACC: 0.916), F1-score (0.956), recall (0.932), and AUC (0.97), with superior segregation performance in distinguishing between cocrystallization outcomes. Importantly, the GIN-Mordred model does not require costly DFT calculations, demonstrating that a combination of graph-based and descriptor-based molecular representation provides an efficient and accurate alternative for cocrystal prediction. This model significantly streamlines the process of tuning the physicochemical properties of crystalline materials for various applications.</p>","PeriodicalId":34,"journal":{"name":"Crystal Growth & Design","volume":"25 8","pages":"2717–2729 2717–2729"},"PeriodicalIF":3.2000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cocrystal Formation Prediction: Hybrid GIN-Mordred Model Outperforms DFT-Based Methods\",\"authors\":\"Mohammad Amin Ghanavati,&nbsp; and ,&nbsp;Sohrab Rohani*,&nbsp;\",\"doi\":\"10.1021/acs.cgd.5c0034710.1021/acs.cgd.5c00347\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Cocrystals offer significant potential across various industries, especially pharmaceuticals, by addressing the poor solubility of new drug candidates. However, traditional experimental screening for cocrystal formation is expensive and time-consuming, highlighting the need for predictive models. In this study, we compared four cocrystal prediction approaches: two deep learning (DL) models based on DFT-driven data (PointNet for electrostatic potential (ESP) maps and a novel LSTM for sequential hydrogen bond parameters), a novel hybrid model combining graph isomorphism networks (GIN) with Mordred descriptors, and the empirical Hydrogen Bond Energy (HBE) method. To perform this comparison, we compiled and carried out DFT calculations for 14,790 molecules (7395 pairs of successful and unsuccessful cocrystals). Notably, the GIN-Mordred model outperformed all other methods, achieving the highest balanced accuracy (BACC: 0.916), F1-score (0.956), recall (0.932), and AUC (0.97), with superior segregation performance in distinguishing between cocrystallization outcomes. Importantly, the GIN-Mordred model does not require costly DFT calculations, demonstrating that a combination of graph-based and descriptor-based molecular representation provides an efficient and accurate alternative for cocrystal prediction. This model significantly streamlines the process of tuning the physicochemical properties of crystalline materials for various applications.</p>\",\"PeriodicalId\":34,\"journal\":{\"name\":\"Crystal Growth & Design\",\"volume\":\"25 8\",\"pages\":\"2717–2729 2717–2729\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-03-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Crystal Growth & Design\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acs.cgd.5c00347\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Crystal Growth & Design","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.cgd.5c00347","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

通过解决新候选药物溶解度差的问题,共晶在各个行业,特别是制药行业提供了巨大的潜力。然而,传统的共晶形成实验筛选既昂贵又耗时,这突出了对预测模型的需求。在这项研究中,我们比较了四种共晶预测方法:两种基于dft驱动数据的深度学习(DL)模型(用于静电电位(ESP)映射的PointNet和用于顺序氢键参数的新型LSTM),一种结合图同构网络(GIN)和Mordred描述符的新型混合模型,以及经验氢键能(HBE)方法。为了进行比较,我们对14,790个分子(7395对成功和不成功的共晶)进行了DFT计算。值得注意的是,GIN-Mordred模型优于所有其他方法,实现了最高的平衡精度(BACC: 0.916), f1评分(0.956),召回率(0.932)和AUC(0.97),在区分共结晶结果方面具有优越的分离性能。重要的是,GIN-Mordred模型不需要昂贵的DFT计算,这表明基于图和基于描述符的分子表示的组合为共晶预测提供了一种高效和准确的替代方案。该模型显著简化了调整各种应用晶体材料的物理化学性质的过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Cocrystal Formation Prediction: Hybrid GIN-Mordred Model Outperforms DFT-Based Methods

Cocrystal Formation Prediction: Hybrid GIN-Mordred Model Outperforms DFT-Based Methods

Cocrystals offer significant potential across various industries, especially pharmaceuticals, by addressing the poor solubility of new drug candidates. However, traditional experimental screening for cocrystal formation is expensive and time-consuming, highlighting the need for predictive models. In this study, we compared four cocrystal prediction approaches: two deep learning (DL) models based on DFT-driven data (PointNet for electrostatic potential (ESP) maps and a novel LSTM for sequential hydrogen bond parameters), a novel hybrid model combining graph isomorphism networks (GIN) with Mordred descriptors, and the empirical Hydrogen Bond Energy (HBE) method. To perform this comparison, we compiled and carried out DFT calculations for 14,790 molecules (7395 pairs of successful and unsuccessful cocrystals). Notably, the GIN-Mordred model outperformed all other methods, achieving the highest balanced accuracy (BACC: 0.916), F1-score (0.956), recall (0.932), and AUC (0.97), with superior segregation performance in distinguishing between cocrystallization outcomes. Importantly, the GIN-Mordred model does not require costly DFT calculations, demonstrating that a combination of graph-based and descriptor-based molecular representation provides an efficient and accurate alternative for cocrystal prediction. This model significantly streamlines the process of tuning the physicochemical properties of crystalline materials for various applications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Crystal Growth & Design
Crystal Growth & Design 化学-材料科学:综合
CiteScore
6.30
自引率
10.50%
发文量
650
审稿时长
1.9 months
期刊介绍: The aim of Crystal Growth & Design is to stimulate crossfertilization of knowledge among scientists and engineers working in the fields of crystal growth, crystal engineering, and the industrial application of crystalline materials. Crystal Growth & Design publishes theoretical and experimental studies of the physical, chemical, and biological phenomena and processes related to the design, growth, and application of crystalline materials. Synergistic approaches originating from different disciplines and technologies and integrating the fields of crystal growth, crystal engineering, intermolecular interactions, and industrial application are encouraged.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信