{"title":"Cocrystal Formation Prediction: Hybrid GIN-Mordred Model Outperforms DFT-Based Methods","authors":"Mohammad Amin Ghanavati, and , Sohrab Rohani*, ","doi":"10.1021/acs.cgd.5c0034710.1021/acs.cgd.5c00347","DOIUrl":null,"url":null,"abstract":"<p >Cocrystals offer significant potential across various industries, especially pharmaceuticals, by addressing the poor solubility of new drug candidates. However, traditional experimental screening for cocrystal formation is expensive and time-consuming, highlighting the need for predictive models. In this study, we compared four cocrystal prediction approaches: two deep learning (DL) models based on DFT-driven data (PointNet for electrostatic potential (ESP) maps and a novel LSTM for sequential hydrogen bond parameters), a novel hybrid model combining graph isomorphism networks (GIN) with Mordred descriptors, and the empirical Hydrogen Bond Energy (HBE) method. To perform this comparison, we compiled and carried out DFT calculations for 14,790 molecules (7395 pairs of successful and unsuccessful cocrystals). Notably, the GIN-Mordred model outperformed all other methods, achieving the highest balanced accuracy (BACC: 0.916), F1-score (0.956), recall (0.932), and AUC (0.97), with superior segregation performance in distinguishing between cocrystallization outcomes. Importantly, the GIN-Mordred model does not require costly DFT calculations, demonstrating that a combination of graph-based and descriptor-based molecular representation provides an efficient and accurate alternative for cocrystal prediction. This model significantly streamlines the process of tuning the physicochemical properties of crystalline materials for various applications.</p>","PeriodicalId":34,"journal":{"name":"Crystal Growth & Design","volume":"25 8","pages":"2717–2729 2717–2729"},"PeriodicalIF":3.2000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Crystal Growth & Design","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.cgd.5c00347","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Cocrystals offer significant potential across various industries, especially pharmaceuticals, by addressing the poor solubility of new drug candidates. However, traditional experimental screening for cocrystal formation is expensive and time-consuming, highlighting the need for predictive models. In this study, we compared four cocrystal prediction approaches: two deep learning (DL) models based on DFT-driven data (PointNet for electrostatic potential (ESP) maps and a novel LSTM for sequential hydrogen bond parameters), a novel hybrid model combining graph isomorphism networks (GIN) with Mordred descriptors, and the empirical Hydrogen Bond Energy (HBE) method. To perform this comparison, we compiled and carried out DFT calculations for 14,790 molecules (7395 pairs of successful and unsuccessful cocrystals). Notably, the GIN-Mordred model outperformed all other methods, achieving the highest balanced accuracy (BACC: 0.916), F1-score (0.956), recall (0.932), and AUC (0.97), with superior segregation performance in distinguishing between cocrystallization outcomes. Importantly, the GIN-Mordred model does not require costly DFT calculations, demonstrating that a combination of graph-based and descriptor-based molecular representation provides an efficient and accurate alternative for cocrystal prediction. This model significantly streamlines the process of tuning the physicochemical properties of crystalline materials for various applications.
期刊介绍:
The aim of Crystal Growth & Design is to stimulate crossfertilization of knowledge among scientists and engineers working in the fields of crystal growth, crystal engineering, and the industrial application of crystalline materials.
Crystal Growth & Design publishes theoretical and experimental studies of the physical, chemical, and biological phenomena and processes related to the design, growth, and application of crystalline materials. Synergistic approaches originating from different disciplines and technologies and integrating the fields of crystal growth, crystal engineering, intermolecular interactions, and industrial application are encouraged.