SMPR: a structure-enhanced multimodal drug‒disease prediction model for drug repositioning and cold start

IF 5.7 2区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Xin Dong, Rui Miao, Suyan Zhang, Shuaibing Jia, Leifeng Zhang, Yong Liang, Jianhua Zhang, Yi Zhun Zhu
{"title":"SMPR: a structure-enhanced multimodal drug‒disease prediction model for drug repositioning and cold start","authors":"Xin Dong, Rui Miao, Suyan Zhang, Shuaibing Jia, Leifeng Zhang, Yong Liang, Jianhua Zhang, Yi Zhun Zhu","doi":"10.1186/s13321-025-01085-2","DOIUrl":null,"url":null,"abstract":"Repositioning drug‒disease relationships has always been a hot field of research. However, actual cases of biologically validated drug relocation remain very limited, and existing models have not yet fully utilized the structural information of the drug. Furthermore, most repositioning models are only used to complete the relationship matrix, and their practicality is poor when dealing with drug cold start problems. This paper proposes a structure-enhanced multimodal relationship prediction model (SMRP). SMPR is based on the SMILE structure of the drug, uses the MOL2VEC method to generate drug-embedded representations, and learns disease-embedded representations through heterogeneous network graph neural networks. Ultimately, a drug‒disease relationship matrix is constructed. In addition, to reduce the difficulty of use, SMPR also provides a cold start interface on the basis of structural similarity based on repositioning results to predict drug-related diseases simply and quickly. The repositioning ability and cold start capability of the model were verified from multiple perspectives. While the AUC and ACUPR scores of repositioning reached 99% and 61%, respectively, the AUC of the cold start method was 80%. In particular, the cold start recall indicator can reach more than 70%, which means that the SMPR is more sensitive to positive samples. Finally, case analysis was used to verify the practical value of the model, and visual analysis directly demonstrated the improvement in the structure of the model. For ease of use, we also provide local deployment of the model and packaged it into an executable program. Scientific contribution The SMPR model is a structure-enhanced multimodal learning model that focuses on the reference value of similar structures in practical docking and adds a drug embedding module. Considering that the predictive ability of the model is limited by the dataset, the model provides a cold start interface for the effective prediction of drugs that are not in the dataset. To further facilitate the use by pharmacology workers without programming knowledge and enhance its practical application value, the model was also encapsulated into a local executable program.","PeriodicalId":617,"journal":{"name":"Journal of Cheminformatics","volume":"1 1","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cheminformatics","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1186/s13321-025-01085-2","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Repositioning drug‒disease relationships has always been a hot field of research. However, actual cases of biologically validated drug relocation remain very limited, and existing models have not yet fully utilized the structural information of the drug. Furthermore, most repositioning models are only used to complete the relationship matrix, and their practicality is poor when dealing with drug cold start problems. This paper proposes a structure-enhanced multimodal relationship prediction model (SMRP). SMPR is based on the SMILE structure of the drug, uses the MOL2VEC method to generate drug-embedded representations, and learns disease-embedded representations through heterogeneous network graph neural networks. Ultimately, a drug‒disease relationship matrix is constructed. In addition, to reduce the difficulty of use, SMPR also provides a cold start interface on the basis of structural similarity based on repositioning results to predict drug-related diseases simply and quickly. The repositioning ability and cold start capability of the model were verified from multiple perspectives. While the AUC and ACUPR scores of repositioning reached 99% and 61%, respectively, the AUC of the cold start method was 80%. In particular, the cold start recall indicator can reach more than 70%, which means that the SMPR is more sensitive to positive samples. Finally, case analysis was used to verify the practical value of the model, and visual analysis directly demonstrated the improvement in the structure of the model. For ease of use, we also provide local deployment of the model and packaged it into an executable program. Scientific contribution The SMPR model is a structure-enhanced multimodal learning model that focuses on the reference value of similar structures in practical docking and adds a drug embedding module. Considering that the predictive ability of the model is limited by the dataset, the model provides a cold start interface for the effective prediction of drugs that are not in the dataset. To further facilitate the use by pharmacology workers without programming knowledge and enhance its practical application value, the model was also encapsulated into a local executable program.
SMPR:用于药物重新定位和冷启动的结构增强多模态药物-疾病预测模型
重新定位药病关系一直是一个研究热点。然而,经过生物学验证的药物再定位的实际案例仍然非常有限,现有模型尚未充分利用药物的结构信息。此外,大多数重定位模型仅用于完成关系矩阵,在处理药物冷启动问题时实用性较差。提出了一种结构增强的多模态关系预测模型(SMRP)。SMPR基于药物的SMILE结构,采用MOL2VEC方法生成药物嵌入表征,并通过异构网络图神经网络学习疾病嵌入表征。最后,构建了药物-疾病关系矩阵。此外,为了降低使用难度,SMPR还提供了基于重定位结果的结构相似性的冷启动界面,简单快速地预测药物相关疾病。从多个角度验证了模型的再定位能力和冷启动能力。重新定位的AUC和acur评分分别达到99%和61%,而冷启动法的AUC为80%。特别是冷启动召回指标可以达到70%以上,这意味着SMPR对阳性样品更加敏感。最后通过案例分析验证了模型的实用价值,可视化分析直接体现了模型在结构上的改进。为了便于使用,我们还提供了模型的本地部署,并将其打包为可执行程序。SMPR模型是一种结构增强的多模态学习模型,注重相似结构在实际对接中的参考价值,增加了药物嵌入模块。考虑到模型的预测能力受数据集的限制,模型提供了冷启动接口,对数据集中没有的药物进行有效预测。为了进一步方便没有编程知识的药理学工作者使用,提高其实际应用价值,还将该模型封装为本地可执行程序。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Cheminformatics
Journal of Cheminformatics CHEMISTRY, MULTIDISCIPLINARY-COMPUTER SCIENCE, INFORMATION SYSTEMS
CiteScore
14.10
自引率
7.00%
发文量
82
审稿时长
3 months
期刊介绍: Journal of Cheminformatics is an open access journal publishing original peer-reviewed research in all aspects of cheminformatics and molecular modelling. Coverage includes, but is not limited to: chemical information systems, software and databases, and molecular modelling, chemical structure representations and their use in structure, substructure, and similarity searching of chemical substance and chemical reaction databases, computer and molecular graphics, computer-aided molecular design, expert systems, QSAR, and data mining techniques.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信