An inductive learning-based method for predicting drug-gene interactions using a multi-relational drug-disease-gene graph.

IF 8.9
Journal of pharmaceutical analysis Pub Date : 2025-08-01 Epub Date: 2025-05-16 DOI:10.1016/j.jpha.2025.101347
Jian He, Yanling Wu, Linxi Yuan, Jiangguo Qiu, Menglong Li, Xuemei Pu, Yanzhi Guo
{"title":"An inductive learning-based method for predicting drug-gene interactions using a multi-relational drug-disease-gene graph.","authors":"Jian He, Yanling Wu, Linxi Yuan, Jiangguo Qiu, Menglong Li, Xuemei Pu, Yanzhi Guo","doi":"10.1016/j.jpha.2025.101347","DOIUrl":null,"url":null,"abstract":"<p><p>Computational analysis can accurately detect drug-gene interactions (DGIs) cost-effectively. However, transductive learning models are the hotspot to reveal the promising performance for unknown DGIs (both drugs and genes are present in the training model), without special attention to the unseen DGIs (both drugs and genes are absent in the training model). In view of this, this study, for the first time, proposed an inductive learning-based model for the precise identification of unseen DGIs. In our study, by integrating disease nodes to avoid data sparsity, a multi-relational drug-disease-gene (DDG) graph was constructed to achieve effective fusion of data on DDG intro-relationships and inter-actions. Following the extraction of graph features by utilizing graph embedding algorithms, our next step was the retrieval of the attributes of individual gene and drug nodes. In this way, a hybrid feature characterization was represented by integrating graph features and node attributes. Machine learning (ML) models were built, enabling the fulfillment of transductive predictions of unknown DGIs. To realize inductive learning, this study generated an innovative idea of transforming known node vectors derived from the DDG graph into representations of unseen nodes using node similarities as weights, enabling inductive predictions for the unseen DGIs. Consequently, the final model was superior to existing models, with significant improvement in predicting both external unknown and unseen DGIs. The practical feasibility of our model was further confirmed through case study and molecular docking. In summary, this study establishes an efficient data-driven approach through the proposed modeling, suggesting its value as a promising tool for accelerating drug discovery and repurposing.</p>","PeriodicalId":94338,"journal":{"name":"Journal of pharmaceutical analysis","volume":"15 8","pages":"101347"},"PeriodicalIF":8.9000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12446772/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of pharmaceutical analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.jpha.2025.101347","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/16 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Computational analysis can accurately detect drug-gene interactions (DGIs) cost-effectively. However, transductive learning models are the hotspot to reveal the promising performance for unknown DGIs (both drugs and genes are present in the training model), without special attention to the unseen DGIs (both drugs and genes are absent in the training model). In view of this, this study, for the first time, proposed an inductive learning-based model for the precise identification of unseen DGIs. In our study, by integrating disease nodes to avoid data sparsity, a multi-relational drug-disease-gene (DDG) graph was constructed to achieve effective fusion of data on DDG intro-relationships and inter-actions. Following the extraction of graph features by utilizing graph embedding algorithms, our next step was the retrieval of the attributes of individual gene and drug nodes. In this way, a hybrid feature characterization was represented by integrating graph features and node attributes. Machine learning (ML) models were built, enabling the fulfillment of transductive predictions of unknown DGIs. To realize inductive learning, this study generated an innovative idea of transforming known node vectors derived from the DDG graph into representations of unseen nodes using node similarities as weights, enabling inductive predictions for the unseen DGIs. Consequently, the final model was superior to existing models, with significant improvement in predicting both external unknown and unseen DGIs. The practical feasibility of our model was further confirmed through case study and molecular docking. In summary, this study establishes an efficient data-driven approach through the proposed modeling, suggesting its value as a promising tool for accelerating drug discovery and repurposing.

使用多关系药物-疾病-基因图预测药物-基因相互作用的基于归纳学习的方法。
计算分析可以准确、经济、有效地检测药物-基因相互作用。然而,转导学习模型是揭示未知dgi(训练模型中同时存在药物和基因)有希望的性能的热点,而没有特别关注不可见的dgi(训练模型中同时不存在药物和基因)。鉴于此,本研究首次提出了一种基于归纳学习的未见dgi精确识别模型。本研究通过整合疾病节点来避免数据稀疏,构建多关系药物-疾病-基因(DDG)图,实现DDG间关系和交互作用数据的有效融合。在利用图嵌入算法提取图特征之后,我们的下一步是检索单个基因和药物节点的属性。通过对图特征和节点属性的整合,实现了混合特征表征。建立了机器学习(ML)模型,实现了对未知dgi的转导预测。为了实现归纳学习,本研究提出了一种创新的思想,即使用节点相似度作为权重,将来自DDG图的已知节点向量转换为未见节点的表示,从而实现对未见dgi的归纳预测。因此,最终模型优于现有模型,在预测外部未知和未见dgi方面都有显着改善。通过案例研究和分子对接进一步证实了模型的实际可行性。总之,本研究通过提出的建模建立了一种有效的数据驱动方法,表明其作为加速药物发现和再利用的有前途的工具的价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信