An inductive learning-based method for predicting drug-gene interactions using a multi-relational drug-disease-gene graph.

IF 8.9

Journal of pharmaceutical analysis Pub Date : 2025-08-01 Epub Date: 2025-05-16 DOI:10.1016/j.jpha.2025.101347

Jian He, Yanling Wu, Linxi Yuan, Jiangguo Qiu, Menglong Li, Xuemei Pu, Yanzhi Guo

{"title":"An inductive learning-based method for predicting drug-gene interactions using a multi-relational drug-disease-gene graph.","authors":"Jian He, Yanling Wu, Linxi Yuan, Jiangguo Qiu, Menglong Li, Xuemei Pu, Yanzhi Guo","doi":"10.1016/j.jpha.2025.101347","DOIUrl":null,"url":null,"abstract":"<p><p>Computational analysis can accurately detect drug-gene interactions (DGIs) cost-effectively. However, transductive learning models are the hotspot to reveal the promising performance for unknown DGIs (both drugs and genes are present in the training model), without special attention to the unseen DGIs (both drugs and genes are absent in the training model). In view of this, this study, for the first time, proposed an inductive learning-based model for the precise identification of unseen DGIs. In our study, by integrating disease nodes to avoid data sparsity, a multi-relational drug-disease-gene (DDG) graph was constructed to achieve effective fusion of data on DDG intro-relationships and inter-actions. Following the extraction of graph features by utilizing graph embedding algorithms, our next step was the retrieval of the attributes of individual gene and drug nodes. In this way, a hybrid feature characterization was represented by integrating graph features and node attributes. Machine learning (ML) models were built, enabling the fulfillment of transductive predictions of unknown DGIs. To realize inductive learning, this study generated an innovative idea of transforming known node vectors derived from the DDG graph into representations of unseen nodes using node similarities as weights, enabling inductive predictions for the unseen DGIs. Consequently, the final model was superior to existing models, with significant improvement in predicting both external unknown and unseen DGIs. The practical feasibility of our model was further confirmed through case study and molecular docking. In summary, this study establishes an efficient data-driven approach through the proposed modeling, suggesting its value as a promising tool for accelerating drug discovery and repurposing.</p>","PeriodicalId":94338,"journal":{"name":"Journal of pharmaceutical analysis","volume":"15 8","pages":"101347"},"PeriodicalIF":8.9000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12446772/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of pharmaceutical analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.jpha.2025.101347","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/16 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Computational analysis can accurately detect drug-gene interactions (DGIs) cost-effectively. However, transductive learning models are the hotspot to reveal the promising performance for unknown DGIs (both drugs and genes are present in the training model), without special attention to the unseen DGIs (both drugs and genes are absent in the training model). In view of this, this study, for the first time, proposed an inductive learning-based model for the precise identification of unseen DGIs. In our study, by integrating disease nodes to avoid data sparsity, a multi-relational drug-disease-gene (DDG) graph was constructed to achieve effective fusion of data on DDG intro-relationships and inter-actions. Following the extraction of graph features by utilizing graph embedding algorithms, our next step was the retrieval of the attributes of individual gene and drug nodes. In this way, a hybrid feature characterization was represented by integrating graph features and node attributes. Machine learning (ML) models were built, enabling the fulfillment of transductive predictions of unknown DGIs. To realize inductive learning, this study generated an innovative idea of transforming known node vectors derived from the DDG graph into representations of unseen nodes using node similarities as weights, enabling inductive predictions for the unseen DGIs. Consequently, the final model was superior to existing models, with significant improvement in predicting both external unknown and unseen DGIs. The practical feasibility of our model was further confirmed through case study and molecular docking. In summary, this study establishes an efficient data-driven approach through the proposed modeling, suggesting its value as a promising tool for accelerating drug discovery and repurposing.

查看原文本刊更多论文

使用多关系药物-疾病-基因图预测药物-基因相互作用的基于归纳学习的方法。

计算分析可以准确、经济、有效地检测药物-基因相互作用。然而，转导学习模型是揭示未知dgi（训练模型中同时存在药物和基因）有希望的性能的热点，而没有特别关注不可见的dgi（训练模型中同时不存在药物和基因）。鉴于此，本研究首次提出了一种基于归纳学习的未见dgi精确识别模型。本研究通过整合疾病节点来避免数据稀疏，构建多关系药物-疾病-基因（DDG）图，实现DDG间关系和交互作用数据的有效融合。在利用图嵌入算法提取图特征之后，我们的下一步是检索单个基因和药物节点的属性。通过对图特征和节点属性的整合，实现了混合特征表征。建立了机器学习（ML）模型，实现了对未知dgi的转导预测。为了实现归纳学习，本研究提出了一种创新的思想，即使用节点相似度作为权重，将来自DDG图的已知节点向量转换为未见节点的表示，从而实现对未见dgi的归纳预测。因此，最终模型优于现有模型，在预测外部未知和未见dgi方面都有显着改善。通过案例研究和分子对接进一步证实了模型的实际可行性。总之，本研究通过提出的建模建立了一种有效的数据驱动方法，表明其作为加速药物发现和再利用的有前途的工具的价值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of pharmaceutical analysis

自引率

0.00%

发文量