具有多视图路径聚合的异质网络:药物-靶标相互作用预测研究设计。

IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS
Haixue Zhao, Kui Yao, Yunjiong Liu, Chao Che, Lin Tang
{"title":"具有多视图路径聚合的异质网络:药物-靶标相互作用预测研究设计。","authors":"Haixue Zhao, Kui Yao, Yunjiong Liu, Chao Che, Lin Tang","doi":"10.2196/74974","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Drug-target interaction (DTI) prediction is crucial in drug repositioning, as it can significantly reduce research and development costs and shorten the development cycle. Most existing deep learning-based approaches employ graph neural networks for DTI prediction. However, these approaches still face limitations in capturing complex biochemical features, integrating multilevel information, and providing interpretable model insights.</p><p><strong>Objective: </strong>This study proposes a heterogeneous network model based on multiview path aggregation, aiming to predict interactions between drugs and targets.</p><p><strong>Methods: </strong>This study employed a molecular attention transformer to extract 3D conformation features from the chemical structures of drugs and utilized Prot-T5, a protein-specific large language model, to deeply explore biophysically and functionally relevant features from protein sequences. By integrating drugs, proteins, diseases, and side effects from multisource heterogeneous data, we constructed a heterogeneous graph model to systematically characterize multidimensional associations between biological entities. On this foundation, a meta-path aggregation mechanism was proposed, which dynamically integrates information from both feature views and biological network relationship views. This mechanism effectively learned potential interaction patterns between biological entities and provided a more comprehensive representation of the complex relationships in the heterogeneous graph. It enhanced the model's ability to capture sophisticated, context-dependent relationships in biological networks. Furthermore, we integrated multiscale features of drugs and proteins within the heterogeneous network, significantly improving the prediction accuracy of DTIs and enhancing the model's interpretability and generalization ability.</p><p><strong>Results: </strong>In the DTI prediction task, the proposed model achieves an AUPR (area under the precision-recall curve) of 0.901 and an AUROC (area under the receiver operating characteristic curve) of 0.966, representing improvements of 1.7% and 0.8%, respectively, over the baseline methods. Furthermore, a case study on the KCNH2 target demonstrates that the proposed model successfully predicts 38 out of 53 candidate drugs as having interactions, which further validates its reliability and practicality in real-world scenarios.</p><p><strong>Conclusions: </strong>The proposed model shows marked superiority over baseline methods, highlighting the importance of integrating heterogeneous information with biological knowledge in DTI prediction.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e74974"},"PeriodicalIF":3.8000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12490815/pdf/","citationCount":"0","resultStr":"{\"title\":\"Heterogeneous Network With Multiview Path Aggregation: Drug-Target Interaction Prediction Study Design.\",\"authors\":\"Haixue Zhao, Kui Yao, Yunjiong Liu, Chao Che, Lin Tang\",\"doi\":\"10.2196/74974\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Drug-target interaction (DTI) prediction is crucial in drug repositioning, as it can significantly reduce research and development costs and shorten the development cycle. Most existing deep learning-based approaches employ graph neural networks for DTI prediction. However, these approaches still face limitations in capturing complex biochemical features, integrating multilevel information, and providing interpretable model insights.</p><p><strong>Objective: </strong>This study proposes a heterogeneous network model based on multiview path aggregation, aiming to predict interactions between drugs and targets.</p><p><strong>Methods: </strong>This study employed a molecular attention transformer to extract 3D conformation features from the chemical structures of drugs and utilized Prot-T5, a protein-specific large language model, to deeply explore biophysically and functionally relevant features from protein sequences. By integrating drugs, proteins, diseases, and side effects from multisource heterogeneous data, we constructed a heterogeneous graph model to systematically characterize multidimensional associations between biological entities. On this foundation, a meta-path aggregation mechanism was proposed, which dynamically integrates information from both feature views and biological network relationship views. This mechanism effectively learned potential interaction patterns between biological entities and provided a more comprehensive representation of the complex relationships in the heterogeneous graph. It enhanced the model's ability to capture sophisticated, context-dependent relationships in biological networks. Furthermore, we integrated multiscale features of drugs and proteins within the heterogeneous network, significantly improving the prediction accuracy of DTIs and enhancing the model's interpretability and generalization ability.</p><p><strong>Results: </strong>In the DTI prediction task, the proposed model achieves an AUPR (area under the precision-recall curve) of 0.901 and an AUROC (area under the receiver operating characteristic curve) of 0.966, representing improvements of 1.7% and 0.8%, respectively, over the baseline methods. Furthermore, a case study on the KCNH2 target demonstrates that the proposed model successfully predicts 38 out of 53 candidate drugs as having interactions, which further validates its reliability and practicality in real-world scenarios.</p><p><strong>Conclusions: </strong>The proposed model shows marked superiority over baseline methods, highlighting the importance of integrating heterogeneous information with biological knowledge in DTI prediction.</p>\",\"PeriodicalId\":56334,\"journal\":{\"name\":\"JMIR Medical Informatics\",\"volume\":\"13 \",\"pages\":\"e74974\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12490815/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR Medical Informatics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2196/74974\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/74974","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

摘要

背景:药物-靶标相互作用(drug -target interaction, DTI)预测在药物重新定位中至关重要,因为它可以显著降低研发成本,缩短研发周期。大多数现有的基于深度学习的方法使用图神经网络进行DTI预测。然而,这些方法在捕获复杂的生化特征、整合多层次信息和提供可解释的模型见解方面仍然面临局限性。目的:提出一种基于多视图路径聚合的异构网络模型,用于预测药物与靶点之间的相互作用。方法:利用分子注意力转换器从药物的化学结构中提取三维构象特征,利用蛋白质特异性大语言模型Prot-T5从蛋白质序列中深入挖掘生物物理和功能相关特征。通过整合来自多源异构数据的药物、蛋白质、疾病和副作用,我们构建了一个异构图模型,以系统地表征生物实体之间的多维关联。在此基础上,提出了一种元路径聚合机制,动态集成特征视图和生物网络关系视图的信息。该机制有效地学习了生物实体之间潜在的相互作用模式,并在异构图中提供了更全面的复杂关系表示。它增强了模型捕捉生物网络中复杂的、依赖于环境的关系的能力。此外,我们在异构网络中整合了药物和蛋白质的多尺度特征,显著提高了dti的预测精度,增强了模型的可解释性和泛化能力。结果:在DTI预测任务中,该模型的AUPR (precision-recall curve下面积)为0.901,AUROC (receiver operating characteristic curve下面积)为0.966,分别比基线方法提高了1.7%和0.8%。此外,对KCNH2靶点的案例研究表明,该模型成功预测了53种候选药物中的38种具有相互作用,进一步验证了其在现实场景中的可靠性和实用性。结论:所提出的模型比基线方法具有明显的优势,突出了在DTI预测中整合异质信息和生物学知识的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Heterogeneous Network With Multiview Path Aggregation: Drug-Target Interaction Prediction Study Design.

Background: Drug-target interaction (DTI) prediction is crucial in drug repositioning, as it can significantly reduce research and development costs and shorten the development cycle. Most existing deep learning-based approaches employ graph neural networks for DTI prediction. However, these approaches still face limitations in capturing complex biochemical features, integrating multilevel information, and providing interpretable model insights.

Objective: This study proposes a heterogeneous network model based on multiview path aggregation, aiming to predict interactions between drugs and targets.

Methods: This study employed a molecular attention transformer to extract 3D conformation features from the chemical structures of drugs and utilized Prot-T5, a protein-specific large language model, to deeply explore biophysically and functionally relevant features from protein sequences. By integrating drugs, proteins, diseases, and side effects from multisource heterogeneous data, we constructed a heterogeneous graph model to systematically characterize multidimensional associations between biological entities. On this foundation, a meta-path aggregation mechanism was proposed, which dynamically integrates information from both feature views and biological network relationship views. This mechanism effectively learned potential interaction patterns between biological entities and provided a more comprehensive representation of the complex relationships in the heterogeneous graph. It enhanced the model's ability to capture sophisticated, context-dependent relationships in biological networks. Furthermore, we integrated multiscale features of drugs and proteins within the heterogeneous network, significantly improving the prediction accuracy of DTIs and enhancing the model's interpretability and generalization ability.

Results: In the DTI prediction task, the proposed model achieves an AUPR (area under the precision-recall curve) of 0.901 and an AUROC (area under the receiver operating characteristic curve) of 0.966, representing improvements of 1.7% and 0.8%, respectively, over the baseline methods. Furthermore, a case study on the KCNH2 target demonstrates that the proposed model successfully predicts 38 out of 53 candidate drugs as having interactions, which further validates its reliability and practicality in real-world scenarios.

Conclusions: The proposed model shows marked superiority over baseline methods, highlighting the importance of integrating heterogeneous information with biological knowledge in DTI prediction.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
JMIR Medical Informatics
JMIR Medical Informatics Medicine-Health Informatics
CiteScore
7.90
自引率
3.10%
发文量
173
审稿时长
12 weeks
期刊介绍: JMIR Medical Informatics (JMI, ISSN 2291-9694) is a top-rated, tier A journal which focuses on clinical informatics, big data in health and health care, decision support for health professionals, electronic health records, ehealth infrastructures and implementation. It has a focus on applied, translational research, with a broad readership including clinicians, CIOs, engineers, industry and health informatics professionals. Published by JMIR Publications, publisher of the Journal of Medical Internet Research (JMIR), the leading eHealth/mHealth journal (Impact Factor 2016: 5.175), JMIR Med Inform has a slightly different scope (emphasizing more on applications for clinicians and health professionals rather than consumers/citizens, which is the focus of JMIR), publishes even faster, and also allows papers which are more technical or more formative than what would be published in the Journal of Medical Internet Research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信