Haixue Zhao, Kui Yao, Yunjiong Liu, Chao Che, Lin Tang
{"title":"Heterogeneous Network With Multiview Path Aggregation: Drug-Target Interaction Prediction Study Design.","authors":"Haixue Zhao, Kui Yao, Yunjiong Liu, Chao Che, Lin Tang","doi":"10.2196/74974","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Drug-target interaction (DTI) prediction is crucial in drug repositioning, as it can significantly reduce research and development costs and shorten the development cycle. Most existing deep learning-based approaches employ graph neural networks for DTI prediction. However, these approaches still face limitations in capturing complex biochemical features, integrating multilevel information, and providing interpretable model insights.</p><p><strong>Objective: </strong>This study proposes a heterogeneous network model based on multiview path aggregation, aiming to predict interactions between drugs and targets.</p><p><strong>Methods: </strong>This study employed a molecular attention transformer to extract 3D conformation features from the chemical structures of drugs and utilized Prot-T5, a protein-specific large language model, to deeply explore biophysically and functionally relevant features from protein sequences. By integrating drugs, proteins, diseases, and side effects from multisource heterogeneous data, we constructed a heterogeneous graph model to systematically characterize multidimensional associations between biological entities. On this foundation, a meta-path aggregation mechanism was proposed, which dynamically integrates information from both feature views and biological network relationship views. This mechanism effectively learned potential interaction patterns between biological entities and provided a more comprehensive representation of the complex relationships in the heterogeneous graph. It enhanced the model's ability to capture sophisticated, context-dependent relationships in biological networks. Furthermore, we integrated multiscale features of drugs and proteins within the heterogeneous network, significantly improving the prediction accuracy of DTIs and enhancing the model's interpretability and generalization ability.</p><p><strong>Results: </strong>In the DTI prediction task, the proposed model achieves an AUPR (area under the precision-recall curve) of 0.901 and an AUROC (area under the receiver operating characteristic curve) of 0.966, representing improvements of 1.7% and 0.8%, respectively, over the baseline methods. Furthermore, a case study on the KCNH2 target demonstrates that the proposed model successfully predicts 38 out of 53 candidate drugs as having interactions, which further validates its reliability and practicality in real-world scenarios.</p><p><strong>Conclusions: </strong>The proposed model shows marked superiority over baseline methods, highlighting the importance of integrating heterogeneous information with biological knowledge in DTI prediction.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e74974"},"PeriodicalIF":3.8000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12490815/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/74974","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Drug-target interaction (DTI) prediction is crucial in drug repositioning, as it can significantly reduce research and development costs and shorten the development cycle. Most existing deep learning-based approaches employ graph neural networks for DTI prediction. However, these approaches still face limitations in capturing complex biochemical features, integrating multilevel information, and providing interpretable model insights.
Objective: This study proposes a heterogeneous network model based on multiview path aggregation, aiming to predict interactions between drugs and targets.
Methods: This study employed a molecular attention transformer to extract 3D conformation features from the chemical structures of drugs and utilized Prot-T5, a protein-specific large language model, to deeply explore biophysically and functionally relevant features from protein sequences. By integrating drugs, proteins, diseases, and side effects from multisource heterogeneous data, we constructed a heterogeneous graph model to systematically characterize multidimensional associations between biological entities. On this foundation, a meta-path aggregation mechanism was proposed, which dynamically integrates information from both feature views and biological network relationship views. This mechanism effectively learned potential interaction patterns between biological entities and provided a more comprehensive representation of the complex relationships in the heterogeneous graph. It enhanced the model's ability to capture sophisticated, context-dependent relationships in biological networks. Furthermore, we integrated multiscale features of drugs and proteins within the heterogeneous network, significantly improving the prediction accuracy of DTIs and enhancing the model's interpretability and generalization ability.
Results: In the DTI prediction task, the proposed model achieves an AUPR (area under the precision-recall curve) of 0.901 and an AUROC (area under the receiver operating characteristic curve) of 0.966, representing improvements of 1.7% and 0.8%, respectively, over the baseline methods. Furthermore, a case study on the KCNH2 target demonstrates that the proposed model successfully predicts 38 out of 53 candidate drugs as having interactions, which further validates its reliability and practicality in real-world scenarios.
Conclusions: The proposed model shows marked superiority over baseline methods, highlighting the importance of integrating heterogeneous information with biological knowledge in DTI prediction.
期刊介绍:
JMIR Medical Informatics (JMI, ISSN 2291-9694) is a top-rated, tier A journal which focuses on clinical informatics, big data in health and health care, decision support for health professionals, electronic health records, ehealth infrastructures and implementation. It has a focus on applied, translational research, with a broad readership including clinicians, CIOs, engineers, industry and health informatics professionals.
Published by JMIR Publications, publisher of the Journal of Medical Internet Research (JMIR), the leading eHealth/mHealth journal (Impact Factor 2016: 5.175), JMIR Med Inform has a slightly different scope (emphasizing more on applications for clinicians and health professionals rather than consumers/citizens, which is the focus of JMIR), publishes even faster, and also allows papers which are more technical or more formative than what would be published in the Journal of Medical Internet Research.