{"title":"Graph neural network integrated with pretrained protein language model for predicting human-virus protein-protein interactions.","authors":"Linyang Jiang, Xiaodi Yang, Xiaokun Guo, Dianke Li, Jiajun Li, Stefan Wuchty, Wenyu Shi, Ziding Zhang","doi":"10.1093/bib/bbaf461","DOIUrl":null,"url":null,"abstract":"<p><p>The systematic identification of human-virus protein-protein interactions (PPIs) is a critical step toward elucidating the underlying mechanisms of viral infection, directly informing the development of targeted interventions against existing and emerging viral threats. In this work, we presented DeepGNHV, an end-to-end framework that integrated a pretrained protein language model with structural features derived from AlphaFold2 and leveraged graph attention networks to predict human-virus PPIs. In comparison to other state-of-the-art approaches, DeepGNHV exhibited superior predictive performance, especially when applied to viral proteins absent from the training process, indicating its strong generalization capability for detecting newly emerging virus-related PPIs. We further demonstrated DeepGNHV's robustness across diverse perturbations and its practical application under high-confidence thresholds. Additionally, we conducted extensive predictions of human-HPV PPIs, which were supported by multiple lines of evidence and identified several host factors that specifically interact with high-risk HPV. To further explore the biological significance of DeepGNHV, we provided a case study to pinpoint specific residues that play critical roles in facilitating the corresponding PPIs. The source code of DeepGNHV and related data is publicly available on GitHub (https://github.com/bioboy0415/DeepGNHV).</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 5","pages":""},"PeriodicalIF":7.7000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12415850/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbaf461","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
The systematic identification of human-virus protein-protein interactions (PPIs) is a critical step toward elucidating the underlying mechanisms of viral infection, directly informing the development of targeted interventions against existing and emerging viral threats. In this work, we presented DeepGNHV, an end-to-end framework that integrated a pretrained protein language model with structural features derived from AlphaFold2 and leveraged graph attention networks to predict human-virus PPIs. In comparison to other state-of-the-art approaches, DeepGNHV exhibited superior predictive performance, especially when applied to viral proteins absent from the training process, indicating its strong generalization capability for detecting newly emerging virus-related PPIs. We further demonstrated DeepGNHV's robustness across diverse perturbations and its practical application under high-confidence thresholds. Additionally, we conducted extensive predictions of human-HPV PPIs, which were supported by multiple lines of evidence and identified several host factors that specifically interact with high-risk HPV. To further explore the biological significance of DeepGNHV, we provided a case study to pinpoint specific residues that play critical roles in facilitating the corresponding PPIs. The source code of DeepGNHV and related data is publicly available on GitHub (https://github.com/bioboy0415/DeepGNHV).
期刊介绍:
Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data.
The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.