{"title":"Protein Language Models Expose Viral Immune Mimicry.","authors":"Dan Ofer, Michal Linial","doi":"10.3390/v17091199","DOIUrl":null,"url":null,"abstract":"<p><p>Viruses have evolved sophisticated solutions to evade host immunity. One of the most pervasive strategies is molecular mimicry, whereby viruses imitate the molecular and biophysical features of their hosts. This mimicry poses significant challenges for immune recognition, therapeutic targeting, and vaccine development. In this study, we leverage pretrained protein language models (PLMs) to distinguish between viral and human proteins. Our model enables the identification and interpretation of viral proteins that most frequently elude classification. We characterize these by integrating PLMs with explainable models. Our approach achieves state-of-the-art performance with ROC-AUC of 99.7%. The 3.9% of misclassified sequences are signified by viral proteins with low immunogenicity. These errors disproportionately involve human-specific viral families associated with chronic infections and immune evasion, suggesting that both the immune system and machine learning models are confounded by overlapping biophysical signals. By coupling PLMs with explainable AI techniques, our work advances computational virology and offers mechanistic insights into viral immune escape. These findings carry implications for the rational design of vaccines, and improved strategies to counteract viral persistence and pathogenicity.</p>","PeriodicalId":49328,"journal":{"name":"Viruses-Basel","volume":"17 9","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12474240/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Viruses-Basel","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/v17091199","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"VIROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Viruses have evolved sophisticated solutions to evade host immunity. One of the most pervasive strategies is molecular mimicry, whereby viruses imitate the molecular and biophysical features of their hosts. This mimicry poses significant challenges for immune recognition, therapeutic targeting, and vaccine development. In this study, we leverage pretrained protein language models (PLMs) to distinguish between viral and human proteins. Our model enables the identification and interpretation of viral proteins that most frequently elude classification. We characterize these by integrating PLMs with explainable models. Our approach achieves state-of-the-art performance with ROC-AUC of 99.7%. The 3.9% of misclassified sequences are signified by viral proteins with low immunogenicity. These errors disproportionately involve human-specific viral families associated with chronic infections and immune evasion, suggesting that both the immune system and machine learning models are confounded by overlapping biophysical signals. By coupling PLMs with explainable AI techniques, our work advances computational virology and offers mechanistic insights into viral immune escape. These findings carry implications for the rational design of vaccines, and improved strategies to counteract viral persistence and pathogenicity.
期刊介绍:
Viruses (ISSN 1999-4915) is an open access journal which provides an advanced forum for studies of viruses. It publishes reviews, regular research papers, communications, conference reports and short notes. Our aim is to encourage scientists to publish their experimental and theoretical results in as much detail as possible. There is no restriction on the length of the papers. The full experimental details must be provided so that the results can be reproduced. We also encourage the publication of timely reviews and commentaries on topics of interest to the virology community and feature highlights from the virology literature in the ''News and Views'' section. Electronic files or software regarding the full details of the calculation and experimental procedure, if unable to be published in a normal way, can be deposited as supplementary material.