Xiangyi Li, Chuance Sun, Weiran Huang, Yanjing Wang, Buyong Ma
{"title":"SageTCR: a structure-based model integrating residue- and atom-level representations for enhanced TCR-pMHC binding prediction.","authors":"Xiangyi Li, Chuance Sun, Weiran Huang, Yanjing Wang, Buyong Ma","doi":"10.1093/bib/bbaf496","DOIUrl":null,"url":null,"abstract":"<p><p>T-cell receptors (TCRs) recognize peptide-MHC (pMHC) complexes through intricate structural interactions, which is a core component of adaptive immunity. However, the diverse and cross-reactive nature of TCRs poses great challenges for accurate prediction of TCR-epitope interactions, hampering the advancement and broad application of TCR-related therapies. Here, we present SageTCR, a bi-level graph neural network (GNN) framework that leverages structural data to predict TCR-pMHC binding possibilities. Harnessing the pretrained language models, SageTCR encodes detailed structural arrangement at both residue-level and atomic-level and effectively integrates the bimodal representations via attention mechanisms. To tackle the deficiency of experimental structures, we explore comprehensive data augmentation strategies to enrich the training and increase the generalizability while concurrently preserving the characteristic TCR-pMHC diagonal binding mode. SageTCR demonstrates superior performance compared to six methods with different deep learning architectures. Furthermore, SageTCR offers the interpretability by identifying and focusing on the conformational features of pivotal contact residues on the interface, which can provide valuable insights for TCR engineering and immunotherapy design.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 5","pages":""},"PeriodicalIF":7.7000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12454268/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbaf496","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
T-cell receptors (TCRs) recognize peptide-MHC (pMHC) complexes through intricate structural interactions, which is a core component of adaptive immunity. However, the diverse and cross-reactive nature of TCRs poses great challenges for accurate prediction of TCR-epitope interactions, hampering the advancement and broad application of TCR-related therapies. Here, we present SageTCR, a bi-level graph neural network (GNN) framework that leverages structural data to predict TCR-pMHC binding possibilities. Harnessing the pretrained language models, SageTCR encodes detailed structural arrangement at both residue-level and atomic-level and effectively integrates the bimodal representations via attention mechanisms. To tackle the deficiency of experimental structures, we explore comprehensive data augmentation strategies to enrich the training and increase the generalizability while concurrently preserving the characteristic TCR-pMHC diagonal binding mode. SageTCR demonstrates superior performance compared to six methods with different deep learning architectures. Furthermore, SageTCR offers the interpretability by identifying and focusing on the conformational features of pivotal contact residues on the interface, which can provide valuable insights for TCR engineering and immunotherapy design.
期刊介绍:
Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data.
The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.