SageTCR: a structure-based model integrating residue- and atom-level representations for enhanced TCR-pMHC binding prediction.

IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Xiangyi Li, Chuance Sun, Weiran Huang, Yanjing Wang, Buyong Ma
{"title":"SageTCR: a structure-based model integrating residue- and atom-level representations for enhanced TCR-pMHC binding prediction.","authors":"Xiangyi Li, Chuance Sun, Weiran Huang, Yanjing Wang, Buyong Ma","doi":"10.1093/bib/bbaf496","DOIUrl":null,"url":null,"abstract":"<p><p>T-cell receptors (TCRs) recognize peptide-MHC (pMHC) complexes through intricate structural interactions, which is a core component of adaptive immunity. However, the diverse and cross-reactive nature of TCRs poses great challenges for accurate prediction of TCR-epitope interactions, hampering the advancement and broad application of TCR-related therapies. Here, we present SageTCR, a bi-level graph neural network (GNN) framework that leverages structural data to predict TCR-pMHC binding possibilities. Harnessing the pretrained language models, SageTCR encodes detailed structural arrangement at both residue-level and atomic-level and effectively integrates the bimodal representations via attention mechanisms. To tackle the deficiency of experimental structures, we explore comprehensive data augmentation strategies to enrich the training and increase the generalizability while concurrently preserving the characteristic TCR-pMHC diagonal binding mode. SageTCR demonstrates superior performance compared to six methods with different deep learning architectures. Furthermore, SageTCR offers the interpretability by identifying and focusing on the conformational features of pivotal contact residues on the interface, which can provide valuable insights for TCR engineering and immunotherapy design.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 5","pages":""},"PeriodicalIF":7.7000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12454268/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbaf496","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

T-cell receptors (TCRs) recognize peptide-MHC (pMHC) complexes through intricate structural interactions, which is a core component of adaptive immunity. However, the diverse and cross-reactive nature of TCRs poses great challenges for accurate prediction of TCR-epitope interactions, hampering the advancement and broad application of TCR-related therapies. Here, we present SageTCR, a bi-level graph neural network (GNN) framework that leverages structural data to predict TCR-pMHC binding possibilities. Harnessing the pretrained language models, SageTCR encodes detailed structural arrangement at both residue-level and atomic-level and effectively integrates the bimodal representations via attention mechanisms. To tackle the deficiency of experimental structures, we explore comprehensive data augmentation strategies to enrich the training and increase the generalizability while concurrently preserving the characteristic TCR-pMHC diagonal binding mode. SageTCR demonstrates superior performance compared to six methods with different deep learning architectures. Furthermore, SageTCR offers the interpretability by identifying and focusing on the conformational features of pivotal contact residues on the interface, which can provide valuable insights for TCR engineering and immunotherapy design.

SageTCR:一个基于结构的模型,集成残基和原子级表示,用于增强TCR-pMHC结合预测。
t细胞受体(tcr)通过复杂的结构相互作用识别多肽- mhc (pMHC)复合物,是适应性免疫的核心组成部分。然而,tcr的多样性和交叉反应性给准确预测tcr -表位相互作用带来了很大的挑战,阻碍了tcr相关治疗的进步和广泛应用。在这里,我们提出了SageTCR,一个利用结构数据预测TCR-pMHC结合可能性的双级图神经网络(GNN)框架。SageTCR利用预训练的语言模型,对残差级和原子级的详细结构安排进行编码,并通过注意机制有效地集成了双峰表示。为了解决实验结构的不足,我们探索了综合的数据增强策略,在保留TCR-pMHC对角结合模式特征的同时,丰富了训练内容,提高了可泛化性。与六种不同深度学习架构的方法相比,SageTCR表现出卓越的性能。此外,SageTCR通过识别和关注界面上关键接触残基的构象特征提供了可解释性,这可以为TCR工程和免疫治疗设计提供有价值的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Briefings in bioinformatics
Briefings in bioinformatics 生物-生化研究方法
CiteScore
13.20
自引率
13.70%
发文量
549
审稿时长
6 months
期刊介绍: Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data. The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信