{"title":"Multi-positive contrastive learning-based cross-attention model for T cell receptor–antigen binding prediction","authors":"Yi Shuai , Pengcheng Shen , Xianrui Zhang","doi":"10.1016/j.cmpb.2025.108797","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Objective:</h3><div>T cells play a vital role in the immune system by recognizing and eliminating infected or cancerous cells, thus driving adaptive immune responses. Their activation is triggered by the binding of T cell receptors (TCRs) to epitopes presented on Major Histocompatibility Complex (MHC) molecules. However, experimentally identifying antigens that could be recognizable by T cells and possess immunogenic properties is resource-intensive, with most candidates proving non-immunogenic, underscoring the need for computational tools to predict peptide-MHC (pMHC) and TCR binding. Despite extensive efforts, accurately predicting TCR-antigen binding pairs remains challenging due to the vast diversity of TCRs.</div></div><div><h3>Methods:</h3><div>In this study, we propose a Contrastive Cross-attention model for TCR (ConTCR) and pMHC binding prediction. Firstly, the pMHC and TCR sequences are transformed into high-level embedding by pretrained encoders as feature representations. Then, we employ the multi-modal cross-attention to combine the features between pMHC sequences and TCR sequences. Next, based on the contrastive learning strategy, we pretrained the backbone of ConTCR to boost the model’s feature extraction ability for pMHC and TCR sequences. Finally, the model is fine-tuned for classification between positive and negative samples.</div></div><div><h3>Results:</h3><div>Based on this advanced strategy, our proposed model could effectively capture the critical information on TCR-pMHC interactions, and the model is visualized by the attention score heatmap for interpretability. ConTCR demonstrates strong generalization in predicting binding specificity for unseen epitopes and diverse TCR repertoires. On independent non-zero-shot test sets, the model achieved AUC-ROC scores of 0.849 and 0.950; on zero-shot test sets, it obtained AUC-ROC scores of 0.830 and 0.938.</div></div><div><h3>Conclusion:</h3><div>Our framework offers a promising solution for improving pMHC-TCR binding prediction and model interpretability. By leveraging the ConTCR model and pMHC-TCR features, we achieve more precise precision than recently advanced models. Overall, ConTCR is a robust tool for predicting pMHC-TCR binding and holds significant promise to advance TCR-based immunotherapies as a valuable artificial intelligence tool. The codes and data used in this study are available at this <span><span>website</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"268 ","pages":"Article 108797"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725002147","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background and Objective:
T cells play a vital role in the immune system by recognizing and eliminating infected or cancerous cells, thus driving adaptive immune responses. Their activation is triggered by the binding of T cell receptors (TCRs) to epitopes presented on Major Histocompatibility Complex (MHC) molecules. However, experimentally identifying antigens that could be recognizable by T cells and possess immunogenic properties is resource-intensive, with most candidates proving non-immunogenic, underscoring the need for computational tools to predict peptide-MHC (pMHC) and TCR binding. Despite extensive efforts, accurately predicting TCR-antigen binding pairs remains challenging due to the vast diversity of TCRs.
Methods:
In this study, we propose a Contrastive Cross-attention model for TCR (ConTCR) and pMHC binding prediction. Firstly, the pMHC and TCR sequences are transformed into high-level embedding by pretrained encoders as feature representations. Then, we employ the multi-modal cross-attention to combine the features between pMHC sequences and TCR sequences. Next, based on the contrastive learning strategy, we pretrained the backbone of ConTCR to boost the model’s feature extraction ability for pMHC and TCR sequences. Finally, the model is fine-tuned for classification between positive and negative samples.
Results:
Based on this advanced strategy, our proposed model could effectively capture the critical information on TCR-pMHC interactions, and the model is visualized by the attention score heatmap for interpretability. ConTCR demonstrates strong generalization in predicting binding specificity for unseen epitopes and diverse TCR repertoires. On independent non-zero-shot test sets, the model achieved AUC-ROC scores of 0.849 and 0.950; on zero-shot test sets, it obtained AUC-ROC scores of 0.830 and 0.938.
Conclusion:
Our framework offers a promising solution for improving pMHC-TCR binding prediction and model interpretability. By leveraging the ConTCR model and pMHC-TCR features, we achieve more precise precision than recently advanced models. Overall, ConTCR is a robust tool for predicting pMHC-TCR binding and holds significant promise to advance TCR-based immunotherapies as a valuable artificial intelligence tool. The codes and data used in this study are available at this website.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.