{"title":"CLTAP:一种使用预训练的大型语言嵌入模型和对比学习来增强语境共注意机制的tap结合肽预测方法","authors":"Lun Zhu , Wei Chen , Sen Yang","doi":"10.1016/j.eswa.2025.127991","DOIUrl":null,"url":null,"abstract":"<div><div>The transporter associated with antigen processing (TAP) transports peptides to the endoplasmic reticulum, which is a crucial step for determining CD8<sup>+</sup> T cell epitopes. CLTAP is a novel method for predicting TAP-binding peptides, which enhances prediction accuracy by integrating contrastive learning (CL) with a contextual co-attention mechanism. To the best of our knowledge, this is the first study to utilize CL for predicting TAP-binding peptides. Leveraging CL to capture the consistency and inconsistency among similar and dissimilar TAP-binding peptides, respectively, significantly enhanced the model’s ability to differentiate between various samples. Additionally, the contextual co-attention mechanism introduced in this study effectively integrates and emphasizes two distinct features of peptide sequences. CL further enhanced the fused features, resulting in discriminative representations and significantly improving the accuracy of TAP-binding peptide prediction. By integrating these two technologies, CLTAP demonstrates superior predictive performance than existing methods across multiple benchmark datasets. Evaluations on test datasets indicate that CLTAP outperforms the second-best method by 6.32 % in Accuracy, 15.26 % in Matthews Correlation Coefficient, 4.48 % in F1-score, and 4.13 % in AUROC. It offers an efficient and reliable tool for TAP-binding peptide research and applications. CLTAP is freely available at <span><span>https://github.com/ChenWeiCCZU/CLTAP</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"285 ","pages":"Article 127991"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CLTAP: A TAP-binding peptide prediction method using pre-trained large language-embedding models with contrastive learning to enhance contextual co-attention mechanism\",\"authors\":\"Lun Zhu , Wei Chen , Sen Yang\",\"doi\":\"10.1016/j.eswa.2025.127991\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The transporter associated with antigen processing (TAP) transports peptides to the endoplasmic reticulum, which is a crucial step for determining CD8<sup>+</sup> T cell epitopes. CLTAP is a novel method for predicting TAP-binding peptides, which enhances prediction accuracy by integrating contrastive learning (CL) with a contextual co-attention mechanism. To the best of our knowledge, this is the first study to utilize CL for predicting TAP-binding peptides. Leveraging CL to capture the consistency and inconsistency among similar and dissimilar TAP-binding peptides, respectively, significantly enhanced the model’s ability to differentiate between various samples. Additionally, the contextual co-attention mechanism introduced in this study effectively integrates and emphasizes two distinct features of peptide sequences. CL further enhanced the fused features, resulting in discriminative representations and significantly improving the accuracy of TAP-binding peptide prediction. By integrating these two technologies, CLTAP demonstrates superior predictive performance than existing methods across multiple benchmark datasets. Evaluations on test datasets indicate that CLTAP outperforms the second-best method by 6.32 % in Accuracy, 15.26 % in Matthews Correlation Coefficient, 4.48 % in F1-score, and 4.13 % in AUROC. It offers an efficient and reliable tool for TAP-binding peptide research and applications. CLTAP is freely available at <span><span>https://github.com/ChenWeiCCZU/CLTAP</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"285 \",\"pages\":\"Article 127991\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425016124\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425016124","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
CLTAP: A TAP-binding peptide prediction method using pre-trained large language-embedding models with contrastive learning to enhance contextual co-attention mechanism
The transporter associated with antigen processing (TAP) transports peptides to the endoplasmic reticulum, which is a crucial step for determining CD8+ T cell epitopes. CLTAP is a novel method for predicting TAP-binding peptides, which enhances prediction accuracy by integrating contrastive learning (CL) with a contextual co-attention mechanism. To the best of our knowledge, this is the first study to utilize CL for predicting TAP-binding peptides. Leveraging CL to capture the consistency and inconsistency among similar and dissimilar TAP-binding peptides, respectively, significantly enhanced the model’s ability to differentiate between various samples. Additionally, the contextual co-attention mechanism introduced in this study effectively integrates and emphasizes two distinct features of peptide sequences. CL further enhanced the fused features, resulting in discriminative representations and significantly improving the accuracy of TAP-binding peptide prediction. By integrating these two technologies, CLTAP demonstrates superior predictive performance than existing methods across multiple benchmark datasets. Evaluations on test datasets indicate that CLTAP outperforms the second-best method by 6.32 % in Accuracy, 15.26 % in Matthews Correlation Coefficient, 4.48 % in F1-score, and 4.13 % in AUROC. It offers an efficient and reliable tool for TAP-binding peptide research and applications. CLTAP is freely available at https://github.com/ChenWeiCCZU/CLTAP.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.