ESMpHLA: Evolutionary Scale Model-Based Deep Learning Prediction of HLA Class I Binding Peptides

IF 5.9 4区 医学 Q2 CELL BIOLOGY
HLA Pub Date : 2025-05-22 DOI:10.1111/tan.70263
Xiaorui Cheng, Hu Mei, Pengji Chen, Haixia Wu, Rui Liu, Yuanyuan Lei, Pingqing Wang
{"title":"ESMpHLA: Evolutionary Scale Model-Based Deep Learning Prediction of HLA Class I Binding Peptides","authors":"Xiaorui Cheng,&nbsp;Hu Mei,&nbsp;Pengji Chen,&nbsp;Haixia Wu,&nbsp;Rui Liu,&nbsp;Yuanyuan Lei,&nbsp;Pingqing Wang","doi":"10.1111/tan.70263","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The recognition of endogenous peptides by HLA class I plays a crucial role in CD8+ T cell immune responses and human adaptive cell immune. Thus, the prediction of HLA class I-peptide binding affinities is always the core issue for the research of immune recognition and vaccine development. In this study, an evolutionary scale model (ESM) combined with parallel CNN blocks and a cross attention mechanism was used to construct a novel ESMpHLA model for predicting HLA class I binding peptides. Based on the 91,560 binding peptides of 41 HLA-A alleles, 56,731 of 50 HLA-B alleles and 2444 of 10 HLA-C alleles, the ESMpHLA model was successfully established and achieved satisfying prediction performances with the overall accuracy and AUC values of 0.874 and 0.938 for the test dataset. The results indicate that the ESMpHLA model performs well in dealing with different HLA class I 2-field alleles as well as the peptides with different lengths. Then, the generalisation ability of the ESMpHLA model was validated by an independent test dataset compiled from recent IEDB weekly benchmark datasets. The results showed that the ESMpHLA model achieved the highest ROC-AUC and PR-AUC values when compared with the latest BVMHC, CapsNet-MHC, STMHCpan and BVLSTM models. In addition, two ensemble models were also established by integrating the above 5 deep learning models using soft-voting and hard-voting strategies.</p>\n </div>","PeriodicalId":13172,"journal":{"name":"HLA","volume":"105 5","pages":""},"PeriodicalIF":5.9000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"HLA","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/tan.70263","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The recognition of endogenous peptides by HLA class I plays a crucial role in CD8+ T cell immune responses and human adaptive cell immune. Thus, the prediction of HLA class I-peptide binding affinities is always the core issue for the research of immune recognition and vaccine development. In this study, an evolutionary scale model (ESM) combined with parallel CNN blocks and a cross attention mechanism was used to construct a novel ESMpHLA model for predicting HLA class I binding peptides. Based on the 91,560 binding peptides of 41 HLA-A alleles, 56,731 of 50 HLA-B alleles and 2444 of 10 HLA-C alleles, the ESMpHLA model was successfully established and achieved satisfying prediction performances with the overall accuracy and AUC values of 0.874 and 0.938 for the test dataset. The results indicate that the ESMpHLA model performs well in dealing with different HLA class I 2-field alleles as well as the peptides with different lengths. Then, the generalisation ability of the ESMpHLA model was validated by an independent test dataset compiled from recent IEDB weekly benchmark datasets. The results showed that the ESMpHLA model achieved the highest ROC-AUC and PR-AUC values when compared with the latest BVMHC, CapsNet-MHC, STMHCpan and BVLSTM models. In addition, two ensemble models were also established by integrating the above 5 deep learning models using soft-voting and hard-voting strategies.

基于进化尺度模型的HLA I类结合肽深度学习预测
HLA I类对内源性多肽的识别在CD8+ T细胞免疫应答和人类适应性细胞免疫中起着至关重要的作用。因此,HLA i类肽结合亲和力的预测一直是免疫识别研究和疫苗开发的核心问题。本研究采用进化尺度模型(ESM)结合平行CNN块和交叉注意机制,构建了预测HLA I类结合肽的ESMpHLA模型。基于41个HLA-A等位基因的91560条结合肽、50个HLA-B等位基因的56731条结合肽和10个HLA-C等位基因的2444条结合肽,成功建立了ESMpHLA模型,测试数据集的总体准确率和AUC值分别为0.874和0.938,取得了令人满意的预测效果。结果表明,ESMpHLA模型能较好地处理不同的HLA I类2场等位基因和不同长度的肽段。然后,通过独立的测试数据集对ESMpHLA模型的泛化能力进行了验证,这些数据集来自最近的IEDB每周基准数据集。结果表明,与最新的BVMHC、CapsNet-MHC、STMHCpan和BVLSTM模式相比,esmpla模式的ROC-AUC和PR-AUC值最高。此外,采用软投票和硬投票策略对上述5个深度学习模型进行整合,建立了两个集成模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
HLA
HLA Immunology and Microbiology-Immunology
CiteScore
3.00
自引率
28.80%
发文量
368
期刊介绍: HLA, the journal, publishes articles on various aspects of immunogenetics. These include the immunogenetics of cell surface antigens, the ontogeny and phylogeny of the immune system, the immunogenetics of cell interactions, the functional aspects of cell surface molecules and their natural ligands, and the role of tissue antigens in immune reactions. Additionally, the journal covers experimental and clinical transplantation, the relationships between normal tissue antigens and tumor-associated antigens, the genetic control of immune response and disease susceptibility, and the biochemistry and molecular biology of alloantigens and leukocyte differentiation. Manuscripts on molecules expressed on lymphoid cells, myeloid cells, platelets, and non-lineage-restricted antigens are welcomed. Lastly, the journal focuses on the immunogenetics of histocompatibility antigens in both humans and experimental animals, including their tissue distribution, regulation, and expression in normal and malignant cells, as well as the use of antigens as markers for disease.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信