Xiaorui Cheng, Hu Mei, Pengji Chen, Haixia Wu, Rui Liu, Yuanyuan Lei, Pingqing Wang
{"title":"ESMpHLA: Evolutionary Scale Model-Based Deep Learning Prediction of HLA Class I Binding Peptides","authors":"Xiaorui Cheng, Hu Mei, Pengji Chen, Haixia Wu, Rui Liu, Yuanyuan Lei, Pingqing Wang","doi":"10.1111/tan.70263","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>The recognition of endogenous peptides by HLA class I plays a crucial role in CD8+ T cell immune responses and human adaptive cell immune. Thus, the prediction of HLA class I-peptide binding affinities is always the core issue for the research of immune recognition and vaccine development. In this study, an evolutionary scale model (ESM) combined with parallel CNN blocks and a cross attention mechanism was used to construct a novel ESMpHLA model for predicting HLA class I binding peptides. Based on the 91,560 binding peptides of 41 HLA-A alleles, 56,731 of 50 HLA-B alleles and 2444 of 10 HLA-C alleles, the ESMpHLA model was successfully established and achieved satisfying prediction performances with the overall accuracy and AUC values of 0.874 and 0.938 for the test dataset. The results indicate that the ESMpHLA model performs well in dealing with different HLA class I 2-field alleles as well as the peptides with different lengths. Then, the generalisation ability of the ESMpHLA model was validated by an independent test dataset compiled from recent IEDB weekly benchmark datasets. The results showed that the ESMpHLA model achieved the highest ROC-AUC and PR-AUC values when compared with the latest BVMHC, CapsNet-MHC, STMHCpan and BVLSTM models. In addition, two ensemble models were also established by integrating the above 5 deep learning models using soft-voting and hard-voting strategies.</p>\n </div>","PeriodicalId":13172,"journal":{"name":"HLA","volume":"105 5","pages":""},"PeriodicalIF":5.9000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"HLA","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/tan.70263","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The recognition of endogenous peptides by HLA class I plays a crucial role in CD8+ T cell immune responses and human adaptive cell immune. Thus, the prediction of HLA class I-peptide binding affinities is always the core issue for the research of immune recognition and vaccine development. In this study, an evolutionary scale model (ESM) combined with parallel CNN blocks and a cross attention mechanism was used to construct a novel ESMpHLA model for predicting HLA class I binding peptides. Based on the 91,560 binding peptides of 41 HLA-A alleles, 56,731 of 50 HLA-B alleles and 2444 of 10 HLA-C alleles, the ESMpHLA model was successfully established and achieved satisfying prediction performances with the overall accuracy and AUC values of 0.874 and 0.938 for the test dataset. The results indicate that the ESMpHLA model performs well in dealing with different HLA class I 2-field alleles as well as the peptides with different lengths. Then, the generalisation ability of the ESMpHLA model was validated by an independent test dataset compiled from recent IEDB weekly benchmark datasets. The results showed that the ESMpHLA model achieved the highest ROC-AUC and PR-AUC values when compared with the latest BVMHC, CapsNet-MHC, STMHCpan and BVLSTM models. In addition, two ensemble models were also established by integrating the above 5 deep learning models using soft-voting and hard-voting strategies.
期刊介绍:
HLA, the journal, publishes articles on various aspects of immunogenetics. These include the immunogenetics of cell surface antigens, the ontogeny and phylogeny of the immune system, the immunogenetics of cell interactions, the functional aspects of cell surface molecules and their natural ligands, and the role of tissue antigens in immune reactions. Additionally, the journal covers experimental and clinical transplantation, the relationships between normal tissue antigens and tumor-associated antigens, the genetic control of immune response and disease susceptibility, and the biochemistry and molecular biology of alloantigens and leukocyte differentiation. Manuscripts on molecules expressed on lymphoid cells, myeloid cells, platelets, and non-lineage-restricted antigens are welcomed. Lastly, the journal focuses on the immunogenetics of histocompatibility antigens in both humans and experimental animals, including their tissue distribution, regulation, and expression in normal and malignant cells, as well as the use of antigens as markers for disease.