APEX-pHLA: A novel method for accurate prediction of the binding between exogenous short peptides and HLA class I molecules

IF 4.2 3区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

Methods Pub Date : 2024-05-19 DOI:10.1016/j.ymeth.2024.05.013

Zhihao Su , Yejian Wu , Kaiqiang Cao , Jie Du , Lujing Cao , Zhipeng Wu , Xinyi Wu , Xinqiao Wang , Ying Song , Xudong Wang , Hongliang Duan

{"title":"APEX-pHLA: A novel method for accurate prediction of the binding between exogenous short peptides and HLA class I molecules","authors":"Zhihao Su , Yejian Wu , Kaiqiang Cao , Jie Du , Lujing Cao , Zhipeng Wu , Xinyi Wu , Xinqiao Wang , Ying Song , Xudong Wang , Hongliang Duan","doi":"10.1016/j.ymeth.2024.05.013","DOIUrl":null,"url":null,"abstract":"<div><p>Human leukocyte antigen (HLA) molecules play critically significant role within the realm of immunotherapy due to their capacities to recognize and bind exogenous antigens such as peptides, subsequently delivering them to immune cells. Predicting the binding between peptides and HLA molecules (pHLA) can expedite the screening of immunogenic peptides and facilitate vaccine design. However, traditional experimental methods are time-consuming and inefficient. In this study, an efficient method based on deep learning was developed for predicting peptide-HLA binding, which treated peptide sequences as linguistic entities. It combined the architectures of textCNN and BiLSTM to create a deep neural network model called APEX-pHLA. This model operated without limitations related to HLA class I allele variants and peptide segment lengths, enabling efficient encoding of sequence features for both HLA and peptide segments. On the independent test set, the model achieved Accuracy, ROC_AUC, F1, and MCC is 0.9449, 0.9850, 0.9453, and 0.8899, respectively. Similarly, on an external test set, the results were 0.9803, 0.9574, 0.8835, and 0.7863, respectively. These findings outperformed fifteen methods previously reported in the literature. The accurate prediction capability of the APEX-pHLA model in peptide-HLA binding might provide valuable insights for future HLA vaccine design.</p></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"228 ","pages":"Pages 38-47"},"PeriodicalIF":4.2000,"publicationDate":"2024-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202324001324","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Human leukocyte antigen (HLA) molecules play critically significant role within the realm of immunotherapy due to their capacities to recognize and bind exogenous antigens such as peptides, subsequently delivering them to immune cells. Predicting the binding between peptides and HLA molecules (pHLA) can expedite the screening of immunogenic peptides and facilitate vaccine design. However, traditional experimental methods are time-consuming and inefficient. In this study, an efficient method based on deep learning was developed for predicting peptide-HLA binding, which treated peptide sequences as linguistic entities. It combined the architectures of textCNN and BiLSTM to create a deep neural network model called APEX-pHLA. This model operated without limitations related to HLA class I allele variants and peptide segment lengths, enabling efficient encoding of sequence features for both HLA and peptide segments. On the independent test set, the model achieved Accuracy, ROC_AUC, F1, and MCC is 0.9449, 0.9850, 0.9453, and 0.8899, respectively. Similarly, on an external test set, the results were 0.9803, 0.9574, 0.8835, and 0.7863, respectively. These findings outperformed fifteen methods previously reported in the literature. The accurate prediction capability of the APEX-pHLA model in peptide-HLA binding might provide valuable insights for future HLA vaccine design.

查看原文本刊更多论文

APEX-pHLA：准确预测外源短肽与 HLA I 类分子结合的新方法。

人类白细胞抗原（HLA）分子在免疫疗法领域发挥着至关重要的作用，因为它们能够识别和结合肽等外源抗原，然后将其传递给免疫细胞。预测肽与 HLA 分子（pHLA）之间的结合可以加快免疫原肽的筛选，促进疫苗设计。然而，传统的实验方法耗时长、效率低。本研究开发了一种基于深度学习的预测多肽与 HLA 结合的高效方法，该方法将多肽序列视为语言实体。它结合了 textCNN 和 BiLSTM 的架构，创建了一个名为 APEX-pHLA 的深度神经网络模型。该模型在运行时不受 HLA I 类等位基因变异和肽段长度的限制，因此能有效编码 HLA 和肽段的序列特征。在独立测试集上，该模型的准确度、ROC_AUC、F1 和 MCC 分别为 0.9449、0.9850、0.9453 和 0.8899。同样，在外部测试集上，结果分别为 0.9803、0.9574、0.8835 和 0.7863。这些结果优于之前文献中报道的 15 种方法。APEX-pHLA 模型在多肽-HLA 结合方面的准确预测能力可能会为未来的 HLA 疫苗设计提供有价值的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Methods 生物-生化研究方法

CiteScore

9.80

自引率

2.10%

发文量

222

审稿时长

11.3 weeks

期刊介绍： Methods focuses on rapidly developing techniques in the experimental biological and medical sciences. Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.