APEX-pHLA: A novel method for accurate prediction of the binding between exogenous short peptides and HLA class I molecules

IF 4.2 3区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Zhihao Su , Yejian Wu , Kaiqiang Cao , Jie Du , Lujing Cao , Zhipeng Wu , Xinyi Wu , Xinqiao Wang , Ying Song , Xudong Wang , Hongliang Duan
{"title":"APEX-pHLA: A novel method for accurate prediction of the binding between exogenous short peptides and HLA class I molecules","authors":"Zhihao Su ,&nbsp;Yejian Wu ,&nbsp;Kaiqiang Cao ,&nbsp;Jie Du ,&nbsp;Lujing Cao ,&nbsp;Zhipeng Wu ,&nbsp;Xinyi Wu ,&nbsp;Xinqiao Wang ,&nbsp;Ying Song ,&nbsp;Xudong Wang ,&nbsp;Hongliang Duan","doi":"10.1016/j.ymeth.2024.05.013","DOIUrl":null,"url":null,"abstract":"<div><p>Human leukocyte antigen (HLA) molecules play critically significant role within the realm of immunotherapy due to their capacities to recognize and bind exogenous antigens such as peptides, subsequently delivering them to immune cells. Predicting the binding between peptides and HLA molecules (pHLA) can expedite the screening of immunogenic peptides and facilitate vaccine design. However, traditional experimental methods are time-consuming and inefficient. In this study, an efficient method based on deep learning was developed for predicting peptide-HLA binding, which treated peptide sequences as linguistic entities. It combined the architectures of textCNN and BiLSTM to create a deep neural network model called APEX-pHLA. This model operated without limitations related to HLA class I allele variants and peptide segment lengths, enabling efficient encoding of sequence features for both HLA and peptide segments. On the independent test set, the model achieved Accuracy, ROC_AUC, F1, and MCC is 0.9449, 0.9850, 0.9453, and 0.8899, respectively. Similarly, on an external test set, the results were 0.9803, 0.9574, 0.8835, and 0.7863, respectively. These findings outperformed fifteen methods previously reported in the literature. The accurate prediction capability of the APEX-pHLA model in peptide-HLA binding might provide valuable insights for future HLA vaccine design.</p></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"228 ","pages":"Pages 38-47"},"PeriodicalIF":4.2000,"publicationDate":"2024-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202324001324","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Human leukocyte antigen (HLA) molecules play critically significant role within the realm of immunotherapy due to their capacities to recognize and bind exogenous antigens such as peptides, subsequently delivering them to immune cells. Predicting the binding between peptides and HLA molecules (pHLA) can expedite the screening of immunogenic peptides and facilitate vaccine design. However, traditional experimental methods are time-consuming and inefficient. In this study, an efficient method based on deep learning was developed for predicting peptide-HLA binding, which treated peptide sequences as linguistic entities. It combined the architectures of textCNN and BiLSTM to create a deep neural network model called APEX-pHLA. This model operated without limitations related to HLA class I allele variants and peptide segment lengths, enabling efficient encoding of sequence features for both HLA and peptide segments. On the independent test set, the model achieved Accuracy, ROC_AUC, F1, and MCC is 0.9449, 0.9850, 0.9453, and 0.8899, respectively. Similarly, on an external test set, the results were 0.9803, 0.9574, 0.8835, and 0.7863, respectively. These findings outperformed fifteen methods previously reported in the literature. The accurate prediction capability of the APEX-pHLA model in peptide-HLA binding might provide valuable insights for future HLA vaccine design.

APEX-pHLA:准确预测外源短肽与 HLA I 类分子结合的新方法。
人类白细胞抗原(HLA)分子在免疫疗法领域发挥着至关重要的作用,因为它们能够识别和结合肽等外源抗原,然后将其传递给免疫细胞。预测肽与 HLA 分子(pHLA)之间的结合可以加快免疫原肽的筛选,促进疫苗设计。然而,传统的实验方法耗时长、效率低。本研究开发了一种基于深度学习的预测多肽与 HLA 结合的高效方法,该方法将多肽序列视为语言实体。它结合了 textCNN 和 BiLSTM 的架构,创建了一个名为 APEX-pHLA 的深度神经网络模型。该模型在运行时不受 HLA I 类等位基因变异和肽段长度的限制,因此能有效编码 HLA 和肽段的序列特征。在独立测试集上,该模型的准确度、ROC_AUC、F1 和 MCC 分别为 0.9449、0.9850、0.9453 和 0.8899。同样,在外部测试集上,结果分别为 0.9803、0.9574、0.8835 和 0.7863。这些结果优于之前文献中报道的 15 种方法。APEX-pHLA 模型在多肽-HLA 结合方面的准确预测能力可能会为未来的 HLA 疫苗设计提供有价值的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Methods
Methods 生物-生化研究方法
CiteScore
9.80
自引率
2.10%
发文量
222
审稿时长
11.3 weeks
期刊介绍: Methods focuses on rapidly developing techniques in the experimental biological and medical sciences. Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信