机器学习应用于预测含有非规范氨基酸的肽与HLA-A0201之间的结合亲和力。

IF 2.6 3区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

PLoS ONE Pub Date : 2025-06-27 eCollection Date: 2025-01-01 DOI:10.1371/journal.pone.0314833

Shan Jiang, Zhaoqian Su, Nathaniel Bloodworth, Yunchao Liu, Cristina E Martina, David G Harrison, Jens Meiler

{"title":"机器学习应用于预测含有非规范氨基酸的肽与HLA-A0201之间的结合亲和力。","authors":"Shan Jiang, Zhaoqian Su, Nathaniel Bloodworth, Yunchao Liu, Cristina E Martina, David G Harrison, Jens Meiler","doi":"10.1371/journal.pone.0314833","DOIUrl":null,"url":null,"abstract":"Class Ι major histocompatibility complexes (MHC-Ι), encoded by the highly polymorphic HLA-A, HLA-B, and HLA-C genes in humans, are expressed on all nucleated cells. Both self and foreign proteins are processed to peptides of 8-10 amino acids, loaded into MHC-Ι, within the endoplasmic reticulum and then presented on the cell surface. Foreign peptides presented in this fashion activate CD8 + T cells and their immunogenicity correlates with their affinity for the MHC-Ι binding groove. Thus, predicting antigen binding affinity for MHC-Ι is a valuable tool for identifying potentially immunogenic antigens. While quite a few predictors for MHC-Ι binding exist, there are no currently available tools that can predict antigen/MHC-Ι binding affinity for antigens with explicitly labeled post-translational modifications or unusual/non-canonical amino acids (NCAAs). However, such modifications are increasingly recognized as critical mediators of peptide immunogenicity. In this work, we propose a machine learning application that quantifies the binding affinity of epitopes containing NCAAs to MHC-Ι and compares its performance with other commonly used regressors. Our model demonstrates robust performance, with 5-fold cross-validation yielding an R2 value of 0.477 and a root-mean-square error (RMSE) of 0.735, indicating strong predictive capability for peptides with NCAAs. This work provides a valuable tool for the computational design and optimization of peptides incorporating NCAAs, potentially accelerating the development of novel peptide-based therapeutics with enhanced properties and efficacy.","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 6","pages":"e0314833"},"PeriodicalIF":2.6000,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12204577/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine learning application to predict binding affinity between peptide containing non-canonical amino acids and HLA-A0201.\",\"authors\":\"Shan Jiang, Zhaoqian Su, Nathaniel Bloodworth, Yunchao Liu, Cristina E Martina, David G Harrison, Jens Meiler\",\"doi\":\"10.1371/journal.pone.0314833\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Class Ι major histocompatibility complexes (MHC-Ι), encoded by the highly polymorphic HLA-A, HLA-B, and HLA-C genes in humans, are expressed on all nucleated cells. Both self and foreign proteins are processed to peptides of 8-10 amino acids, loaded into MHC-Ι, within the endoplasmic reticulum and then presented on the cell surface. Foreign peptides presented in this fashion activate CD8 + T cells and their immunogenicity correlates with their affinity for the MHC-Ι binding groove. Thus, predicting antigen binding affinity for MHC-Ι is a valuable tool for identifying potentially immunogenic antigens. While quite a few predictors for MHC-Ι binding exist, there are no currently available tools that can predict antigen/MHC-Ι binding affinity for antigens with explicitly labeled post-translational modifications or unusual/non-canonical amino acids (NCAAs). However, such modifications are increasingly recognized as critical mediators of peptide immunogenicity. In this work, we propose a machine learning application that quantifies the binding affinity of epitopes containing NCAAs to MHC-Ι and compares its performance with other commonly used regressors. Our model demonstrates robust performance, with 5-fold cross-validation yielding an R2 value of 0.477 and a root-mean-square error (RMSE) of 0.735, indicating strong predictive capability for peptides with NCAAs. This work provides a valuable tool for the computational design and optimization of peptides incorporating NCAAs, potentially accelerating the development of novel peptide-based therapeutics with enhanced properties and efficacy.\",\"PeriodicalId\":20189,\"journal\":{\"name\":\"PLoS ONE\",\"volume\":\"20 6\",\"pages\":\"e0314833\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12204577/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PLoS ONE\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1371/journal.pone.0314833\",\"RegionNum\":3,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0314833","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

一类Ι主要组织相容性复合体（MHC-Ι），由人类高度多态性的HLA-A、HLA-B和HLA-C基因编码，在所有有核细胞上表达。自体和外源蛋白都被加工成8-10个氨基酸的肽，在内质网内装载到MHC-Ι中，然后呈现在细胞表面。以这种方式呈现的外源肽激活CD8 + T细胞，其免疫原性与其对MHC-Ι结合槽的亲和力相关。因此，预测MHC-Ι的抗原结合亲和力是鉴定潜在免疫原性抗原的有价值的工具。虽然存在相当多的MHC-Ι结合预测因子，但目前还没有可用的工具可以预测抗原/MHC-Ι与明确标记的翻译后修饰或异常/非规范氨基酸（NCAAs）抗原的结合亲和力。然而，这种修饰越来越被认为是肽免疫原性的关键介质。在这项工作中，我们提出了一个机器学习应用程序，该应用程序量化含有NCAAs的表位与MHC-Ι的结合亲和力，并将其性能与其他常用的回归量进行比较。我们的模型表现出稳健的性能，5倍交叉验证的R2值为0.477，均方根误差（RMSE）为0.735，表明对具有NCAAs的肽具有很强的预测能力。这项工作为包含NCAAs的肽的计算设计和优化提供了一个有价值的工具，可能会加速开发具有增强性能和功效的新型肽基治疗方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Machine learning application to predict binding affinity between peptide containing non-canonical amino acids and HLA-A0201.

查看原文本刊更多论文

Machine learning application to predict binding affinity between peptide containing non-canonical amino acids and HLA-A0201.

Class Ι major histocompatibility complexes (MHC-Ι), encoded by the highly polymorphic HLA-A, HLA-B, and HLA-C genes in humans, are expressed on all nucleated cells. Both self and foreign proteins are processed to peptides of 8-10 amino acids, loaded into MHC-Ι, within the endoplasmic reticulum and then presented on the cell surface. Foreign peptides presented in this fashion activate CD8 + T cells and their immunogenicity correlates with their affinity for the MHC-Ι binding groove. Thus, predicting antigen binding affinity for MHC-Ι is a valuable tool for identifying potentially immunogenic antigens. While quite a few predictors for MHC-Ι binding exist, there are no currently available tools that can predict antigen/MHC-Ι binding affinity for antigens with explicitly labeled post-translational modifications or unusual/non-canonical amino acids (NCAAs). However, such modifications are increasingly recognized as critical mediators of peptide immunogenicity. In this work, we propose a machine learning application that quantifies the binding affinity of epitopes containing NCAAs to MHC-Ι and compares its performance with other commonly used regressors. Our model demonstrates robust performance, with 5-fold cross-validation yielding an R2 value of 0.477 and a root-mean-square error (RMSE) of 0.735, indicating strong predictive capability for peptides with NCAAs. This work provides a valuable tool for the computational design and optimization of peptides incorporating NCAAs, potentially accelerating the development of novel peptide-based therapeutics with enhanced properties and efficacy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

PLoS ONE 生物-生物学

CiteScore

6.20

自引率

5.40%

发文量

14242

审稿时长

3.7 months

期刊介绍： PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides: * Open-access—freely accessible online, authors retain copyright * Fast publication times * Peer review by expert, practicing researchers * Post-publication tools to indicate quality and impact * Community-based dialogue on articles * Worldwide media coverage