ImmunoPepper: Extracting personalized peptides from complex splicing graphs.

IF 5.4
Laurie Prélot, Jiayu Chen, Matthias Hüser, André Kahles, Gunnar Rätsch
{"title":"ImmunoPepper: Extracting personalized peptides from complex splicing graphs.","authors":"Laurie Prélot, Jiayu Chen, Matthias Hüser, André Kahles, Gunnar Rätsch","doi":"10.1093/bioinformatics/btaf492","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>RNA Sequencing enables the characterization of a cell's transcript isoforms in healthy and disease conditions. In the context of cancer, local transcript variability may translate to splicing-derived tumor-associated peptides recognized by the immune system. A software tool that extracts such candidate peptides, is of great interest for personalized cancer therapy.</p><p><strong>Results: </strong>We present the open-source software tool ImmunoPepper, which extracts a set of biologically plausible peptides from a splicing graph, derived from a set of RNA-Seq datasets. This peptide set can be personalized with germline and somatic variation and takes novel RNA splice variants into account. ImmunoPepper supports several filtering options, including subtraction of normal tissue background, prediction of MHC-binding affinity, as well as MassSpec-based validation of identified peptides. We analyzed 32 ovarian cancer (TCGA-OV) and 31 breast invasive carcinoma (TCGA-BRCA) samples, with a strict cancer-specific filtering configuration, and obtained on average 834 and 569 cancer-specific predicted MHC-I binding 9-mers per sample, for each cohort, respectively. MassSpec validation with the target-decoy competition Subset-Neighbor-Search (SNS) showed an average validation rate of 4.5% per TCGA-OV sample and 5.3% per TCGA-BRCA sample. This corresponded to 25 MHC-I binders 9-mers per TCGA-OV sample, and 20 MHC-I binders 9-mers per TCGA-BRCA sample in average. Finally, we draw conclusions about the best framework for generation of splicing-derived neoepitopes and recommend to use joint data structures when processing homogeneously a cancer and a normal cohort and to focus on reproducibility of the candidates across generation pipelines.</p><p><strong>Availability: </strong>ImmunoPepper is implemented in Python 3 and is available as open source software at https://github.com/ratschlab/immunopepper. The online documentation can be found at https://immunopepper.readthedocs.io/en/latest/.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf492","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: RNA Sequencing enables the characterization of a cell's transcript isoforms in healthy and disease conditions. In the context of cancer, local transcript variability may translate to splicing-derived tumor-associated peptides recognized by the immune system. A software tool that extracts such candidate peptides, is of great interest for personalized cancer therapy.

Results: We present the open-source software tool ImmunoPepper, which extracts a set of biologically plausible peptides from a splicing graph, derived from a set of RNA-Seq datasets. This peptide set can be personalized with germline and somatic variation and takes novel RNA splice variants into account. ImmunoPepper supports several filtering options, including subtraction of normal tissue background, prediction of MHC-binding affinity, as well as MassSpec-based validation of identified peptides. We analyzed 32 ovarian cancer (TCGA-OV) and 31 breast invasive carcinoma (TCGA-BRCA) samples, with a strict cancer-specific filtering configuration, and obtained on average 834 and 569 cancer-specific predicted MHC-I binding 9-mers per sample, for each cohort, respectively. MassSpec validation with the target-decoy competition Subset-Neighbor-Search (SNS) showed an average validation rate of 4.5% per TCGA-OV sample and 5.3% per TCGA-BRCA sample. This corresponded to 25 MHC-I binders 9-mers per TCGA-OV sample, and 20 MHC-I binders 9-mers per TCGA-BRCA sample in average. Finally, we draw conclusions about the best framework for generation of splicing-derived neoepitopes and recommend to use joint data structures when processing homogeneously a cancer and a normal cohort and to focus on reproducibility of the candidates across generation pipelines.

Availability: ImmunoPepper is implemented in Python 3 and is available as open source software at https://github.com/ratschlab/immunopepper. The online documentation can be found at https://immunopepper.readthedocs.io/en/latest/.

Supplementary information: Supplementary data are available at Bioinformatics online.

免疫辣椒:从复杂剪接图中提取个性化肽。
动机:RNA测序能够表征健康和疾病条件下细胞的转录异构体。在癌症的背景下,局部转录变异性可能转化为剪接衍生的肿瘤相关肽被免疫系统识别。一个软件工具,提取这些候选肽,是非常感兴趣的个性化癌症治疗。结果:我们提出了开源软件工具ImmunoPepper,它从一组RNA-Seq数据集的剪接图中提取了一组生物学上合理的肽。这种肽集可以个性化的生殖系和体细胞变异,并考虑到新的RNA剪接变异。ImmunoPepper支持多种过滤选项,包括正常组织背景的减去,mhc结合亲和力的预测,以及基于massspec的鉴定肽的验证。我们分析了32例卵巢癌(TCGA-OV)和31例乳腺浸润性癌(TCGA-BRCA)样本,采用严格的癌症特异性过滤配置,每个队列平均每个样本分别获得834和569个癌症特异性预测MHC-I结合9-mers。使用目标-诱饵竞争子集-邻居搜索(SNS)的MassSpec验证显示,TCGA-OV样本的平均验证率为4.5%,TCGA-BRCA样本的平均验证率为5.3%。这相当于每个TCGA-OV样本平均有25个MHC-I结合物9米,每个TCGA-BRCA样本平均有20个MHC-I结合物9米。最后,我们得出了关于剪接衍生新表位生成的最佳框架的结论,并建议在同质处理癌症和正常队列时使用联合数据结构,并关注跨生成管道的候选数据的可重复性。可用性:ImmunoPepper是在Python 3中实现的,并且可以在https://github.com/ratschlab/immunopepper上作为开源软件获得。在线文件可在https://immunopepper.readthedocs.io/en/latest/.Supplementary信息上找到:补充数据可在Bioinformatics在线上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信