Identifying, Prioritizing, and Visualizing Functional Promoter SNVs with the Recurrence-agnostic REMIND-Cancer Pipeline and pSNV Hunter

Nicholas Allen Baclig Abad, Irina Glas, Chen Hong, Yoann Pageaud, Barbara Hutter, Benedikt Brors, Cindy Korner, Lars Feuerbach
{"title":"Identifying, Prioritizing, and Visualizing Functional Promoter SNVs with the Recurrence-agnostic REMIND-Cancer Pipeline and pSNV Hunter","authors":"Nicholas Allen Baclig Abad, Irina Glas, Chen Hong, Yoann Pageaud, Barbara Hutter, Benedikt Brors, Cindy Korner, Lars Feuerbach","doi":"10.1101/2024.09.10.612212","DOIUrl":null,"url":null,"abstract":"Cancer is a heterogeneous disease that arises due to mutations that drive cancer progression. However, the identification of these functional mutations has typically focused only on protein-coding DNA. Among non-coding mutations, only a few have been clearly associated with cancer. We hypothesize that this gap in discovery is partly due to the limitations of current methods requiring high recurrence of mutations. To support candidate selection for experimental validation of lowly recurrent and singleton promoter mutations, new computational approaches for the integrated analysis of multi-omics data are required. To address this challenge, the REMIND-Cancer Pipeline leverages whole-genome sequencing and RNA-Seq data to extract and prioritize functional promoter mutations, regardless of their recurrence status. Subsequently, pSNV Hunter aggregates and visualizes comprehensive information for each candidate. We demonstrate the functionality of both tools by applying it to the PCAWG dataset. This workflow successfully identified and prioritized known highly-recurrent mutations, as well as, novel singletons and lowly recurrent candidates. Hence, the output of our workflow directly supports hypothesis generation for subsequent experimental validation to overcome limitations of recurrence-based approaches.","PeriodicalId":501233,"journal":{"name":"bioRxiv - Cancer Biology","volume":"47 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Cancer Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.10.612212","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Cancer is a heterogeneous disease that arises due to mutations that drive cancer progression. However, the identification of these functional mutations has typically focused only on protein-coding DNA. Among non-coding mutations, only a few have been clearly associated with cancer. We hypothesize that this gap in discovery is partly due to the limitations of current methods requiring high recurrence of mutations. To support candidate selection for experimental validation of lowly recurrent and singleton promoter mutations, new computational approaches for the integrated analysis of multi-omics data are required. To address this challenge, the REMIND-Cancer Pipeline leverages whole-genome sequencing and RNA-Seq data to extract and prioritize functional promoter mutations, regardless of their recurrence status. Subsequently, pSNV Hunter aggregates and visualizes comprehensive information for each candidate. We demonstrate the functionality of both tools by applying it to the PCAWG dataset. This workflow successfully identified and prioritized known highly-recurrent mutations, as well as, novel singletons and lowly recurrent candidates. Hence, the output of our workflow directly supports hypothesis generation for subsequent experimental validation to overcome limitations of recurrence-based approaches.
利用具有复发诊断功能的 REMIND-Cancer Pipeline 和 pSNV 猎人识别、优先排序和可视化功能启动子 SNV
癌症是一种异质性疾病,由于突变导致癌症进展。然而,对这些功能性突变的鉴定通常只侧重于编码蛋白质的 DNA。在非编码突变中,只有少数突变与癌症明确相关。我们推测,这种发现上的差距部分是由于目前要求高突变复发率的方法的局限性造成的。为了支持对低复发和单体启动子突变进行实验验证的候选选择,需要采用新的计算方法对多组学数据进行综合分析。为了应对这一挑战,REMIND-Cancer Pipeline 利用全基因组测序和 RNA-Seq 数据提取功能性启动子突变,并对其进行优先排序,而不论其复发状况如何。随后,pSNV Hunter 聚合并可视化每个候选基因的综合信息。我们将这两种工具应用于 PCAWG 数据集,展示了它们的功能。该工作流程成功识别并优先处理了已知的高复发性突变、新型单突变和低复发性候选突变。因此,我们工作流程的输出结果可直接支持假设的生成,以便随后进行实验验证,从而克服基于复发的方法的局限性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信