David Jones, Keiran M. Raine, Helen Davies, Patrick S. Tarpey, Adam P. Butler, Jon W. Teague, Serena Nik-Zainal, Peter J. Campbell
下载PDF
{"title":"cgpCaVEManWrapper: Simple Execution of CaVEMan in Order to Detect Somatic Single Nucleotide Variants in NGS Data","authors":"David Jones, Keiran M. Raine, Helen Davies, Patrick S. Tarpey, Adam P. Butler, Jon W. Teague, Serena Nik-Zainal, Peter J. Campbell","doi":"10.1002/cpbi.20","DOIUrl":null,"url":null,"abstract":"<p>CaVEMan is an expectation maximization–based somatic substitution-detection algorithm that is written in C. The algorithm analyzes sequence data from a test sample, such as a tumor relative to a reference normal sample from the same patient and the reference genome. It performs a comparative analysis of the tumor and normal sample to derive a probabilistic estimate for putative somatic substitutions. When combined with a set of validated post-hoc filters, CaVEMan generates a set of somatic substitution calls with high recall and positive predictive value. Here we provide instructions for using a wrapper script called cgpCaVEManWrapper, which runs the CaVEMan algorithm and additional downstream post-hoc filters. We describe both a simple one-shot run of cgpCaVEManWrapper and a more in-depth implementation suited to large-scale compute farms. © 2016 by John Wiley & Sons, Inc.</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"56 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2016-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.20","citationCount":"159","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current protocols in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpbi.20","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 159
引用
批量引用
Abstract
CaVEMan is an expectation maximization–based somatic substitution-detection algorithm that is written in C. The algorithm analyzes sequence data from a test sample, such as a tumor relative to a reference normal sample from the same patient and the reference genome. It performs a comparative analysis of the tumor and normal sample to derive a probabilistic estimate for putative somatic substitutions. When combined with a set of validated post-hoc filters, CaVEMan generates a set of somatic substitution calls with high recall and positive predictive value. Here we provide instructions for using a wrapper script called cgpCaVEManWrapper, which runs the CaVEMan algorithm and additional downstream post-hoc filters. We describe both a simple one-shot run of cgpCaVEManWrapper and a more in-depth implementation suited to large-scale compute farms. © 2016 by John Wiley & Sons, Inc.
cgpCaVEManWrapper:简单执行CaVEMan以检测NGS数据中的体细胞单核苷酸变异
CaVEMan是一种基于期望最大化的体细胞替代检测算法,用c语言编写。该算法分析来自测试样本的序列数据,例如来自同一患者和参考基因组的相对于参考正常样本的肿瘤。它执行肿瘤和正常样本的比较分析,以得出假定的体细胞替代的概率估计。当与一组经过验证的事后过滤器相结合时,CaVEMan生成了一组具有高召回率和正预测值的体细胞替换呼叫。在这里,我们提供了使用名为cgpCaVEManWrapper的包装器脚本的说明,该脚本运行CaVEMan算法和其他下游事后过滤器。我们描述了cgpCaVEManWrapper的一个简单的一次性运行,以及适合大规模计算场的更深入的实现。©2016 by John Wiley &儿子,Inc。
本文章由计算机程序翻译,如有差异,请以英文原文为准。