vcfView: An Extensible Data Visualization and Quality Assurance Platform for Integrated Somatic Variant Analysis.

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Cancer Informatics Pub Date : 2020-11-11 eCollection Date: 2020-01-01 DOI:10.1177/1176935120972377
Brian O'Sullivan, Cathal Seoighe
{"title":"vcfView: An Extensible Data Visualization and Quality Assurance Platform for Integrated Somatic Variant Analysis.","authors":"Brian O'Sullivan,&nbsp;Cathal Seoighe","doi":"10.1177/1176935120972377","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Somatic mutations can have critical prognostic and therapeutic implications for cancer patients. Although targeted methods are often used to assay specific cancer driver mutations, high throughput sequencing is frequently applied to discover novel driver mutations and to determine the status of less-frequent driver mutations. The task of recovering somatic mutations from these data is nontrivial as somatic mutations must be distinguished from germline variants, sequencing errors, and other artefacts. Consequently, bioinformatics pipelines for recovery of somatic mutations from high throughput sequencing typically involve a large number of analytical choices in the form of quality filters.</p><p><strong>Results: </strong>We present vcfView, an interactive tool designed to support the evaluation of somatic mutation calls from cancer sequencing data. The tool takes as input a single variant call format (VCF) file and enables researchers to explore the impacts of analytical choices on the mutant allele frequency spectrum, on mutational signatures and on annotated somatic variants in genes of interest. It allows variants that have failed variant caller filters to be re-examined to improve sensitivity or guide the design of future experiments. It is extensible, allowing other algorithms to be incorporated easily.</p><p><strong>Availability: </strong>The shiny application can be downloaded from GitHub (https://github.com/BrianOSullivanGit/vcfView). All data processing is performed within <i>R</i> to ensure platform independence. The app has been tested on RStudio, version 1.1.456, with base <i>R</i> 3.6.2 and Shiny 1.4.0. A vignette based on a publicly available data set is also available on GitHub.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":"19 ","pages":"1176935120972377"},"PeriodicalIF":2.4000,"publicationDate":"2020-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/1176935120972377","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/1176935120972377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 1

Abstract

Motivation: Somatic mutations can have critical prognostic and therapeutic implications for cancer patients. Although targeted methods are often used to assay specific cancer driver mutations, high throughput sequencing is frequently applied to discover novel driver mutations and to determine the status of less-frequent driver mutations. The task of recovering somatic mutations from these data is nontrivial as somatic mutations must be distinguished from germline variants, sequencing errors, and other artefacts. Consequently, bioinformatics pipelines for recovery of somatic mutations from high throughput sequencing typically involve a large number of analytical choices in the form of quality filters.

Results: We present vcfView, an interactive tool designed to support the evaluation of somatic mutation calls from cancer sequencing data. The tool takes as input a single variant call format (VCF) file and enables researchers to explore the impacts of analytical choices on the mutant allele frequency spectrum, on mutational signatures and on annotated somatic variants in genes of interest. It allows variants that have failed variant caller filters to be re-examined to improve sensitivity or guide the design of future experiments. It is extensible, allowing other algorithms to be incorporated easily.

Availability: The shiny application can be downloaded from GitHub (https://github.com/BrianOSullivanGit/vcfView). All data processing is performed within R to ensure platform independence. The app has been tested on RStudio, version 1.1.456, with base R 3.6.2 and Shiny 1.4.0. A vignette based on a publicly available data set is also available on GitHub.

Abstract Image

Abstract Image

Abstract Image

vcfView:一个集成体细胞变异分析的可扩展数据可视化和质量保证平台。
动机:体细胞突变对癌症患者的预后和治疗具有重要意义。虽然靶向方法通常用于分析特定的癌症驱动突变,但高通量测序经常用于发现新的驱动突变和确定不太常见的驱动突变的状态。从这些数据中恢复体细胞突变的任务是不平凡的,因为体细胞突变必须与种系变异、测序错误和其他人为因素区分开来。因此,从高通量测序中恢复体细胞突变的生物信息学管道通常涉及大量高质量过滤器形式的分析选择。结果:我们提出了vcfView,这是一个交互式工具,旨在支持评估来自癌症测序数据的体细胞突变调用。该工具将单个变体调用格式(VCF)文件作为输入,使研究人员能够探索分析选择对突变等位基因频谱、突变签名和感兴趣基因中注释的体细胞变体的影响。它允许对失败的变体调用者过滤器进行重新检查,以提高灵敏度或指导未来实验的设计。它是可扩展的,允许很容易地合并其他算法。可用性:闪亮的应用程序可以从GitHub下载(https://github.com/BrianOSullivanGit/vcfView)。所有数据处理都在R中执行,以确保平台独立性。该应用程序已经在RStudio上进行了测试,版本1.1.456,基本版本R 3.6.2和Shiny 1.4.0。一个基于公开数据集的小插图也可以在GitHub上找到。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cancer Informatics
Cancer Informatics Medicine-Oncology
CiteScore
3.00
自引率
5.00%
发文量
30
审稿时长
8 weeks
期刊介绍: The field of cancer research relies on advances in many other disciplines, including omics technology, mass spectrometry, radio imaging, computer science, and biostatistics. Cancer Informatics provides open access to peer-reviewed high-quality manuscripts reporting bioinformatics analysis of molecular genetics and/or clinical data pertaining to cancer, emphasizing the use of machine learning, artificial intelligence, statistical algorithms, advanced imaging techniques, data visualization, and high-throughput technologies. As the leading journal dedicated exclusively to the report of the use of computational methods in cancer research and practice, Cancer Informatics leverages methodological improvements in systems biology, genomics, proteomics, metabolomics, and molecular biochemistry into the fields of cancer detection, treatment, classification, risk-prediction, prevention, outcome, and modeling.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信