下一代测序数据的可视化分析,以检测细菌基因组中的重叠基因

Svenja Simon, Daniela Oelke, Richard Landstorfer, K. Neuhaus, D. Keim
{"title":"下一代测序数据的可视化分析,以检测细菌基因组中的重叠基因","authors":"Svenja Simon, Daniela Oelke, Richard Landstorfer, K. Neuhaus, D. Keim","doi":"10.1109/BIOVIS.2011.6094047","DOIUrl":null,"url":null,"abstract":"Next generation sequencing (NGS) technologies are about to revolutionize biological research. Being able to sequence large amounts of DNA or, indirectly, RNA sequences in a short time period opens numerous new possibilities. However, analyzing the large amounts of data generated in NGS is a serious challenge, which requires novel data analysis and visualization methods to allow the biological experimenter to understand the results. In this paper, we describe a novel system to deal with the flood of data generated by transcriptome sequencing (RNA-seq) using NGS. Our system allows the analyzer to get a quick overview of the data and interactively explore interesting regions based on the three important parameters coverage, transcription, and fit. In particular, our system supports the NGS analysis in the following respects: (1) Representation of the coverage sequence in a way that no artifacts are introduced. (2) Easy determination of a fit of an open reading frame (ORF) to a transcript by mapping the coverage sequence directly into the ORF representation. (3) Providing automatic support for finding interesting regions to address the problems that the overwhelming volume of data comes with. (4) Providing an overview representation that allows parameter tuning and enables quick access to interesting areas of the genome. We show the usefulness of our system by a case study in the area of overlapping gene detection in a bacterial genome.","PeriodicalId":354473,"journal":{"name":"2011 IEEE Symposium on Biological Data Visualization (BioVis).","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Visual analysis of next-generation sequencing data to detect overlapping genes in bacterial genomes\",\"authors\":\"Svenja Simon, Daniela Oelke, Richard Landstorfer, K. Neuhaus, D. Keim\",\"doi\":\"10.1109/BIOVIS.2011.6094047\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Next generation sequencing (NGS) technologies are about to revolutionize biological research. Being able to sequence large amounts of DNA or, indirectly, RNA sequences in a short time period opens numerous new possibilities. However, analyzing the large amounts of data generated in NGS is a serious challenge, which requires novel data analysis and visualization methods to allow the biological experimenter to understand the results. In this paper, we describe a novel system to deal with the flood of data generated by transcriptome sequencing (RNA-seq) using NGS. Our system allows the analyzer to get a quick overview of the data and interactively explore interesting regions based on the three important parameters coverage, transcription, and fit. In particular, our system supports the NGS analysis in the following respects: (1) Representation of the coverage sequence in a way that no artifacts are introduced. (2) Easy determination of a fit of an open reading frame (ORF) to a transcript by mapping the coverage sequence directly into the ORF representation. (3) Providing automatic support for finding interesting regions to address the problems that the overwhelming volume of data comes with. (4) Providing an overview representation that allows parameter tuning and enables quick access to interesting areas of the genome. We show the usefulness of our system by a case study in the area of overlapping gene detection in a bacterial genome.\",\"PeriodicalId\":354473,\"journal\":{\"name\":\"2011 IEEE Symposium on Biological Data Visualization (BioVis).\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-10-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE Symposium on Biological Data Visualization (BioVis).\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIOVIS.2011.6094047\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Symposium on Biological Data Visualization (BioVis).","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIOVIS.2011.6094047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

下一代测序(NGS)技术即将给生物学研究带来革命性的变化。能够在短时间内对大量DNA或间接的RNA序列进行测序,开辟了许多新的可能性。然而,分析NGS中产生的大量数据是一项严峻的挑战,这需要新颖的数据分析和可视化方法,以使生物实验者能够理解结果。在本文中,我们描述了一个使用NGS处理转录组测序(RNA-seq)产生的大量数据的新系统。我们的系统允许分析仪获得数据的快速概述,并基于三个重要参数覆盖,转录和拟合交互式探索有趣的区域。特别地,我们的系统在以下方面支持NGS分析:(1)以不引入工件的方式表示覆盖序列。(2)通过将覆盖序列直接映射到开放阅读框(ORF)表示中,轻松确定开放阅读框(ORF)与转录本的匹配度。(3)为寻找感兴趣的区域提供自动支持,以解决海量数据带来的问题。(4)提供一个概览表示,允许参数调整并能够快速访问基因组的有趣区域。我们通过在细菌基因组中重叠基因检测领域的案例研究显示了我们系统的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Visual analysis of next-generation sequencing data to detect overlapping genes in bacterial genomes
Next generation sequencing (NGS) technologies are about to revolutionize biological research. Being able to sequence large amounts of DNA or, indirectly, RNA sequences in a short time period opens numerous new possibilities. However, analyzing the large amounts of data generated in NGS is a serious challenge, which requires novel data analysis and visualization methods to allow the biological experimenter to understand the results. In this paper, we describe a novel system to deal with the flood of data generated by transcriptome sequencing (RNA-seq) using NGS. Our system allows the analyzer to get a quick overview of the data and interactively explore interesting regions based on the three important parameters coverage, transcription, and fit. In particular, our system supports the NGS analysis in the following respects: (1) Representation of the coverage sequence in a way that no artifacts are introduced. (2) Easy determination of a fit of an open reading frame (ORF) to a transcript by mapping the coverage sequence directly into the ORF representation. (3) Providing automatic support for finding interesting regions to address the problems that the overwhelming volume of data comes with. (4) Providing an overview representation that allows parameter tuning and enables quick access to interesting areas of the genome. We show the usefulness of our system by a case study in the area of overlapping gene detection in a bacterial genome.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信