Glbase:一个用于组合、分析和显示异质基因组和高通量测序数据的框架

IF 4 Q2 CELL & TISSUE ENGINEERING
Andrew Paul Hutchins , Ralf Jauch , Mateusz Dyla , Diego Miranda-Saavedra
{"title":"Glbase:一个用于组合、分析和显示异质基因组和高通量测序数据的框架","authors":"Andrew Paul Hutchins ,&nbsp;Ralf Jauch ,&nbsp;Mateusz Dyla ,&nbsp;Diego Miranda-Saavedra","doi":"10.1186/2045-9769-3-1","DOIUrl":null,"url":null,"abstract":"<div><p>Genomic datasets and the tools to analyze them have proliferated at an astonishing rate. However, such tools are often poorly integrated with each other: each program typically produces its own custom output in a variety of non-standard file formats. Here we present glbase, a framework that uses a flexible set of descriptors that can quickly parse non-binary data files. glbase includes many functions to intersect two lists of data, including operations on genomic interval data and support for the efficient random access to huge genomic data files. Many glbase functions can produce graphical outputs, including scatter plots, heatmaps, boxplots and other common analytical displays of high-throughput data such as RNA-seq, ChIP-seq and microarray expression data. glbase is designed to rapidly bring biological data into a Python-based analytical environment to facilitate analysis and data processing. In summary, glbase is a flexible and multifunctional toolkit that allows the combination and analysis of high-throughput data (especially next-generation sequencing and genome-wide data), and which has been instrumental in the analysis of complex data sets. glbase is freely available at <span>http://bitbucket.org/oaxiom/glbase/</span><svg><path></path></svg>.</p></div>","PeriodicalId":9811,"journal":{"name":"Cell Regeneration","volume":"3 1","pages":"Article 3:1"},"PeriodicalIF":4.0000,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/2045-9769-3-1","citationCount":"65","resultStr":"{\"title\":\"glbase: a framework for combining, analyzing and displaying heterogeneous genomic and high-throughput sequencing data\",\"authors\":\"Andrew Paul Hutchins ,&nbsp;Ralf Jauch ,&nbsp;Mateusz Dyla ,&nbsp;Diego Miranda-Saavedra\",\"doi\":\"10.1186/2045-9769-3-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Genomic datasets and the tools to analyze them have proliferated at an astonishing rate. However, such tools are often poorly integrated with each other: each program typically produces its own custom output in a variety of non-standard file formats. Here we present glbase, a framework that uses a flexible set of descriptors that can quickly parse non-binary data files. glbase includes many functions to intersect two lists of data, including operations on genomic interval data and support for the efficient random access to huge genomic data files. Many glbase functions can produce graphical outputs, including scatter plots, heatmaps, boxplots and other common analytical displays of high-throughput data such as RNA-seq, ChIP-seq and microarray expression data. glbase is designed to rapidly bring biological data into a Python-based analytical environment to facilitate analysis and data processing. In summary, glbase is a flexible and multifunctional toolkit that allows the combination and analysis of high-throughput data (especially next-generation sequencing and genome-wide data), and which has been instrumental in the analysis of complex data sets. glbase is freely available at <span>http://bitbucket.org/oaxiom/glbase/</span><svg><path></path></svg>.</p></div>\",\"PeriodicalId\":9811,\"journal\":{\"name\":\"Cell Regeneration\",\"volume\":\"3 1\",\"pages\":\"Article 3:1\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2014-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/2045-9769-3-1\",\"citationCount\":\"65\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cell Regeneration\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2045976917300196\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CELL & TISSUE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell Regeneration","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2045976917300196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CELL & TISSUE ENGINEERING","Score":null,"Total":0}
引用次数: 65

摘要

基因组数据集和分析它们的工具以惊人的速度激增。然而,这些工具之间的集成通常很差:每个程序通常以各种非标准文件格式生成自己的自定义输出。这里我们介绍glbase,这是一个框架,它使用一组灵活的描述符,可以快速解析非二进制数据文件。Glbase包含许多功能来交叉两个数据列表,包括对基因组区间数据的操作和支持对巨大基因组数据文件的高效随机访问。许多glbase函数可以生成图形输出,包括散点图、热图、箱形图和其他常见的高通量数据分析显示,如RNA-seq、ChIP-seq和微阵列表达数据。glbase旨在快速将生物数据带入基于python的分析环境,以方便分析和数据处理。总之,glbase是一个灵活的多功能工具包,允许高通量数据(特别是下一代测序和全基因组数据)的组合和分析,并且在复杂数据集的分析中发挥了重要作用。Glbase可在http://bitbucket.org/oaxiom/glbase/免费获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

glbase: a framework for combining, analyzing and displaying heterogeneous genomic and high-throughput sequencing data

glbase: a framework for combining, analyzing and displaying heterogeneous genomic and high-throughput sequencing data

glbase: a framework for combining, analyzing and displaying heterogeneous genomic and high-throughput sequencing data

Genomic datasets and the tools to analyze them have proliferated at an astonishing rate. However, such tools are often poorly integrated with each other: each program typically produces its own custom output in a variety of non-standard file formats. Here we present glbase, a framework that uses a flexible set of descriptors that can quickly parse non-binary data files. glbase includes many functions to intersect two lists of data, including operations on genomic interval data and support for the efficient random access to huge genomic data files. Many glbase functions can produce graphical outputs, including scatter plots, heatmaps, boxplots and other common analytical displays of high-throughput data such as RNA-seq, ChIP-seq and microarray expression data. glbase is designed to rapidly bring biological data into a Python-based analytical environment to facilitate analysis and data processing. In summary, glbase is a flexible and multifunctional toolkit that allows the combination and analysis of high-throughput data (especially next-generation sequencing and genome-wide data), and which has been instrumental in the analysis of complex data sets. glbase is freely available at http://bitbucket.org/oaxiom/glbase/.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Cell Regeneration
Cell Regeneration Biochemistry, Genetics and Molecular Biology-Cell Biology
CiteScore
5.80
自引率
0.00%
发文量
42
审稿时长
35 days
期刊介绍: Cell Regeneration aims to provide a worldwide platform for researches on stem cells and regenerative biology to develop basic science and to foster its clinical translation in medicine. Cell Regeneration welcomes reports on novel discoveries, theories, methods, technologies, and products in the field of stem cells and regenerative research, the journal is interested, but not limited to the following topics: ◎ Embryonic stem cells ◎ Induced pluripotent stem cells ◎ Tissue-specific stem cells ◎ Tissue or organ regeneration ◎ Methodology ◎ Biomaterials and regeneration ◎ Clinical translation or application in medicine
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信