PEGR: a management platform for ChIP-based next generation sequencing pipelines.

Danying Shao, Gretta Kellogg, Shaun Mahony, William Lai, B Franklin Pugh
Published in: PEARC20: Practice and Experience in Advanced Research Computing 2020: Catch the Wave, July 27-31, 2020, Portland, OR, Virtual Conference
Publication date: 2020-07-01
DOI: https://doi.org/10.1145/3311790.3396621
Citations: 1

Abstract

Genome sequencing has developed rapidly, including high-throughput next generation sequencing (NGS) technologies, automation of biological experiments, new bioinformatics tools, and the use of high-performance and cloud computing. ChIP-based NGS technologies, e.g. ChIP-seq and ChIP-exo, are widely used to detect the binding sites of DNA-interacting proteins in the genome and give us a deeper mechanistic understanding of genomic regulation. As ChIP-based NGS pipelines generate sequencing data at an unprecedented pace, there is an urgent need for a metadata management system. To meet this need, we developed the Platform for Eukaryotic Genomic Regulation (PEGR), a web service platform that logs metadata for samples and sequencing experiments, manages data processing workflows, and provides reporting and visualization. PEGR links together people, samples, protocols, DNA sequencers, and bioinformatics computation. With PEGR, scientists can gain a more integrated view of their sequencing data and better understand the mechanisms of genomic regulation. In this paper, we present the architecture and major functionalities of PEGR, share our experience in developing the application, and discuss future directions.
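To make the idea of linked metadata concrete, the following is a minimal sketch of how a record might tie together a person, a sample, a protocol, a sequencer, and the analysis steps run on the data. It is purely illustrative: the class names, fields, and methods (`Sample`, `SequencingRun`, `log_analysis`, `report`) are assumptions made for this example and are not taken from PEGR's actual schema or API.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Sample:
    """Hypothetical sample record; field names are illustrative only."""
    sample_id: str
    submitter: str   # person who prepared the sample
    protocol: str    # assay protocol, e.g. "ChIP-exo"
    target: str      # DNA-interacting protein being assayed


@dataclass
class SequencingRun:
    """Hypothetical run record linking a sequencer, samples, and analyses."""
    run_id: str
    sequencer: str
    samples: List[Sample] = field(default_factory=list)
    # Maps sample_id to the analysis steps logged for that sample so far.
    analyses: Dict[str, List[str]] = field(default_factory=dict)

    def add_sample(self, sample: Sample) -> None:
        self.samples.append(sample)
        self.analyses.setdefault(sample.sample_id, [])

    def log_analysis(self, sample_id: str, step: str) -> None:
        # Reject steps for samples that were never registered on this run.
        if sample_id not in self.analyses:
            raise KeyError(f"unknown sample: {sample_id}")
        self.analyses[sample_id].append(step)

    def report(self) -> List[dict]:
        # One flat row per sample, suitable for a summary table.
        return [
            {
                "run": self.run_id,
                "sequencer": self.sequencer,
                "sample": s.sample_id,
                "submitter": s.submitter,
                "protocol": s.protocol,
                "target": s.target,
                "steps": list(self.analyses[s.sample_id]),
            }
            for s in self.samples
        ]
```

In this sketch, a run is created, samples are registered with `add_sample`, each processing step is recorded with `log_analysis`, and `report` returns one row per sample that joins all of the linked metadata.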
