SLUR(M)-py：一个SLURM驱动的python流水线，用于并行处理3D （Epi）基因组图谱。

IF 2.6 4区医学 Q2 GENETICS & HEREDITY

Epigenomics Pub Date : 2025-10-03 DOI:10.1080/17501911.2025.2568368

Cullen Roth, Vrinda Venu, Sasha Bacot, Shawn R Starkenburg, Christina R Steadman

{"title":"SLUR(M)-py：一个SLURM驱动的python流水线，用于并行处理3D （Epi）基因组图谱。","authors":"Cullen Roth, Vrinda Venu, Sasha Bacot, Shawn R Starkenburg, Christina R Steadman","doi":"10.1080/17501911.2025.2568368","DOIUrl":null,"url":null,"abstract":"Epigenomics has become multi-faceted, with researchers exploring chromatin structure, nucleosome states, and epigenetic modifications, producing large, complex multi-omic datasets. Given this shift, there is a demand for bioinformatics that leverage high-performance computing (HPC) and parallelization to quickly process data. As such, we developed SLUR(M)-py, a pythonic computational platform that leverages the Simple Linux Utility for Resource Management system (SLURM) to process sequencing data. SLUR(M)-py is multi-omic and automates calls to SLURM for processing paired-end sequences from chromatin characterization experiments, including whole-genome, ChIP-seq, ATAC-seq, and Hi-C, thereby eliminating the need for multiple analytics pipelines. To demonstrate SLUR(M)-py's utility, we employ ATAC-seq and Hi-C data from viral infection experiments and the ENCODE project, and illustrate its processing speed and completeness, which outpaces current HPC pipelines. We explore the effect of dropping duplicate sequenced reads in ATAC-seq, demonstrate how SLUR(M)-py can be used for quality control, and how to detect artifacts in Hi-C from viral infection experiments. Finally, we show how features in SLUR(M)-py, like inter-chromosomal analysis, can be used to explore the dynamics of chromosomal contacts in mammalian cells. This multi-omic, system-agnostic platform eases the computational burden for researchers and quickly produces accurate and reliable data analytics for the epigenomics community.","PeriodicalId":11959,"journal":{"name":"Epigenomics","volume":" ","pages":"1-15"},"PeriodicalIF":2.6000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SLUR(M)-py: a SLURM powered Pythonic pipeline for parallel processing of 3D (Epi)genomic profiles.\",\"authors\":\"Cullen Roth, Vrinda Venu, Sasha Bacot, Shawn R Starkenburg, Christina R Steadman\",\"doi\":\"10.1080/17501911.2025.2568368\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Epigenomics has become multi-faceted, with researchers exploring chromatin structure, nucleosome states, and epigenetic modifications, producing large, complex multi-omic datasets. Given this shift, there is a demand for bioinformatics that leverage high-performance computing (HPC) and parallelization to quickly process data. As such, we developed SLUR(M)-py, a pythonic computational platform that leverages the Simple Linux Utility for Resource Management system (SLURM) to process sequencing data. SLUR(M)-py is multi-omic and automates calls to SLURM for processing paired-end sequences from chromatin characterization experiments, including whole-genome, ChIP-seq, ATAC-seq, and Hi-C, thereby eliminating the need for multiple analytics pipelines. To demonstrate SLUR(M)-py's utility, we employ ATAC-seq and Hi-C data from viral infection experiments and the ENCODE project, and illustrate its processing speed and completeness, which outpaces current HPC pipelines. We explore the effect of dropping duplicate sequenced reads in ATAC-seq, demonstrate how SLUR(M)-py can be used for quality control, and how to detect artifacts in Hi-C from viral infection experiments. Finally, we show how features in SLUR(M)-py, like inter-chromosomal analysis, can be used to explore the dynamics of chromosomal contacts in mammalian cells. This multi-omic, system-agnostic platform eases the computational burden for researchers and quickly produces accurate and reliable data analytics for the epigenomics community.\",\"PeriodicalId\":11959,\"journal\":{\"name\":\"Epigenomics\",\"volume\":\" \",\"pages\":\"1-15\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Epigenomics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1080/17501911.2025.2568368\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epigenomics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/17501911.2025.2568368","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}

引用次数: 0

摘要

随着研究人员探索染色质结构、核小体状态和表观遗传修饰，表观基因组学已经变得多面性，产生了大型、复杂的多基因组数据集。鉴于这种转变，生物信息学需要利用高性能计算（HPC）和并行化来快速处理数据。因此，我们开发了SLUR(M)-py，这是一个python计算平台，利用简单Linux资源管理系统（SLURM）来处理测序数据。SLUR(M)-py是多组学的，可以自动调用SLURM来处理来自染色质表征实验的配对端序列，包括全基因组、ChIP-seq、ATAC-seq和Hi-C，从而消除了对多个分析管道的需求。为了证明SLUR(M)-py的实用性，我们使用了来自病毒感染实验和ENCODE项目的ATAC-seq和Hi-C数据，并说明了其处理速度和完整性，超过了当前的HPC管道。我们探讨了在ATAC-seq中删除重复序列的影响，展示了SLUR(M)-py如何用于质量控制，以及如何从病毒感染实验中检测Hi-C中的伪像。最后，我们展示了SLUR(M)-py中的特征，如染色体间分析，如何用于探索哺乳动物细胞中染色体接触的动力学。这个多组学、系统无关的平台减轻了研究人员的计算负担，并为表观基因组学社区快速生成准确可靠的数据分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

SLUR(M)-py: a SLURM powered Pythonic pipeline for parallel processing of 3D (Epi)genomic profiles.

Epigenomics has become multi-faceted, with researchers exploring chromatin structure, nucleosome states, and epigenetic modifications, producing large, complex multi-omic datasets. Given this shift, there is a demand for bioinformatics that leverage high-performance computing (HPC) and parallelization to quickly process data. As such, we developed SLUR(M)-py, a pythonic computational platform that leverages the Simple Linux Utility for Resource Management system (SLURM) to process sequencing data. SLUR(M)-py is multi-omic and automates calls to SLURM for processing paired-end sequences from chromatin characterization experiments, including whole-genome, ChIP-seq, ATAC-seq, and Hi-C, thereby eliminating the need for multiple analytics pipelines. To demonstrate SLUR(M)-py's utility, we employ ATAC-seq and Hi-C data from viral infection experiments and the ENCODE project, and illustrate its processing speed and completeness, which outpaces current HPC pipelines. We explore the effect of dropping duplicate sequenced reads in ATAC-seq, demonstrate how SLUR(M)-py can be used for quality control, and how to detect artifacts in Hi-C from viral infection experiments. Finally, we show how features in SLUR(M)-py, like inter-chromosomal analysis, can be used to explore the dynamics of chromosomal contacts in mammalian cells. This multi-omic, system-agnostic platform eases the computational burden for researchers and quickly produces accurate and reliable data analytics for the epigenomics community.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Epigenomics GENETICS & HEREDITY-

CiteScore

5.80

自引率

2.60%

发文量

审稿时长

>12 weeks

期刊介绍： Epigenomics provides the forum to address the rapidly progressing research developments in this ever-expanding field; to report on the major challenges ahead and critical advances that are propelling the science forward. The journal delivers this information in concise, at-a-glance article formats – invaluable to a time constrained community. Substantial developments in our current knowledge and understanding of genomics and epigenetics are constantly being made, yet this field is still in its infancy. Epigenomics provides a critical overview of the latest and most significant advances as they unfold and explores their potential application in the clinical setting.