Cullen Roth, Vrinda Venu, Sasha Bacot, Shawn R Starkenburg, Christina R Steadman
{"title":"SLUR(M)-py:一个SLURM驱动的python流水线,用于并行处理3D (Epi)基因组图谱。","authors":"Cullen Roth, Vrinda Venu, Sasha Bacot, Shawn R Starkenburg, Christina R Steadman","doi":"10.1080/17501911.2025.2568368","DOIUrl":null,"url":null,"abstract":"<p><p>Epigenomics has become multi-faceted, with researchers exploring chromatin structure, nucleosome states, and epigenetic modifications, producing large, complex multi-omic datasets. Given this shift, there is a demand for bioinformatics that leverage high-performance computing (HPC) and parallelization to quickly process data. As such, we developed SLUR(M)-py, a pythonic computational platform that leverages the Simple Linux Utility for Resource Management system (SLURM) to process sequencing data. SLUR(M)-py is multi-omic and automates calls to SLURM for processing paired-end sequences from chromatin characterization experiments, including whole-genome, ChIP-seq, ATAC-seq, and Hi-C, thereby eliminating the need for multiple analytics pipelines. To demonstrate SLUR(M)-py's utility, we employ ATAC-seq and Hi-C data from viral infection experiments and the ENCODE project, and illustrate its processing speed and completeness, which outpaces current HPC pipelines. We explore the effect of dropping duplicate sequenced reads in ATAC-seq, demonstrate how SLUR(M)-py can be used for quality control, and how to detect artifacts in Hi-C from viral infection experiments. Finally, we show how features in SLUR(M)-py, like inter-chromosomal analysis, can be used to explore the dynamics of chromosomal contacts in mammalian cells. This multi-omic, system-agnostic platform eases the computational burden for researchers and quickly produces accurate and reliable data analytics for the epigenomics community.</p>","PeriodicalId":11959,"journal":{"name":"Epigenomics","volume":" ","pages":"1-15"},"PeriodicalIF":2.6000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SLUR(M)-py: a SLURM powered Pythonic pipeline for parallel processing of 3D (Epi)genomic profiles.\",\"authors\":\"Cullen Roth, Vrinda Venu, Sasha Bacot, Shawn R Starkenburg, Christina R Steadman\",\"doi\":\"10.1080/17501911.2025.2568368\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Epigenomics has become multi-faceted, with researchers exploring chromatin structure, nucleosome states, and epigenetic modifications, producing large, complex multi-omic datasets. Given this shift, there is a demand for bioinformatics that leverage high-performance computing (HPC) and parallelization to quickly process data. As such, we developed SLUR(M)-py, a pythonic computational platform that leverages the Simple Linux Utility for Resource Management system (SLURM) to process sequencing data. SLUR(M)-py is multi-omic and automates calls to SLURM for processing paired-end sequences from chromatin characterization experiments, including whole-genome, ChIP-seq, ATAC-seq, and Hi-C, thereby eliminating the need for multiple analytics pipelines. To demonstrate SLUR(M)-py's utility, we employ ATAC-seq and Hi-C data from viral infection experiments and the ENCODE project, and illustrate its processing speed and completeness, which outpaces current HPC pipelines. We explore the effect of dropping duplicate sequenced reads in ATAC-seq, demonstrate how SLUR(M)-py can be used for quality control, and how to detect artifacts in Hi-C from viral infection experiments. Finally, we show how features in SLUR(M)-py, like inter-chromosomal analysis, can be used to explore the dynamics of chromosomal contacts in mammalian cells. This multi-omic, system-agnostic platform eases the computational burden for researchers and quickly produces accurate and reliable data analytics for the epigenomics community.</p>\",\"PeriodicalId\":11959,\"journal\":{\"name\":\"Epigenomics\",\"volume\":\" \",\"pages\":\"1-15\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Epigenomics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1080/17501911.2025.2568368\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epigenomics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/17501911.2025.2568368","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
SLUR(M)-py: a SLURM powered Pythonic pipeline for parallel processing of 3D (Epi)genomic profiles.
Epigenomics has become multi-faceted, with researchers exploring chromatin structure, nucleosome states, and epigenetic modifications, producing large, complex multi-omic datasets. Given this shift, there is a demand for bioinformatics that leverage high-performance computing (HPC) and parallelization to quickly process data. As such, we developed SLUR(M)-py, a pythonic computational platform that leverages the Simple Linux Utility for Resource Management system (SLURM) to process sequencing data. SLUR(M)-py is multi-omic and automates calls to SLURM for processing paired-end sequences from chromatin characterization experiments, including whole-genome, ChIP-seq, ATAC-seq, and Hi-C, thereby eliminating the need for multiple analytics pipelines. To demonstrate SLUR(M)-py's utility, we employ ATAC-seq and Hi-C data from viral infection experiments and the ENCODE project, and illustrate its processing speed and completeness, which outpaces current HPC pipelines. We explore the effect of dropping duplicate sequenced reads in ATAC-seq, demonstrate how SLUR(M)-py can be used for quality control, and how to detect artifacts in Hi-C from viral infection experiments. Finally, we show how features in SLUR(M)-py, like inter-chromosomal analysis, can be used to explore the dynamics of chromosomal contacts in mammalian cells. This multi-omic, system-agnostic platform eases the computational burden for researchers and quickly produces accurate and reliable data analytics for the epigenomics community.
期刊介绍:
Epigenomics provides the forum to address the rapidly progressing research developments in this ever-expanding field; to report on the major challenges ahead and critical advances that are propelling the science forward. The journal delivers this information in concise, at-a-glance article formats – invaluable to a time constrained community.
Substantial developments in our current knowledge and understanding of genomics and epigenetics are constantly being made, yet this field is still in its infancy. Epigenomics provides a critical overview of the latest and most significant advances as they unfold and explores their potential application in the clinical setting.