Yan Zhou, Yaohua Hu, Liuting Tan, Jiadi Zhu, Yutong Fei, Ming Gu, Dechao Tian
{"title":"pb - diffhc:用于从高分辨率伪大体积Hi-C数据中检测差异染色质相互作用的统计框架。","authors":"Yan Zhou, Yaohua Hu, Liuting Tan, Jiadi Zhu, Yutong Fei, Ming Gu, Dechao Tian","doi":"10.1186/s12864-025-11987-y","DOIUrl":null,"url":null,"abstract":"<p><p>Single-cell Hi-C (scHi-C) data provide unprecedented opportunities for analyzing differential chromatin interactions, essential for understanding genome structure-function relationships across various biological conditions. However, biologically meaningful differential chromatin interaction analysis at high resolution (e.g., 10 Kb) remains challenging due to the inherent sparsity of scHi-C data. Existing approaches typically rely on single cell imputation, which is computationally intensive and lacks validation, or apply conventional bulk Hi-C tools to pseudo-bulk matrices aggregated from individual cells. The sparsity of high-resolution pseudo-bulk data limits the effectiveness of bulk-oriented methods. Here, we present PB-DiffHiC, an optimized parametric statistical framework that directly analyzes raw pseudo-bulk Hi-C data at 10 Kb resolution between conditions. PB-DiffHiC incorporates Gaussian convolution, the stability of short-range interactions, and Poisson modeling to jointly perform normalization and statistical testing. Benchmarking on cell-type-specific chromatin loops shows that PB-DiffHiC achieves higher precision than alternative methods. Application to pseudo-bulk and matched bulk Hi-C data demonstrates stronger concordance in identified differential interactions, reinforcing its reliability. In a case study, PB-DiffHiC successfully identifies Kcnq5-associated differential interactions that closely matching SnapHiC-D results, despite not relying on single-cell imputation. PB-DiffHiC is a statistically sound and robust method for high-resolution differential analysis of chromatin interactions using raw pseudo-bulk Hi-C data. The source code of PB-DiffHiC is publicly available at https://github.com/Tian-Dechao/PB-DiffHiC .</p>","PeriodicalId":9030,"journal":{"name":"BMC Genomics","volume":"26 1","pages":"900"},"PeriodicalIF":3.7000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12512566/pdf/","citationCount":"0","resultStr":"{\"title\":\"PB-DiffHiC: a statistical framework for detecting differential chromatin interactions from high resolution pseudo-bulk Hi-C data.\",\"authors\":\"Yan Zhou, Yaohua Hu, Liuting Tan, Jiadi Zhu, Yutong Fei, Ming Gu, Dechao Tian\",\"doi\":\"10.1186/s12864-025-11987-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Single-cell Hi-C (scHi-C) data provide unprecedented opportunities for analyzing differential chromatin interactions, essential for understanding genome structure-function relationships across various biological conditions. However, biologically meaningful differential chromatin interaction analysis at high resolution (e.g., 10 Kb) remains challenging due to the inherent sparsity of scHi-C data. Existing approaches typically rely on single cell imputation, which is computationally intensive and lacks validation, or apply conventional bulk Hi-C tools to pseudo-bulk matrices aggregated from individual cells. The sparsity of high-resolution pseudo-bulk data limits the effectiveness of bulk-oriented methods. Here, we present PB-DiffHiC, an optimized parametric statistical framework that directly analyzes raw pseudo-bulk Hi-C data at 10 Kb resolution between conditions. PB-DiffHiC incorporates Gaussian convolution, the stability of short-range interactions, and Poisson modeling to jointly perform normalization and statistical testing. Benchmarking on cell-type-specific chromatin loops shows that PB-DiffHiC achieves higher precision than alternative methods. Application to pseudo-bulk and matched bulk Hi-C data demonstrates stronger concordance in identified differential interactions, reinforcing its reliability. In a case study, PB-DiffHiC successfully identifies Kcnq5-associated differential interactions that closely matching SnapHiC-D results, despite not relying on single-cell imputation. PB-DiffHiC is a statistically sound and robust method for high-resolution differential analysis of chromatin interactions using raw pseudo-bulk Hi-C data. The source code of PB-DiffHiC is publicly available at https://github.com/Tian-Dechao/PB-DiffHiC .</p>\",\"PeriodicalId\":9030,\"journal\":{\"name\":\"BMC Genomics\",\"volume\":\"26 1\",\"pages\":\"900\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12512566/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Genomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s12864-025-11987-y\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12864-025-11987-y","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
PB-DiffHiC: a statistical framework for detecting differential chromatin interactions from high resolution pseudo-bulk Hi-C data.
Single-cell Hi-C (scHi-C) data provide unprecedented opportunities for analyzing differential chromatin interactions, essential for understanding genome structure-function relationships across various biological conditions. However, biologically meaningful differential chromatin interaction analysis at high resolution (e.g., 10 Kb) remains challenging due to the inherent sparsity of scHi-C data. Existing approaches typically rely on single cell imputation, which is computationally intensive and lacks validation, or apply conventional bulk Hi-C tools to pseudo-bulk matrices aggregated from individual cells. The sparsity of high-resolution pseudo-bulk data limits the effectiveness of bulk-oriented methods. Here, we present PB-DiffHiC, an optimized parametric statistical framework that directly analyzes raw pseudo-bulk Hi-C data at 10 Kb resolution between conditions. PB-DiffHiC incorporates Gaussian convolution, the stability of short-range interactions, and Poisson modeling to jointly perform normalization and statistical testing. Benchmarking on cell-type-specific chromatin loops shows that PB-DiffHiC achieves higher precision than alternative methods. Application to pseudo-bulk and matched bulk Hi-C data demonstrates stronger concordance in identified differential interactions, reinforcing its reliability. In a case study, PB-DiffHiC successfully identifies Kcnq5-associated differential interactions that closely matching SnapHiC-D results, despite not relying on single-cell imputation. PB-DiffHiC is a statistically sound and robust method for high-resolution differential analysis of chromatin interactions using raw pseudo-bulk Hi-C data. The source code of PB-DiffHiC is publicly available at https://github.com/Tian-Dechao/PB-DiffHiC .
期刊介绍:
BMC Genomics is an open access, peer-reviewed journal that considers articles on all aspects of genome-scale analysis, functional genomics, and proteomics.
BMC Genomics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.