Jiang Xingzuo, Wang Chenyuan, Yao Jiaxi, Wang Chengyuan
{"title":"Structure-guided integrative soft deep clustering analysis of scRNA-seq and scATAC-seq data.","authors":"Jiang Xingzuo, Wang Chenyuan, Yao Jiaxi, Wang Chengyuan","doi":"10.3389/fmicb.2025.1678891","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Current single-cell clustering methods often rely on hard clustering assignments, which fail to capture the dynamic and transitional states of cells during development. This study introduces the Structure-Guided Soft Deep Clustering (sgSDC) framework to address this limitation by integrating multimodal data and enabling probabilistic cluster assignments.</p><p><strong>Methods: </strong>The sgSDC model combines scRNA-seq and scATAC-seq data using a structure-guided fusion module with global attention. It employs contrastive learning to align modality-specific representations with a consensus representation and introduces a novel soft clustering loss that allows cells to belong to multiple clusters with varying probabilities.</p><p><strong>Results: </strong>Evaluations on four benchmark datasets demonstrate that sgSDC outperforms eight state-of-the-art methods in Accuracy (ACC), Normalized Mutual Information (NMI), and Adjusted Rand Index (ARI), achieving significant improvements-up to 52.62% in ARI on one dataset.</p><p><strong>Discussion: </strong>The results validate the effectiveness of structure-guided contrastive learning and soft clustering in capturing cellular heterogeneity. sgSDC provides a robust tool for analyzing complex single-cell data, with potential applications in developmental biology and tumor microenvironment research.</p>","PeriodicalId":12466,"journal":{"name":"Frontiers in Microbiology","volume":"16 ","pages":"1678891"},"PeriodicalIF":4.0000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12484220/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Microbiology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fmicb.2025.1678891","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Current single-cell clustering methods often rely on hard clustering assignments, which fail to capture the dynamic and transitional states of cells during development. This study introduces the Structure-Guided Soft Deep Clustering (sgSDC) framework to address this limitation by integrating multimodal data and enabling probabilistic cluster assignments.
Methods: The sgSDC model combines scRNA-seq and scATAC-seq data using a structure-guided fusion module with global attention. It employs contrastive learning to align modality-specific representations with a consensus representation and introduces a novel soft clustering loss that allows cells to belong to multiple clusters with varying probabilities.
Results: Evaluations on four benchmark datasets demonstrate that sgSDC outperforms eight state-of-the-art methods in Accuracy (ACC), Normalized Mutual Information (NMI), and Adjusted Rand Index (ARI), achieving significant improvements-up to 52.62% in ARI on one dataset.
Discussion: The results validate the effectiveness of structure-guided contrastive learning and soft clustering in capturing cellular heterogeneity. sgSDC provides a robust tool for analyzing complex single-cell data, with potential applications in developmental biology and tumor microenvironment research.
期刊介绍:
Frontiers in Microbiology is a leading journal in its field, publishing rigorously peer-reviewed research across the entire spectrum of microbiology. Field Chief Editor Martin G. Klotz at Washington State University is supported by an outstanding Editorial Board of international researchers. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide.