{"title":"Mitigation of multi-scale biases in cell-type deconvolution for spatially resolved transcriptomics using HarmoDecon.","authors":"Zirui Wang, Ke Xu, Yang Liu, Yu Xu, Lu Zhang","doi":"10.1093/bioinformatics/btaf451","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>The advent of spatially resolved transcriptomics (SRT) has revolutionized our understanding of tissue molecular microenvironments by enabling the study of gene expression in its spatial context. However, many SRT platforms lack single-cell resolution, necessitating cell-type deconvolution methods to estimate cell-type proportions in SRT spots. Despite advancements in existing tools, these methods have not addressed biases occurring at three scales: individual spots, entire tissue samples, and discrepancies between SRT and reference scRNA-seq datasets. These biases result in overbalanced cell-type proportions for each spot, mismatched cell-type fractions at the sample level, and data distribution shifts across platforms.</p><p><strong>Results: </strong>To mitigate these biases, we introduce HarmoDecon, a novel semi-supervised deep learning model for spatial cell-type deconvolution. HarmoDecon leverages pseudo-spots derived from scRNA-seq data and uses Gaussian Mixture Graph Convolutional Networks to address the aforementioned issues. Through extensive simulations on multi-cell spots from STARmap and osmFISH, HarmoDecon outperformed 11 state-of-the-art methods. Additionally, when applied to legacy SRT platforms and 10x Visium datasets, HarmoDecon achieved the highest accuracy in spatial domain clustering and maintained strong correlations between cancer marker genes and cancer cells in human breast cancer samples. These results highlight the utility of HarmoDecon in advancing spatial transcriptomics analysis.</p><p><strong>Availability and implementation: </strong>The HarmoDecon scripts, with the detailed tutorials, are available at https://github.com/ericcombiolab/HarmoDecon/tree/main.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12417081/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf451","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Motivation: The advent of spatially resolved transcriptomics (SRT) has revolutionized our understanding of tissue molecular microenvironments by enabling the study of gene expression in its spatial context. However, many SRT platforms lack single-cell resolution, necessitating cell-type deconvolution methods to estimate cell-type proportions in SRT spots. Despite advancements in existing tools, these methods have not addressed biases occurring at three scales: individual spots, entire tissue samples, and discrepancies between SRT and reference scRNA-seq datasets. These biases result in overbalanced cell-type proportions for each spot, mismatched cell-type fractions at the sample level, and data distribution shifts across platforms.
Results: To mitigate these biases, we introduce HarmoDecon, a novel semi-supervised deep learning model for spatial cell-type deconvolution. HarmoDecon leverages pseudo-spots derived from scRNA-seq data and uses Gaussian Mixture Graph Convolutional Networks to address the aforementioned issues. Through extensive simulations on multi-cell spots from STARmap and osmFISH, HarmoDecon outperformed 11 state-of-the-art methods. Additionally, when applied to legacy SRT platforms and 10x Visium datasets, HarmoDecon achieved the highest accuracy in spatial domain clustering and maintained strong correlations between cancer marker genes and cancer cells in human breast cancer samples. These results highlight the utility of HarmoDecon in advancing spatial transcriptomics analysis.
Availability and implementation: The HarmoDecon scripts, with the detailed tutorials, are available at https://github.com/ericcombiolab/HarmoDecon/tree/main.