Xin Lu, Murong Zhou, Bo Gao, Fang Wang, Shuilin Jin, Qiaoming Liu, Guohua Wang
{"title":"stGRL:基于多任务图对比表示学习的空间转录组数据的空间域识别、去噪和归算算法。","authors":"Xin Lu, Murong Zhou, Bo Gao, Fang Wang, Shuilin Jin, Qiaoming Liu, Guohua Wang","doi":"10.1186/s12915-025-02290-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Spatial transcriptomics now enables sequencing while preserving the spatial location of cells. This significantly enhances researchers' understanding of cellular and tissue functions in their spatial context. However, due to current technical limitations, spatial transcriptomics data often exhibit high dropout rates and noise, posing challenges for downstream analysis, like spot clustering, differential gene analysis, and spatial domain identification. To address those challenges, we propose stGRL, a novel deep multi-task graph neural network model tailored for spatial transcriptomics. stGRL employs an encoder-decoder architecture with a zero-inflated negative binomial (ZINB) distribution to reconstruct input data while effectively addressing dropout events. Additionally, it integrates graph contrastive representation learning to enhance the consistency of node embeddings, thereby improving clustering performance.</p><p><strong>Results: </strong>Through benchmark experiments on various spatial transcriptomics datasets, stGRL demonstrated a superior ability to identify spatial features compared to current mainstream methods. In-depth analyses reveal that the denoised data generated by stGRL not only preserves the spatial hierarchy of tissues but also accurately identifies differentially expressed genes. When applied to breast cancer datasets, stGRL effectively analyzed the differences between cancerous regions and carcinoma in situ areas, uncovering that carcinoma in situ regions are predominantly regulated by the immune system, which limits cancer cell development through inflammatory responses. Additionally, in the spatial transcriptomics analysis of ovarian cancer, stGRL successfully annotated cell types, accurately identified B cell-enriched regions, and discovered a novel target gene, MZB1, with potential therapeutic value.</p><p><strong>Conclusions: </strong>stGRL is an effective method for integrating multiple tasks in spatial transcriptome analysis. Our study highlights its broad applicability and outstanding performance in analyzing spatial transcriptome data. This method offers a powerful analytical tool for uncovering the spatial heterogeneity of complex tissues and identifying potential therapeutic targets for disease.</p>","PeriodicalId":9339,"journal":{"name":"BMC Biology","volume":"23 1","pages":"177"},"PeriodicalIF":4.4000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211972/pdf/","citationCount":"0","resultStr":"{\"title\":\"stGRL: spatial domain identification, denoising, and imputation algorithm for spatial transcriptome data based on multi-task graph contrastive representation learning.\",\"authors\":\"Xin Lu, Murong Zhou, Bo Gao, Fang Wang, Shuilin Jin, Qiaoming Liu, Guohua Wang\",\"doi\":\"10.1186/s12915-025-02290-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Spatial transcriptomics now enables sequencing while preserving the spatial location of cells. This significantly enhances researchers' understanding of cellular and tissue functions in their spatial context. However, due to current technical limitations, spatial transcriptomics data often exhibit high dropout rates and noise, posing challenges for downstream analysis, like spot clustering, differential gene analysis, and spatial domain identification. To address those challenges, we propose stGRL, a novel deep multi-task graph neural network model tailored for spatial transcriptomics. stGRL employs an encoder-decoder architecture with a zero-inflated negative binomial (ZINB) distribution to reconstruct input data while effectively addressing dropout events. Additionally, it integrates graph contrastive representation learning to enhance the consistency of node embeddings, thereby improving clustering performance.</p><p><strong>Results: </strong>Through benchmark experiments on various spatial transcriptomics datasets, stGRL demonstrated a superior ability to identify spatial features compared to current mainstream methods. In-depth analyses reveal that the denoised data generated by stGRL not only preserves the spatial hierarchy of tissues but also accurately identifies differentially expressed genes. When applied to breast cancer datasets, stGRL effectively analyzed the differences between cancerous regions and carcinoma in situ areas, uncovering that carcinoma in situ regions are predominantly regulated by the immune system, which limits cancer cell development through inflammatory responses. Additionally, in the spatial transcriptomics analysis of ovarian cancer, stGRL successfully annotated cell types, accurately identified B cell-enriched regions, and discovered a novel target gene, MZB1, with potential therapeutic value.</p><p><strong>Conclusions: </strong>stGRL is an effective method for integrating multiple tasks in spatial transcriptome analysis. Our study highlights its broad applicability and outstanding performance in analyzing spatial transcriptome data. This method offers a powerful analytical tool for uncovering the spatial heterogeneity of complex tissues and identifying potential therapeutic targets for disease.</p>\",\"PeriodicalId\":9339,\"journal\":{\"name\":\"BMC Biology\",\"volume\":\"23 1\",\"pages\":\"177\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211972/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s12915-025-02290-z\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12915-025-02290-z","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
stGRL: spatial domain identification, denoising, and imputation algorithm for spatial transcriptome data based on multi-task graph contrastive representation learning.
Background: Spatial transcriptomics now enables sequencing while preserving the spatial location of cells. This significantly enhances researchers' understanding of cellular and tissue functions in their spatial context. However, due to current technical limitations, spatial transcriptomics data often exhibit high dropout rates and noise, posing challenges for downstream analysis, like spot clustering, differential gene analysis, and spatial domain identification. To address those challenges, we propose stGRL, a novel deep multi-task graph neural network model tailored for spatial transcriptomics. stGRL employs an encoder-decoder architecture with a zero-inflated negative binomial (ZINB) distribution to reconstruct input data while effectively addressing dropout events. Additionally, it integrates graph contrastive representation learning to enhance the consistency of node embeddings, thereby improving clustering performance.
Results: Through benchmark experiments on various spatial transcriptomics datasets, stGRL demonstrated a superior ability to identify spatial features compared to current mainstream methods. In-depth analyses reveal that the denoised data generated by stGRL not only preserves the spatial hierarchy of tissues but also accurately identifies differentially expressed genes. When applied to breast cancer datasets, stGRL effectively analyzed the differences between cancerous regions and carcinoma in situ areas, uncovering that carcinoma in situ regions are predominantly regulated by the immune system, which limits cancer cell development through inflammatory responses. Additionally, in the spatial transcriptomics analysis of ovarian cancer, stGRL successfully annotated cell types, accurately identified B cell-enriched regions, and discovered a novel target gene, MZB1, with potential therapeutic value.
Conclusions: stGRL is an effective method for integrating multiple tasks in spatial transcriptome analysis. Our study highlights its broad applicability and outstanding performance in analyzing spatial transcriptome data. This method offers a powerful analytical tool for uncovering the spatial heterogeneity of complex tissues and identifying potential therapeutic targets for disease.
期刊介绍:
BMC Biology is a broad scope journal covering all areas of biology. Our content includes research articles, new methods and tools. BMC Biology also publishes reviews, Q&A, and commentaries.