{"title":"stMCDI:用于空间转录组学数据推算的屏蔽条件扩散模型与图神经网络","authors":"Xiaoyu Li, Wenwen Min, Shunfang Wang, Changmiao Wang, Taosheng Xu","doi":"arxiv-2403.10863","DOIUrl":null,"url":null,"abstract":"Spatially resolved transcriptomics represents a significant advancement in\nsingle-cell analysis by offering both gene expression data and their\ncorresponding physical locations. However, this high degree of spatial\nresolution entails a drawback, as the resulting spatial transcriptomic data at\nthe cellular level is notably plagued by a high incidence of missing values.\nFurthermore, most existing imputation methods either overlook the spatial\ninformation between spots or compromise the overall gene expression data\ndistribution. To address these challenges, our primary focus is on effectively\nutilizing the spatial location information within spatial transcriptomic data\nto impute missing values, while preserving the overall data distribution. We\nintroduce \\textbf{stMCDI}, a novel conditional diffusion model for spatial\ntranscriptomics data imputation, which employs a denoising network trained\nusing randomly masked data portions as guidance, with the unmasked data serving\nas conditions. Additionally, it utilizes a GNN encoder to integrate the spatial\nposition information, thereby enhancing model performance. The results obtained\nfrom spatial transcriptomics datasets elucidate the performance of our methods\nrelative to existing approaches.","PeriodicalId":501070,"journal":{"name":"arXiv - QuanBio - Genomics","volume":"120 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"stMCDI: Masked Conditional Diffusion Model with Graph Neural Network for Spatial Transcriptomics Data Imputation\",\"authors\":\"Xiaoyu Li, Wenwen Min, Shunfang Wang, Changmiao Wang, Taosheng Xu\",\"doi\":\"arxiv-2403.10863\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Spatially resolved transcriptomics represents a significant advancement in\\nsingle-cell analysis by offering both gene expression data and their\\ncorresponding physical locations. However, this high degree of spatial\\nresolution entails a drawback, as the resulting spatial transcriptomic data at\\nthe cellular level is notably plagued by a high incidence of missing values.\\nFurthermore, most existing imputation methods either overlook the spatial\\ninformation between spots or compromise the overall gene expression data\\ndistribution. To address these challenges, our primary focus is on effectively\\nutilizing the spatial location information within spatial transcriptomic data\\nto impute missing values, while preserving the overall data distribution. We\\nintroduce \\\\textbf{stMCDI}, a novel conditional diffusion model for spatial\\ntranscriptomics data imputation, which employs a denoising network trained\\nusing randomly masked data portions as guidance, with the unmasked data serving\\nas conditions. Additionally, it utilizes a GNN encoder to integrate the spatial\\nposition information, thereby enhancing model performance. The results obtained\\nfrom spatial transcriptomics datasets elucidate the performance of our methods\\nrelative to existing approaches.\",\"PeriodicalId\":501070,\"journal\":{\"name\":\"arXiv - QuanBio - Genomics\",\"volume\":\"120 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Genomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2403.10863\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2403.10863","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
stMCDI: Masked Conditional Diffusion Model with Graph Neural Network for Spatial Transcriptomics Data Imputation
Spatially resolved transcriptomics represents a significant advancement in
single-cell analysis by offering both gene expression data and their
corresponding physical locations. However, this high degree of spatial
resolution entails a drawback, as the resulting spatial transcriptomic data at
the cellular level is notably plagued by a high incidence of missing values.
Furthermore, most existing imputation methods either overlook the spatial
information between spots or compromise the overall gene expression data
distribution. To address these challenges, our primary focus is on effectively
utilizing the spatial location information within spatial transcriptomic data
to impute missing values, while preserving the overall data distribution. We
introduce \textbf{stMCDI}, a novel conditional diffusion model for spatial
transcriptomics data imputation, which employs a denoising network trained
using randomly masked data portions as guidance, with the unmasked data serving
as conditions. Additionally, it utilizes a GNN encoder to integrate the spatial
position information, thereby enhancing model performance. The results obtained
from spatial transcriptomics datasets elucidate the performance of our methods
relative to existing approaches.