{"title":"Flexible integration of spatial and expression information for precise spot embedding via ZINB-based graph-enhanced autoencoder.","authors":"Jiacheng Leng, Jiating Yu, Ling-Yun Wu, Hongyang Chen","doi":"10.1038/s42003-025-07965-5","DOIUrl":null,"url":null,"abstract":"<p><p>Domain identification is a critical problem in spatially resolved transcriptomics data analysis, which aims to identify distinct spatial domains within a tissue that maintain both spatial continuity and expression consistency. The degree of coupling between expression data and spatial information in different datasets often varies significantly. Some regions have intact and clear boundaries, while others exhibit blurred boundaries with high intra-domain expression similarity. However, most domain identification methods do not adequately integrate expression and spatial information to flexibly identify different types of domains. To address these issues, we introduce Spot2vector, a computational framework that leverages a graph-enhanced autoencoder integrating zero-inflated negative binomial distribution modeling, combining both graph convolutional networks and graph attention networks to extract the latent embeddings of spots. Spot2vector encodes and integrates spatial and expression information, enabling effective identification of domains with diverse spatial patterns across spatially resolved transcriptomics data generated by different platforms. The decoders enable us to decipher the distribution and generation mechanisms of data while improving expression quality through denoising. Extensive validation and analyses demonstrate that Spot2vector excels in enhancing domain identification accuracy, effectively reducing data dimensionality, improving expression recovery and denoising, and precisely capturing spatial gene expression patterns.</p>","PeriodicalId":10552,"journal":{"name":"Communications Biology","volume":"8 1","pages":"556"},"PeriodicalIF":5.2000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11971412/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s42003-025-07965-5","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Domain identification is a critical problem in spatially resolved transcriptomics data analysis, which aims to identify distinct spatial domains within a tissue that maintain both spatial continuity and expression consistency. The degree of coupling between expression data and spatial information in different datasets often varies significantly. Some regions have intact and clear boundaries, while others exhibit blurred boundaries with high intra-domain expression similarity. However, most domain identification methods do not adequately integrate expression and spatial information to flexibly identify different types of domains. To address these issues, we introduce Spot2vector, a computational framework that leverages a graph-enhanced autoencoder integrating zero-inflated negative binomial distribution modeling, combining both graph convolutional networks and graph attention networks to extract the latent embeddings of spots. Spot2vector encodes and integrates spatial and expression information, enabling effective identification of domains with diverse spatial patterns across spatially resolved transcriptomics data generated by different platforms. The decoders enable us to decipher the distribution and generation mechanisms of data while improving expression quality through denoising. Extensive validation and analyses demonstrate that Spot2vector excels in enhancing domain identification accuracy, effectively reducing data dimensionality, improving expression recovery and denoising, and precisely capturing spatial gene expression patterns.
期刊介绍:
Communications Biology is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the biological sciences. Research papers published by the journal represent significant advances bringing new biological insight to a specialized area of research.