{"title":"Benchmarking multi-slice integration and downstream applications in spatial transcriptomics data analysis","authors":"Kejing Dong, Yicheng Gao, Qi Zou, Yan Cui, Chuangyi Han, Senlin Lin, Zhikang Wang, Chen Tang, Xiaojie Cheng, Fangliangzi Meng, Xiaohan Chen, Shuguang Wang, Xuan Jin, Jingya Yang, Chen Zhang, Guohui Chuai, Zhiyuan Yuan, Qi Liu","doi":"10.1186/s13059-025-03796-z","DOIUrl":null,"url":null,"abstract":"Spatial transcriptomics preserves spatial context of tissues while capturing gene expression. As the technology advances, researchers are increasingly generating data from multiple tissue sections, creating a growing demand for multi-slice integration methods. These methods aim to generate spatially aware embeddings that jointly capture spatial and transcriptomic information, preserving biological signals while mitigating technical artifacts such as batch effects. However, the reliability of these methods varies, and the growing diversity of technologies makes integration even more challenging. This underscores the need for a comprehensive benchmark to evaluate their performance, which is still lacking. To systematically evaluate the performance of multi-slice integration methods, we propose a comprehensive benchmarking framework covering four key tasks that form an upstream-to-downstream pipeline: multi-slice integration, spatial clustering, spatial alignment, slice representation. For each task, we perform detailed analyses of the methods and provide actionable recommendations. Our results reveal substantial data-dependent variation in performance across tasks. We further investigate the relationships between upstream and downstream tasks, showing that downstream performance often depends on upstream quality. Our study provides a comprehensive benchmark of 12 multi-slice integration methods across four key tasks using 19 diverse datasets. Our results reveal that method performance is highly dependent on application context, dataset size, and technology. We also identified strong interdependencies between upstream and downstream tasks, highlighting the importance of robust early-stage analysis.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"29 1","pages":""},"PeriodicalIF":10.1000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13059-025-03796-z","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Spatial transcriptomics preserves spatial context of tissues while capturing gene expression. As the technology advances, researchers are increasingly generating data from multiple tissue sections, creating a growing demand for multi-slice integration methods. These methods aim to generate spatially aware embeddings that jointly capture spatial and transcriptomic information, preserving biological signals while mitigating technical artifacts such as batch effects. However, the reliability of these methods varies, and the growing diversity of technologies makes integration even more challenging. This underscores the need for a comprehensive benchmark to evaluate their performance, which is still lacking. To systematically evaluate the performance of multi-slice integration methods, we propose a comprehensive benchmarking framework covering four key tasks that form an upstream-to-downstream pipeline: multi-slice integration, spatial clustering, spatial alignment, slice representation. For each task, we perform detailed analyses of the methods and provide actionable recommendations. Our results reveal substantial data-dependent variation in performance across tasks. We further investigate the relationships between upstream and downstream tasks, showing that downstream performance often depends on upstream quality. Our study provides a comprehensive benchmark of 12 multi-slice integration methods across four key tasks using 19 diverse datasets. Our results reveal that method performance is highly dependent on application context, dataset size, and technology. We also identified strong interdependencies between upstream and downstream tasks, highlighting the importance of robust early-stage analysis.
Genome BiologyBiochemistry, Genetics and Molecular Biology-Genetics
CiteScore
21.00
自引率
3.30%
发文量
241
审稿时长
2 months
期刊介绍:
Genome Biology stands as a premier platform for exceptional research across all domains of biology and biomedicine, explored through a genomic and post-genomic lens.
With an impressive impact factor of 12.3 (2022),* the journal secures its position as the 3rd-ranked research journal in the Genetics and Heredity category and the 2nd-ranked research journal in the Biotechnology and Applied Microbiology category by Thomson Reuters. Notably, Genome Biology holds the distinction of being the highest-ranked open-access journal in this category.
Our dedicated team of highly trained in-house Editors collaborates closely with our esteemed Editorial Board of international experts, ensuring the journal remains on the forefront of scientific advances and community standards. Regular engagement with researchers at conferences and institute visits underscores our commitment to staying abreast of the latest developments in the field.