Alex C. Soupir , Julia Wrobel , Jordan H. Creed , Oscar E. Ospina , Christopher M. Wilson , Brandon J. Manley , Lauren C. Peres , Brooke L. Fridley
{"title":"scSpatialSIM:空间单细胞分子数据模拟器","authors":"Alex C. Soupir , Julia Wrobel , Jordan H. Creed , Oscar E. Ospina , Christopher M. Wilson , Brandon J. Manley , Lauren C. Peres , Brooke L. Fridley","doi":"10.1016/j.softx.2025.102223","DOIUrl":null,"url":null,"abstract":"<div><div>The increasing use of spatial molecular technologies such as multiplex immunofluorescence (mIF) and spatial transcriptomics (SRT) has driven the need for robust statistical methods to analyze the spatial architecture of tissues. However, a lack of consensus on “gold standard” approaches present challenges for benchmarking and comparison. To address this gap, we developed “scSpatialSIM”, an <em>R</em> package for simulating biologically realistic spatial single-cell molecular data. “scSpatialSIM” enables users to efficiently simulate single-cell spatial patterns without requiring reference datasets, incorporating features such as cell clustering, cell co-localization, tissue compartments, and tissue holes. Additionally, the package supports simulation of both categorical data (e.g., cell phenotypes) and continuous values (e.g., protein expression or gene expression), and integrates with other <em>R</em> packages for downstream spatial analyses. To demonstrate its utility, we applied “scSpatialSIM” to benchmark univariate point pattern summary functions, including Ripley’s K(<em>r</em>), nearest neighbor G(<em>r</em>), and pair correlation g(<em>r</em>), across simulated scenarios. The results showed that Ripley’s K(<em>r</em>) consistently detected clustering across multiple radii, outperforming other methods in sensitivity and robustness. While scSpatialSIM is limited to simulating cell clustering and co-localization rather than broader tissue-level sub-domains, it provides a flexible and scalable framework for generating diverse spatial data. The development of scSpatialSIM facilitates comparative evaluation of spatial statistics and enables researchers to explore hypothetical scenarios at scale, advancing the development of novel methods to characterize the spatial organization of tissues. By providing a platform for spatial simulation, scSpatialSIM supports innovation in spatial molecular research and fosters new insights into tissue architecture and cellular interactions.</div></div>","PeriodicalId":21905,"journal":{"name":"SoftwareX","volume":"31 ","pages":"Article 102223"},"PeriodicalIF":2.4000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"scSpatialSIM: a simulator of spatial single-cell molecular data\",\"authors\":\"Alex C. Soupir , Julia Wrobel , Jordan H. Creed , Oscar E. Ospina , Christopher M. Wilson , Brandon J. Manley , Lauren C. Peres , Brooke L. Fridley\",\"doi\":\"10.1016/j.softx.2025.102223\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The increasing use of spatial molecular technologies such as multiplex immunofluorescence (mIF) and spatial transcriptomics (SRT) has driven the need for robust statistical methods to analyze the spatial architecture of tissues. However, a lack of consensus on “gold standard” approaches present challenges for benchmarking and comparison. To address this gap, we developed “scSpatialSIM”, an <em>R</em> package for simulating biologically realistic spatial single-cell molecular data. “scSpatialSIM” enables users to efficiently simulate single-cell spatial patterns without requiring reference datasets, incorporating features such as cell clustering, cell co-localization, tissue compartments, and tissue holes. Additionally, the package supports simulation of both categorical data (e.g., cell phenotypes) and continuous values (e.g., protein expression or gene expression), and integrates with other <em>R</em> packages for downstream spatial analyses. To demonstrate its utility, we applied “scSpatialSIM” to benchmark univariate point pattern summary functions, including Ripley’s K(<em>r</em>), nearest neighbor G(<em>r</em>), and pair correlation g(<em>r</em>), across simulated scenarios. The results showed that Ripley’s K(<em>r</em>) consistently detected clustering across multiple radii, outperforming other methods in sensitivity and robustness. While scSpatialSIM is limited to simulating cell clustering and co-localization rather than broader tissue-level sub-domains, it provides a flexible and scalable framework for generating diverse spatial data. The development of scSpatialSIM facilitates comparative evaluation of spatial statistics and enables researchers to explore hypothetical scenarios at scale, advancing the development of novel methods to characterize the spatial organization of tissues. By providing a platform for spatial simulation, scSpatialSIM supports innovation in spatial molecular research and fosters new insights into tissue architecture and cellular interactions.</div></div>\",\"PeriodicalId\":21905,\"journal\":{\"name\":\"SoftwareX\",\"volume\":\"31 \",\"pages\":\"Article 102223\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SoftwareX\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352711025001906\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SoftwareX","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352711025001906","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
scSpatialSIM: a simulator of spatial single-cell molecular data
The increasing use of spatial molecular technologies such as multiplex immunofluorescence (mIF) and spatial transcriptomics (SRT) has driven the need for robust statistical methods to analyze the spatial architecture of tissues. However, a lack of consensus on “gold standard” approaches present challenges for benchmarking and comparison. To address this gap, we developed “scSpatialSIM”, an R package for simulating biologically realistic spatial single-cell molecular data. “scSpatialSIM” enables users to efficiently simulate single-cell spatial patterns without requiring reference datasets, incorporating features such as cell clustering, cell co-localization, tissue compartments, and tissue holes. Additionally, the package supports simulation of both categorical data (e.g., cell phenotypes) and continuous values (e.g., protein expression or gene expression), and integrates with other R packages for downstream spatial analyses. To demonstrate its utility, we applied “scSpatialSIM” to benchmark univariate point pattern summary functions, including Ripley’s K(r), nearest neighbor G(r), and pair correlation g(r), across simulated scenarios. The results showed that Ripley’s K(r) consistently detected clustering across multiple radii, outperforming other methods in sensitivity and robustness. While scSpatialSIM is limited to simulating cell clustering and co-localization rather than broader tissue-level sub-domains, it provides a flexible and scalable framework for generating diverse spatial data. The development of scSpatialSIM facilitates comparative evaluation of spatial statistics and enables researchers to explore hypothetical scenarios at scale, advancing the development of novel methods to characterize the spatial organization of tissues. By providing a platform for spatial simulation, scSpatialSIM supports innovation in spatial molecular research and fosters new insights into tissue architecture and cellular interactions.
期刊介绍:
SoftwareX aims to acknowledge the impact of software on today''s research practice, and on new scientific discoveries in almost all research domains. SoftwareX also aims to stress the importance of the software developers who are, in part, responsible for this impact. To this end, SoftwareX aims to support publication of research software in such a way that: The software is given a stamp of scientific relevance, and provided with a peer-reviewed recognition of scientific impact; The software developers are given the credits they deserve; The software is citable, allowing traditional metrics of scientific excellence to apply; The academic career paths of software developers are supported rather than hindered; The software is publicly available for inspection, validation, and re-use. Above all, SoftwareX aims to inform researchers about software applications, tools and libraries with a (proven) potential to impact the process of scientific discovery in various domains. The journal is multidisciplinary and accepts submissions from within and across subject domains such as those represented within the broad thematic areas below: Mathematical and Physical Sciences; Environmental Sciences; Medical and Biological Sciences; Humanities, Arts and Social Sciences. Originating from these broad thematic areas, the journal also welcomes submissions of software that works in cross cutting thematic areas, such as citizen science, cybersecurity, digital economy, energy, global resource stewardship, health and wellbeing, etcetera. SoftwareX specifically aims to accept submissions representing domain-independent software that may impact more than one research domain.