Points2Regions: Fast, interactive clustering of imaging-based spatial transcriptomics data
Axel Andersson, Andrea Behanova, Christophe Avenel, Jonas Windhager, Filip Malmberg, Carolina Wählby
Cytometry Part A, published 2024-07-03. DOI: 10.1002/cyto.a.24884 (https://onlinelibrary.wiley.com/doi/10.1002/cyto.a.24884)
Abstract
Imaging-based spatial transcriptomics techniques generate data in the form of spatial points belonging to different mRNA classes. A crucial part of analyzing these data is identifying regions with similar compositions of mRNA classes. These biologically interesting regions can manifest at different spatial scales. For example, the composition of mRNA classes on a cellular scale corresponds to cell types, whereas compositions on a millimeter scale correspond to tissue-level structures. Traditional techniques for identifying such regions often rely on complementary data, such as pre-segmented cells, or on lengthy optimization. This limits their applicability to tasks at a particular scale and restricts their capabilities in exploratory analysis. This article introduces “Points2Regions,” a computational tool for identifying regions with similar mRNA compositions. The tool's novelty lies in its rapid feature extraction, which rasterizes points (representing mRNAs) onto a pyramidal grid, and in its efficient clustering, which combines hierarchical and k-means clustering. This enables fast and efficient region discovery across multiple scales without relying on additional data, making it a valuable resource for exploratory analysis. Points2Regions has demonstrated performance similar to state-of-the-art methods on two simulated datasets, without relying on segmented cells, while being several times faster. Experiments on real-world datasets show that regions identified by Points2Regions are similar to those identified in other studies, confirming that Points2Regions can be used to extract biologically relevant regions. The tool is shared as a Python package integrated into TissUUmaps and as a Napari plugin, offering interactive clustering and visualization, significantly enhancing user experience in data exploration.
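The rasterize-then-cluster idea described in the abstract can be illustrated with a minimal, single-scale sketch. This is not the Points2Regions implementation: the function names (`rasterize`, `kmeans`, `cluster_regions`) and the `bin_width` parameter are hypothetical, the pyramidal grid and the hierarchical clustering step are omitted, and the k-means here is a plain Lloyd's iteration with a farthest-first initialization chosen only to keep the example deterministic and dependency-free.

```python
from collections import defaultdict


def rasterize(points, bin_width, n_classes):
    """Count the points of each mRNA class falling into every occupied square bin."""
    bins = defaultdict(lambda: [0] * n_classes)
    for x, y, cls in points:
        bins[(int(x // bin_width), int(y // bin_width))][cls] += 1
    return dict(bins)


def normalize(counts):
    """Convert raw counts to class fractions so dense and sparse bins are comparable."""
    total = sum(counts)
    return [c / total for c in counts]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def kmeans(vectors, k, n_iter=20):
    """Plain Lloyd's k-means; returns one cluster label per input vector."""
    # Farthest-first initialization (an assumption of this sketch):
    # deterministic, and it spreads the starting centers apart.
    centers = [list(vectors[0])]
    while len(centers) < k:
        farthest = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centers))
        centers.append(list(farthest))

    labels = [0] * len(vectors)
    for _ in range(n_iter):
        # Assignment step: each vector joins its nearest center.
        labels = [min(range(k), key=lambda j: sq_dist(v, centers[j])) for v in vectors]
        # Update step: each center moves to the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def cluster_regions(points, bin_width, n_classes, k):
    """Map each occupied bin index (ix, iy) to a cluster label."""
    bins = rasterize(points, bin_width, n_classes)
    keys = sorted(bins)
    features = [normalize(bins[key]) for key in keys]
    return dict(zip(keys, kmeans(features, k)))
```

Each input point is a tuple `(x, y, class_index)`. In this sketch the spatial scale is set entirely by `bin_width`: a small value groups points at roughly cellular resolution, while a large value aggregates compositions over tissue-level areas, so calling `cluster_regions` with several bin widths gives a crude stand-in for exploring multiple scales.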
About the journal:
Cytometry Part A, the journal of quantitative single-cell analysis, features original research reports and reviews of innovative scientific studies employing quantitative single-cell measurement, separation, manipulation, and modeling techniques, as well as original articles on mechanisms of molecular and cellular functions obtained by cytometry techniques.
The journal welcomes submissions from multiple research fields that fully embrace the study of the cytome:
Biomedical Instrumentation Engineering
Biophotonics
Bioinformatics
Cell Biology
Computational Biology
Data Science
Immunology
Parasitology
Microbiology
Neuroscience
Cancer
Stem Cells
Tissue Regeneration.