Vitoria Guardieiro, Felipe Inagaki de Oliveira, Harish Doraiswamy, Luis Gustavo Nonato, Claudio Silva
{"title":"TopoMap++:更快、更节省空间的拓扑保证投影计算技术","authors":"Vitoria Guardieiro, Felipe Inagaki de Oliveira, Harish Doraiswamy, Luis Gustavo Nonato, Claudio Silva","doi":"arxiv-2409.07257","DOIUrl":null,"url":null,"abstract":"High-dimensional data, characterized by many features, can be difficult to\nvisualize effectively. Dimensionality reduction techniques, such as PCA, UMAP,\nand t-SNE, address this challenge by projecting the data into a\nlower-dimensional space while preserving important relationships. TopoMap is\nanother technique that excels at preserving the underlying structure of the\ndata, leading to interpretable visualizations. In particular, TopoMap maps the\nhigh-dimensional data into a visual space, guaranteeing that the 0-dimensional\npersistence diagram of the Rips filtration of the visual space matches the one\nfrom the high-dimensional data. However, the original TopoMap algorithm can be\nslow and its layout can be too sparse for large and complex datasets. In this\npaper, we propose three improvements to TopoMap: 1) a more space-efficient\nlayout, 2) a significantly faster implementation, and 3) a novel TreeMap-based\nrepresentation that makes use of the topological hierarchy to aid the\nexploration of the projections. These advancements make TopoMap, now referred\nto as TopoMap++, a more powerful tool for visualizing high-dimensional data\nwhich we demonstrate through different use case scenarios.","PeriodicalId":501541,"journal":{"name":"arXiv - CS - Human-Computer Interaction","volume":"43 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"TopoMap++: A faster and more space efficient technique to compute projections with topological guarantees\",\"authors\":\"Vitoria Guardieiro, Felipe Inagaki de Oliveira, Harish Doraiswamy, Luis Gustavo Nonato, Claudio Silva\",\"doi\":\"arxiv-2409.07257\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"High-dimensional data, characterized by many features, can be difficult to\\nvisualize effectively. Dimensionality reduction techniques, such as PCA, UMAP,\\nand t-SNE, address this challenge by projecting the data into a\\nlower-dimensional space while preserving important relationships. TopoMap is\\nanother technique that excels at preserving the underlying structure of the\\ndata, leading to interpretable visualizations. In particular, TopoMap maps the\\nhigh-dimensional data into a visual space, guaranteeing that the 0-dimensional\\npersistence diagram of the Rips filtration of the visual space matches the one\\nfrom the high-dimensional data. However, the original TopoMap algorithm can be\\nslow and its layout can be too sparse for large and complex datasets. In this\\npaper, we propose three improvements to TopoMap: 1) a more space-efficient\\nlayout, 2) a significantly faster implementation, and 3) a novel TreeMap-based\\nrepresentation that makes use of the topological hierarchy to aid the\\nexploration of the projections. These advancements make TopoMap, now referred\\nto as TopoMap++, a more powerful tool for visualizing high-dimensional data\\nwhich we demonstrate through different use case scenarios.\",\"PeriodicalId\":501541,\"journal\":{\"name\":\"arXiv - CS - Human-Computer Interaction\",\"volume\":\"43 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Human-Computer Interaction\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.07257\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Human-Computer Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
TopoMap++: A faster and more space efficient technique to compute projections with topological guarantees
High-dimensional data, characterized by many features, can be difficult to
visualize effectively. Dimensionality reduction techniques, such as PCA, UMAP,
and t-SNE, address this challenge by projecting the data into a
lower-dimensional space while preserving important relationships. TopoMap is
another technique that excels at preserving the underlying structure of the
data, leading to interpretable visualizations. In particular, TopoMap maps the
high-dimensional data into a visual space, guaranteeing that the 0-dimensional
persistence diagram of the Rips filtration of the visual space matches the one
from the high-dimensional data. However, the original TopoMap algorithm can be
slow and its layout can be too sparse for large and complex datasets. In this
paper, we propose three improvements to TopoMap: 1) a more space-efficient
layout, 2) a significantly faster implementation, and 3) a novel TreeMap-based
representation that makes use of the topological hierarchy to aid the
exploration of the projections. These advancements make TopoMap, now referred
to as TopoMap++, a more powerful tool for visualizing high-dimensional data
which we demonstrate through different use case scenarios.