Lukas Silvester Barth, Hannaneh Fahimi, Parvaneh Joharinad, Jürgen Jost, Janis Keck, Thomas Jan Mikhail
{"title":"Fuzzy Simplicial Sets and Their Application to Geometric Data Analysis","authors":"Lukas Silvester Barth, Hannaneh Fahimi, Parvaneh Joharinad, Jürgen Jost, Janis Keck, Thomas Jan Mikhail","doi":"10.1007/s10485-025-09827-x","DOIUrl":null,"url":null,"abstract":"<div><p>In this article, we expand upon the concepts introduced in Spivak (Metric realization of fuzzy simplicial sets, 2009. http://www.dspivak.net/metric_realization090922.pdf) about the relationship between the category <span>\\(\\textbf{UM}\\)</span> of uber metric spaces and the category <span>\\(\\textbf{sFuz}\\)</span> of fuzzy simplicial sets. We show that fuzzy simplicial sets can be regarded as natural combinatorial generalizations of metric relations. Furthermore, we take inspiration from UMAP (McInnes et al, in: Umap: Uniform manifold approximation and projection for dimension reduction, 2018) to apply the theory to manifold learning, dimension reduction and data visualization, while refining some of their constructions to put the corresponding theory on a more solid footing. A generalization of the adjunction between <span>\\(\\textbf{UM}\\)</span> and <span>\\(\\textbf{sFuz}\\)</span> allows us to view the adjunctions used in both publications as special cases. Moreover, we derive an explicit description of colimits in <span>\\(\\textbf{UM}\\)</span> and the realization functor <span>\\(\\text {Re}:\\textbf{sFuz}\\rightarrow \\textbf{UM}\\)</span>, and show that <span>\\(\\textbf{UM}\\)</span> can be embedded into <span>\\(\\textbf{sFuz}\\)</span>. Furthermore, we prove analogous results for the category of extended-pseudo metric spaces <span>\\(\\textbf{EPMet}\\)</span>. We also provide rigorous definitions of functors that make it possible to recursively merge sets of fuzzy simplicial sets and provide a description of the adjunctions between the category of truncated fuzzy simplicial sets and <span>\\(\\textbf{sFuz}\\)</span>, which we relate to persistent homology. Combining those constructions, we can show a surprising connection between the well-known dimension reduction methods UMAP and Isomap (Tenenbaum et al. in Science 290(5500):2319–2323, 2000) and derive an alternative algorithm, which we call IsUMap, that combines some of the strengths of both methods. Additionally, we developed a new embedding method that allows to preserve clusters detected in the original metric space that we construct from the data. The visualization of the optimization process gives the user information, both about the inner-cluster distributions in the original metric space and their inter-cluster relations. We compare our new method with UMAP, Isomap and t-SNE on a series of low- and high-dimensional datasets and provide explanations for observed differences and improvements.</p></div>","PeriodicalId":7952,"journal":{"name":"Applied Categorical Structures","volume":"33 5","pages":""},"PeriodicalIF":0.5000,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10485-025-09827-x.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Categorical Structures","FirstCategoryId":"100","ListUrlMain":"https://link.springer.com/article/10.1007/s10485-025-09827-x","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
In this article, we expand upon the concepts introduced in Spivak (Metric realization of fuzzy simplicial sets, 2009. http://www.dspivak.net/metric_realization090922.pdf) about the relationship between the category \(\textbf{UM}\) of uber metric spaces and the category \(\textbf{sFuz}\) of fuzzy simplicial sets. We show that fuzzy simplicial sets can be regarded as natural combinatorial generalizations of metric relations. Furthermore, we take inspiration from UMAP (McInnes et al, in: Umap: Uniform manifold approximation and projection for dimension reduction, 2018) to apply the theory to manifold learning, dimension reduction and data visualization, while refining some of their constructions to put the corresponding theory on a more solid footing. A generalization of the adjunction between \(\textbf{UM}\) and \(\textbf{sFuz}\) allows us to view the adjunctions used in both publications as special cases. Moreover, we derive an explicit description of colimits in \(\textbf{UM}\) and the realization functor \(\text {Re}:\textbf{sFuz}\rightarrow \textbf{UM}\), and show that \(\textbf{UM}\) can be embedded into \(\textbf{sFuz}\). Furthermore, we prove analogous results for the category of extended-pseudo metric spaces \(\textbf{EPMet}\). We also provide rigorous definitions of functors that make it possible to recursively merge sets of fuzzy simplicial sets and provide a description of the adjunctions between the category of truncated fuzzy simplicial sets and \(\textbf{sFuz}\), which we relate to persistent homology. Combining those constructions, we can show a surprising connection between the well-known dimension reduction methods UMAP and Isomap (Tenenbaum et al. in Science 290(5500):2319–2323, 2000) and derive an alternative algorithm, which we call IsUMap, that combines some of the strengths of both methods. Additionally, we developed a new embedding method that allows to preserve clusters detected in the original metric space that we construct from the data. The visualization of the optimization process gives the user information, both about the inner-cluster distributions in the original metric space and their inter-cluster relations. We compare our new method with UMAP, Isomap and t-SNE on a series of low- and high-dimensional datasets and provide explanations for observed differences and improvements.
期刊介绍:
Applied Categorical Structures focuses on applications of results, techniques and ideas from category theory to mathematics, physics and computer science. These include the study of topological and algebraic categories, representation theory, algebraic geometry, homological and homotopical algebra, derived and triangulated categories, categorification of (geometric) invariants, categorical investigations in mathematical physics, higher category theory and applications, categorical investigations in functional analysis, in continuous order theory and in theoretical computer science. In addition, the journal also follows the development of emerging fields in which the application of categorical methods proves to be relevant.
Applied Categorical Structures publishes both carefully refereed research papers and survey papers. It promotes communication and increases the dissemination of new results and ideas among mathematicians and computer scientists who use categorical methods in their research.