The Shape of Things: Topological Data Analysis

N. Lazar, Hyunnam Ryu
{"title":"The Shape of Things: Topological Data Analysis","authors":"N. Lazar, Hyunnam Ryu","doi":"10.1080/09332480.2021.1915036","DOIUrl":null,"url":null,"abstract":"An interesting feature of much modern Big Data is that the data we collect, or the data we want to analyze, are not necessarily in the traditional matrix or array form familiar from our textbooks. They may be coerced to such a format for relative ease of analysis, but this is not a strong justification. Past columns have explored new methods that exploit the natural structure of such data sets more directly. Topological data analysis (TDA) is one such method. Much daunting mathematics lies behind the methods of TDA, but it is possible to gain an idea and understanding of the approach and its potential usefulness even without a deep dive into the intricacies of topology, homology classes, and the like. In fact, the basic idea is quite simple: to study data through their low-dimension topological features, which translate into connected components (dimension 0), loops (dimension 1), and voids (dimension 2). Higher dimensions do exist, but often do not contain much useful information. For threedimensional data, up to the second dimension topological features can be considered at most. A good analogy to make the meaning of these features concrete is a piece of Swiss cheese. The piece of cheese itself is one connected component. The holes that are apparent on the The Shape of Things: Topological Data Analysis","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"3 1","pages":"59 - 64"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chance (New York, N.Y.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/09332480.2021.1915036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

An interesting feature of much modern Big Data is that the data we collect, or the data we want to analyze, are not necessarily in the traditional matrix or array form familiar from our textbooks. They may be coerced to such a format for relative ease of analysis, but this is not a strong justification. Past columns have explored new methods that exploit the natural structure of such data sets more directly. Topological data analysis (TDA) is one such method. Much daunting mathematics lies behind the methods of TDA, but it is possible to gain an idea and understanding of the approach and its potential usefulness even without a deep dive into the intricacies of topology, homology classes, and the like. In fact, the basic idea is quite simple: to study data through their low-dimension topological features, which translate into connected components (dimension 0), loops (dimension 1), and voids (dimension 2). Higher dimensions do exist, but often do not contain much useful information. For threedimensional data, up to the second dimension topological features can be considered at most. A good analogy to make the meaning of these features concrete is a piece of Swiss cheese. The piece of cheese itself is one connected component. The holes that are apparent on the The Shape of Things: Topological Data Analysis
事物的形状:拓扑数据分析
许多现代大数据的一个有趣特征是,我们收集的数据,或者我们想要分析的数据,不一定是我们在教科书中熟悉的传统矩阵或数组形式。他们可能被迫使用这样的格式来相对容易地进行分析,但这并不是一个强有力的理由。过去的专栏已经探讨了更直接地利用这些数据集的自然结构的新方法。拓扑数据分析(TDA)就是这样一种方法。TDA方法的背后隐藏着许多令人生畏的数学知识,但是即使不深入研究拓扑、同调类等的复杂性,也有可能获得对该方法及其潜在用途的概念和理解。事实上,基本思想非常简单:通过低维拓扑特征来研究数据,这些特征可以转化为连接的组件(维度0)、循环(维度1)和空洞(维度2)。高维确实存在,但通常不包含太多有用的信息。对于三维数据,最多可以考虑到二维拓扑特征。将这些特征的含义具体化的一个很好的类比是一块瑞士奶酪。这块奶酪本身就是一个相连的组成部分。在《事物的形状:拓扑数据分析》中可以明显看到的洞
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信