低维零密度区域的渐近性

IF 1.2 4区 数学 Q2 STATISTICS & PROBABILITY
Hengrui Luo, Steven N. MacEachern, Mario Peruggia
{"title":"低维零密度区域的渐近性","authors":"Hengrui Luo, Steven N. MacEachern, Mario Peruggia","doi":"10.1080/02331888.2023.2262665","DOIUrl":null,"url":null,"abstract":"AbstractTopological data analysis (TDA) allows us to explore the topological features of a dataset. Among topological features, lower dimensional ones have recently drawn the attention of practitioners in mathematics and statistics due to their potential to aid the discovery of low dimensional structure in a data set. However, lower dimensional features are usually challenging to detect based on finite samples and using TDA methods that ignore the probabilistic mechanism that generates the data. In this paper, lower dimensional topological features occurring as zero-density regions of density functions are introduced and thoroughly investigated. Specifically, we consider sequences of coverings for the support of a density function in which the coverings are comprised of balls with shrinking radii. We show that, when these coverings satisfy certain sufficient conditions as the sample size goes to infinity, we can detect lower dimensional, zero-density regions with increasingly higher probability while guarding against false detection. We supplement the theoretical developments with the discussion of simulated experiments that elucidate the behaviour of the methodology for different choices of the tuning parameters that govern the construction of the covering sequences and characterize the asymptotic results.Keywords: Topological data analysiscovering constructionzero-density regions AcknowledgmentsWe thank the anonymous referee, whose comments greatly improve the article. We thank the AE for helpful comments and handling.Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationFundingThis material is based upon work supported by the National Science Foundation [grants numbers DMS-1613110, DMS-2015552, and SES-1921523].","PeriodicalId":54358,"journal":{"name":"Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":1.2000,"publicationDate":"2023-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Asymptotics of lower dimensional zero-density regions\",\"authors\":\"Hengrui Luo, Steven N. MacEachern, Mario Peruggia\",\"doi\":\"10.1080/02331888.2023.2262665\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"AbstractTopological data analysis (TDA) allows us to explore the topological features of a dataset. Among topological features, lower dimensional ones have recently drawn the attention of practitioners in mathematics and statistics due to their potential to aid the discovery of low dimensional structure in a data set. However, lower dimensional features are usually challenging to detect based on finite samples and using TDA methods that ignore the probabilistic mechanism that generates the data. In this paper, lower dimensional topological features occurring as zero-density regions of density functions are introduced and thoroughly investigated. Specifically, we consider sequences of coverings for the support of a density function in which the coverings are comprised of balls with shrinking radii. We show that, when these coverings satisfy certain sufficient conditions as the sample size goes to infinity, we can detect lower dimensional, zero-density regions with increasingly higher probability while guarding against false detection. We supplement the theoretical developments with the discussion of simulated experiments that elucidate the behaviour of the methodology for different choices of the tuning parameters that govern the construction of the covering sequences and characterize the asymptotic results.Keywords: Topological data analysiscovering constructionzero-density regions AcknowledgmentsWe thank the anonymous referee, whose comments greatly improve the article. We thank the AE for helpful comments and handling.Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationFundingThis material is based upon work supported by the National Science Foundation [grants numbers DMS-1613110, DMS-2015552, and SES-1921523].\",\"PeriodicalId\":54358,\"journal\":{\"name\":\"Statistics\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2023-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/02331888.2023.2262665\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/02331888.2023.2262665","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 7

摘要

拓扑数据分析(TDA)允许我们探索数据集的拓扑特征。在拓扑特征中,低维特征最近引起了数学和统计学从业者的注意,因为它们有可能帮助发现数据集中的低维结构。然而,基于有限样本和使用忽略生成数据的概率机制的TDA方法来检测低维特征通常具有挑战性。本文引入并深入研究了密度函数中作为零密度区域的低维拓扑特征。具体来说,我们考虑用于支持密度函数的覆盖序列,其中覆盖序列由半径缩小的球组成。我们证明,当这些覆盖满足一定的充分条件时,随着样本量趋于无穷,我们可以以越来越高的概率检测到低维、零密度的区域,同时防止误检测。我们用模拟实验的讨论来补充理论发展,这些实验阐明了控制覆盖序列构造和表征渐近结果的调谐参数的不同选择的方法的行为。关键词:结构零密度区域拓扑数据分析致谢感谢匿名审稿人,他的意见对本文的改进有很大的帮助。我们感谢AE提供的有益意见和处理。披露声明作者未报告潜在的利益冲突。本材料基于美国国家科学基金会支持的工作[资助号:DMS-1613110, DMS-2015552和SES-1921523]。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Asymptotics of lower dimensional zero-density regions
AbstractTopological data analysis (TDA) allows us to explore the topological features of a dataset. Among topological features, lower dimensional ones have recently drawn the attention of practitioners in mathematics and statistics due to their potential to aid the discovery of low dimensional structure in a data set. However, lower dimensional features are usually challenging to detect based on finite samples and using TDA methods that ignore the probabilistic mechanism that generates the data. In this paper, lower dimensional topological features occurring as zero-density regions of density functions are introduced and thoroughly investigated. Specifically, we consider sequences of coverings for the support of a density function in which the coverings are comprised of balls with shrinking radii. We show that, when these coverings satisfy certain sufficient conditions as the sample size goes to infinity, we can detect lower dimensional, zero-density regions with increasingly higher probability while guarding against false detection. We supplement the theoretical developments with the discussion of simulated experiments that elucidate the behaviour of the methodology for different choices of the tuning parameters that govern the construction of the covering sequences and characterize the asymptotic results.Keywords: Topological data analysiscovering constructionzero-density regions AcknowledgmentsWe thank the anonymous referee, whose comments greatly improve the article. We thank the AE for helpful comments and handling.Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationFundingThis material is based upon work supported by the National Science Foundation [grants numbers DMS-1613110, DMS-2015552, and SES-1921523].
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Statistics
Statistics 数学-统计学与概率论
CiteScore
1.00
自引率
0.00%
发文量
59
审稿时长
12 months
期刊介绍: Statistics publishes papers developing and analysing new methods for any active field of statistics, motivated by real-life problems. Papers submitted for consideration should provide interesting and novel contributions to statistical theory and its applications with rigorous mathematical results and proofs. Moreover, numerical simulations and application to real data sets can improve the quality of papers, and should be included where appropriate. Statistics does not publish papers which represent mere application of existing procedures to case studies, and papers are required to contain methodological or theoretical innovation. Topics of interest include, for example, nonparametric statistics, time series, analysis of topological or functional data. Furthermore the journal also welcomes submissions in the field of theoretical econometrics and its links to mathematical statistics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信