层次时空聚类中度量单元相关性和稳定性

IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY
Roy Cerqueti , Raffaele Mattera
{"title":"层次时空聚类中度量单元相关性和稳定性","authors":"Roy Cerqueti ,&nbsp;Raffaele Mattera","doi":"10.1016/j.spasta.2025.100880","DOIUrl":null,"url":null,"abstract":"<div><div>Understanding the significance of individual data points within clustering structures is critical to effective data analysis. Traditional stability methods, while valuable, often overlook the nuanced impact of individual units, particularly in spatial contexts. In this paper, we explore the concept of unit relevance in clustering analysis, emphasizing its importance in capturing the spatio-temporal nature of the clustering problem. We propose a simple measure of unit relevance, the Unit Relevance Index (URI), and define an overall measure of clustering stability based on the aggregation of computed URIs. Considering two experiments on real datasets with geo-referenced time series, we find that the use of spatial constraints in the clustering task yields more stable results. Therefore, the inclusion of the spatial dimension can be seen as a way to stabilize the clustering.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"66 ","pages":"Article 100880"},"PeriodicalIF":2.1000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Measuring unit relevance and stability in hierarchical spatio-temporal clustering\",\"authors\":\"Roy Cerqueti ,&nbsp;Raffaele Mattera\",\"doi\":\"10.1016/j.spasta.2025.100880\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Understanding the significance of individual data points within clustering structures is critical to effective data analysis. Traditional stability methods, while valuable, often overlook the nuanced impact of individual units, particularly in spatial contexts. In this paper, we explore the concept of unit relevance in clustering analysis, emphasizing its importance in capturing the spatio-temporal nature of the clustering problem. We propose a simple measure of unit relevance, the Unit Relevance Index (URI), and define an overall measure of clustering stability based on the aggregation of computed URIs. Considering two experiments on real datasets with geo-referenced time series, we find that the use of spatial constraints in the clustering task yields more stable results. Therefore, the inclusion of the spatial dimension can be seen as a way to stabilize the clustering.</div></div>\",\"PeriodicalId\":48771,\"journal\":{\"name\":\"Spatial Statistics\",\"volume\":\"66 \",\"pages\":\"Article 100880\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-01-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spatial Statistics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2211675325000028\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675325000028","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

理解聚类结构中单个数据点的重要性对于有效的数据分析至关重要。传统的稳定性方法虽然有价值,但往往忽略了单个单元的细微影响,特别是在空间环境中。在本文中,我们探讨了聚类分析中单位相关性的概念,强调了它在捕捉聚类问题的时空本质方面的重要性。我们提出了一种简单的单元相关性度量,即单元相关性索引(unit relevance Index, URI),并定义了基于计算的URI聚合的聚类稳定性的总体度量。通过对具有地理参考时间序列的真实数据集的两个实验,我们发现在聚类任务中使用空间约束可以获得更稳定的结果。因此,空间维度的包含可以看作是稳定聚类的一种方式。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Measuring unit relevance and stability in hierarchical spatio-temporal clustering
Understanding the significance of individual data points within clustering structures is critical to effective data analysis. Traditional stability methods, while valuable, often overlook the nuanced impact of individual units, particularly in spatial contexts. In this paper, we explore the concept of unit relevance in clustering analysis, emphasizing its importance in capturing the spatio-temporal nature of the clustering problem. We propose a simple measure of unit relevance, the Unit Relevance Index (URI), and define an overall measure of clustering stability based on the aggregation of computed URIs. Considering two experiments on real datasets with geo-referenced time series, we find that the use of spatial constraints in the clustering task yields more stable results. Therefore, the inclusion of the spatial dimension can be seen as a way to stabilize the clustering.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Spatial Statistics
Spatial Statistics GEOSCIENCES, MULTIDISCIPLINARY-MATHEMATICS, INTERDISCIPLINARY APPLICATIONS
CiteScore
4.00
自引率
21.70%
发文量
89
审稿时长
55 days
期刊介绍: Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication. Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信