基于可达性剖面kullback-leibler散度的多空间尺度密度聚类检测

Orhun Aydin, C. Osorio-Murillo, Cheng-Chia Huang
{"title":"基于可达性剖面kullback-leibler散度的多空间尺度密度聚类检测","authors":"Orhun Aydin, C. Osorio-Murillo, Cheng-Chia Huang","doi":"10.1145/3557918.3565870","DOIUrl":null,"url":null,"abstract":"Density-based clustering methods are frequently used to define spatial clusters and outliers (noise) for location-only data. Different algorithms for solving this problem emerged over the past few decades, with their main difference being the numerical representation of the spatial density. A problem not addressed by conventional density-based clustering methods is defining alternate spatial cluster maps at statistically significant spatial scales. This problem differs from conventional clustering, as the goal of finding alternate clusters is to define different spatial cluster maps for all statistically significant spatial scales. Knowledge of distinct spatial scales pertinent to clustering is important for understanding various scales underlying the data. In addition, alternate clusters with different spatial scales can inform decisions that require to be made at different spatial granularity. In this paper, we introduce a statistical test that uses Kullback-Leibler (KL) divergence loss between different spatial density profiles to identify all statistically significant spatial scales at which clustering occurs. The proposed method defines different clustering maps that reflect different scales at which spatial clusters occur. We define the divergence on a 1-D representation of cluster density, the reachability profile, to cluster spatial units with varying spatial scales. We illustrate the use of multiple spatial clustering at different scales by comparing the proposed method to the state-of-the-art for defining a single map of multiscale clusters, HDBScan. We conclude the paper by applying the proposed method to physical and human geography problems, area of interest delineation, and wildfire cluster modeling, respectively.","PeriodicalId":428859,"journal":{"name":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Density-based cluster detection at multiple spatial scales via kullback-leibler divergence of reachability profiles\",\"authors\":\"Orhun Aydin, C. Osorio-Murillo, Cheng-Chia Huang\",\"doi\":\"10.1145/3557918.3565870\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Density-based clustering methods are frequently used to define spatial clusters and outliers (noise) for location-only data. Different algorithms for solving this problem emerged over the past few decades, with their main difference being the numerical representation of the spatial density. A problem not addressed by conventional density-based clustering methods is defining alternate spatial cluster maps at statistically significant spatial scales. This problem differs from conventional clustering, as the goal of finding alternate clusters is to define different spatial cluster maps for all statistically significant spatial scales. Knowledge of distinct spatial scales pertinent to clustering is important for understanding various scales underlying the data. In addition, alternate clusters with different spatial scales can inform decisions that require to be made at different spatial granularity. In this paper, we introduce a statistical test that uses Kullback-Leibler (KL) divergence loss between different spatial density profiles to identify all statistically significant spatial scales at which clustering occurs. The proposed method defines different clustering maps that reflect different scales at which spatial clusters occur. We define the divergence on a 1-D representation of cluster density, the reachability profile, to cluster spatial units with varying spatial scales. We illustrate the use of multiple spatial clustering at different scales by comparing the proposed method to the state-of-the-art for defining a single map of multiscale clusters, HDBScan. We conclude the paper by applying the proposed method to physical and human geography problems, area of interest delineation, and wildfire cluster modeling, respectively.\",\"PeriodicalId\":428859,\"journal\":{\"name\":\"Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3557918.3565870\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3557918.3565870","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

基于密度的聚类方法经常用于定义空间聚类和位置数据的异常值(噪声)。在过去的几十年里,解决这个问题的不同算法出现了,它们的主要区别在于空间密度的数值表示。传统的基于密度的聚类方法没有解决的一个问题是在统计显著的空间尺度上定义交替的空间聚类图。这个问题与传统的聚类不同,因为寻找替代聚类的目标是为所有统计上显著的空间尺度定义不同的空间聚类图。与聚类相关的不同空间尺度的知识对于理解数据背后的各种尺度是重要的。此外,具有不同空间尺度的备用集群可以为需要在不同空间粒度上做出的决策提供信息。在本文中,我们引入了一个统计检验,使用不同空间密度剖面之间的Kullback-Leibler (KL)散度损失来识别发生聚类的所有统计显著的空间尺度。该方法定义了不同的聚类图,以反映空间集群发生的不同尺度。我们将散度定义为集群密度的一维表示,即可达性剖面,以不同空间尺度的集群空间单元。我们通过将所提出的方法与最先进的用于定义多尺度集群的单一地图HDBScan的方法进行比较,说明了在不同尺度下使用多个空间集群。最后,我们将提出的方法分别应用于自然和人文地理问题、兴趣区域划分和野火聚类建模。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Density-based cluster detection at multiple spatial scales via kullback-leibler divergence of reachability profiles
Density-based clustering methods are frequently used to define spatial clusters and outliers (noise) for location-only data. Different algorithms for solving this problem emerged over the past few decades, with their main difference being the numerical representation of the spatial density. A problem not addressed by conventional density-based clustering methods is defining alternate spatial cluster maps at statistically significant spatial scales. This problem differs from conventional clustering, as the goal of finding alternate clusters is to define different spatial cluster maps for all statistically significant spatial scales. Knowledge of distinct spatial scales pertinent to clustering is important for understanding various scales underlying the data. In addition, alternate clusters with different spatial scales can inform decisions that require to be made at different spatial granularity. In this paper, we introduce a statistical test that uses Kullback-Leibler (KL) divergence loss between different spatial density profiles to identify all statistically significant spatial scales at which clustering occurs. The proposed method defines different clustering maps that reflect different scales at which spatial clusters occur. We define the divergence on a 1-D representation of cluster density, the reachability profile, to cluster spatial units with varying spatial scales. We illustrate the use of multiple spatial clustering at different scales by comparing the proposed method to the state-of-the-art for defining a single map of multiscale clusters, HDBScan. We conclude the paper by applying the proposed method to physical and human geography problems, area of interest delineation, and wildfire cluster modeling, respectively.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信