迈向地理上稳健、统计上显著的区域托管模式检测

Subhankar Ghosh, Jayant Gupta, Arun Sharma, Shuai An, S. Shekhar
{"title":"迈向地理上稳健、统计上显著的区域托管模式检测","authors":"Subhankar Ghosh, Jayant Gupta, Arun Sharma, Shuai An, S. Shekhar","doi":"10.1145/3557989.3566158","DOIUrl":null,"url":null,"abstract":"Given a set S of spatial feature-types, its feature-instances, a study area, and a neighbor relationship, the goal is to find pairs such that C is a statistically significant regional colocation pattern in region rg. For example Caribou Coffee and Starbucks are significantly co-located in Minneapolis but not in Dallas at present. This problem has applications in a wide variety of domains including ecology, economics, and sociology. The problem is computationally challenging due to the exponential number of regional colocation patterns and candidate regions. The current literature on regional colocation pattern detection has not addressed statistical significance which can result in spurious (chance) pattern instances. In this paper, we propose a novel technique for mining statistically significant regional colocation patterns. Our approach determines regions based on geographically defined boundaries (e.g., counties) unlike previous works which employed clustering, or regular polygons to enumerate candidate regions. To reduce spurious patterns, we perform a statistical significance test by modeling the observed data points with multiple Monte Carlo simulations within the corresponding regions. Using Safegraph POI dataset, this paper provides a case study on retail establishments in Minnesota for validation of proposed ideas. The paper also provides a detailed interpretation of discovered patterns using game theory and regional economics.","PeriodicalId":330320,"journal":{"name":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on GeoSpatial Simulation","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Towards geographically robust statistically significant regional colocation pattern detection\",\"authors\":\"Subhankar Ghosh, Jayant Gupta, Arun Sharma, Shuai An, S. Shekhar\",\"doi\":\"10.1145/3557989.3566158\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Given a set S of spatial feature-types, its feature-instances, a study area, and a neighbor relationship, the goal is to find pairs such that C is a statistically significant regional colocation pattern in region rg. For example Caribou Coffee and Starbucks are significantly co-located in Minneapolis but not in Dallas at present. This problem has applications in a wide variety of domains including ecology, economics, and sociology. The problem is computationally challenging due to the exponential number of regional colocation patterns and candidate regions. The current literature on regional colocation pattern detection has not addressed statistical significance which can result in spurious (chance) pattern instances. In this paper, we propose a novel technique for mining statistically significant regional colocation patterns. Our approach determines regions based on geographically defined boundaries (e.g., counties) unlike previous works which employed clustering, or regular polygons to enumerate candidate regions. To reduce spurious patterns, we perform a statistical significance test by modeling the observed data points with multiple Monte Carlo simulations within the corresponding regions. Using Safegraph POI dataset, this paper provides a case study on retail establishments in Minnesota for validation of proposed ideas. The paper also provides a detailed interpretation of discovered patterns using game theory and regional economics.\",\"PeriodicalId\":330320,\"journal\":{\"name\":\"Proceedings of the 5th ACM SIGSPATIAL International Workshop on GeoSpatial Simulation\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th ACM SIGSPATIAL International Workshop on GeoSpatial Simulation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3557989.3566158\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on GeoSpatial Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3557989.3566158","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

给定空间特征类型、特征实例、研究区域和邻居关系的集合S,目标是找到使C在区域rg中具有统计显著性的区域配置模式对。例如,驯鹿咖啡(Caribou Coffee)和星巴克(Starbucks)目前在明尼阿波利斯(Minneapolis)有很大的合作,但在达拉斯(Dallas)却没有。这个问题在许多领域都有应用,包括生态学、经济学和社会学。由于区域托管模式和候选区域的指数数量,该问题在计算上具有挑战性。目前关于区域托管模式检测的文献没有解决统计显著性问题,这可能导致虚假(偶然)模式实例。在本文中,我们提出了一种新的技术来挖掘统计上显著的区域配置模式。我们的方法基于地理上定义的边界(例如,县)来确定区域,而不像以前的工作那样使用聚类或正则多边形来枚举候选区域。为了减少虚假模式,我们通过在相应区域内使用多个蒙特卡罗模拟对观察到的数据点进行建模来执行统计显著性检验。本文使用Safegraph POI数据集,对明尼苏达州的零售场所进行了案例研究,以验证所提出的想法。本文还运用博弈论和区域经济学对发现的模式进行了详细的解释。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Towards geographically robust statistically significant regional colocation pattern detection
Given a set S of spatial feature-types, its feature-instances, a study area, and a neighbor relationship, the goal is to find pairs such that C is a statistically significant regional colocation pattern in region rg. For example Caribou Coffee and Starbucks are significantly co-located in Minneapolis but not in Dallas at present. This problem has applications in a wide variety of domains including ecology, economics, and sociology. The problem is computationally challenging due to the exponential number of regional colocation patterns and candidate regions. The current literature on regional colocation pattern detection has not addressed statistical significance which can result in spurious (chance) pattern instances. In this paper, we propose a novel technique for mining statistically significant regional colocation patterns. Our approach determines regions based on geographically defined boundaries (e.g., counties) unlike previous works which employed clustering, or regular polygons to enumerate candidate regions. To reduce spurious patterns, we perform a statistical significance test by modeling the observed data points with multiple Monte Carlo simulations within the corresponding regions. Using Safegraph POI dataset, this paper provides a case study on retail establishments in Minnesota for validation of proposed ideas. The paper also provides a detailed interpretation of discovered patterns using game theory and regional economics.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信