Kernel density estimation with geographically masked points

Xun Shi, Jennifer A. Alford-Teaster, T. Onega
{"title":"Kernel density estimation with geographically masked points","authors":"Xun Shi, Jennifer A. Alford-Teaster, T. Onega","doi":"10.1109/GEOINFORMATICS.2009.5292881","DOIUrl":null,"url":null,"abstract":"Geographic masking displaces points to hide their identities. It has been used in health-related studies to protect patients' confidentialities. The main concern in this process is the balance between the protection of confidentiality and the preservation of the original spatial pattern. However, there is paucity in literature on quantification of this balance. We conducted a preliminary research on the most commonly used geographic masking method, the point dithering method, intending to quantify the original spatial pattern preserved under different dithering thresholds. We performed kernel density estimation (KDE) using a series of bandwidths to create density surfaces from both the original points and the dithered points, and then calculated Pearson's correlation coefficients between a density surface of the original points and its corresponding surfaces of the dithered points created with the same bandwidth. Using simulated data in New Hampshire, our test reveals a clear relationship between the bandwidth of KDE and the dithering threshold: When the bandwidth is 5 times the dithering threshold, the density surfaces of the original points and the dithered points are almost identical (in most cases the correlation coefficient ≫ 0.99). This relationship provides reference for choosing appropriate bandwidth in point pattern analysis, such as cluster detection, when working with dithered points.","PeriodicalId":121212,"journal":{"name":"2009 17th International Conference on Geoinformatics","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 17th International Conference on Geoinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GEOINFORMATICS.2009.5292881","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17

Abstract

Geographic masking displaces points to hide their identities. It has been used in health-related studies to protect patients' confidentialities. The main concern in this process is the balance between the protection of confidentiality and the preservation of the original spatial pattern. However, there is paucity in literature on quantification of this balance. We conducted a preliminary research on the most commonly used geographic masking method, the point dithering method, intending to quantify the original spatial pattern preserved under different dithering thresholds. We performed kernel density estimation (KDE) using a series of bandwidths to create density surfaces from both the original points and the dithered points, and then calculated Pearson's correlation coefficients between a density surface of the original points and its corresponding surfaces of the dithered points created with the same bandwidth. Using simulated data in New Hampshire, our test reveals a clear relationship between the bandwidth of KDE and the dithering threshold: When the bandwidth is 5 times the dithering threshold, the density surfaces of the original points and the dithered points are almost identical (in most cases the correlation coefficient ≫ 0.99). This relationship provides reference for choosing appropriate bandwidth in point pattern analysis, such as cluster detection, when working with dithered points.
地理掩蔽点核密度估计
地理掩蔽取代了点来隐藏他们的身份。它已被用于与健康有关的研究,以保护患者的隐私。在这个过程中,主要关注的是保护保密性和保留原始空间格局之间的平衡。然而,关于这种平衡的量化文献很少。我们对最常用的地理掩蔽方法——点抖动法进行了初步研究,试图量化在不同抖动阈值下保留的原始空间格局。我们使用一系列带宽执行核密度估计(KDE),从原始点和抖动点创建密度面,然后计算原始点的密度面与使用相同带宽创建的抖动点的相应表面之间的Pearson相关系数。使用新罕布什尔州的模拟数据,我们的测试揭示了KDE的带宽和抖动阈值之间的明确关系:当带宽是抖动阈值的5倍时,原始点和抖动点的密度表面几乎相同(在大多数情况下相关系数为0.99)。这一关系为在处理抖动点时进行点图分析(如聚类检测)时选择合适的带宽提供了参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信