{"title":"Kernel density estimation with geographically masked points","authors":"Xun Shi, Jennifer A. Alford-Teaster, T. Onega","doi":"10.1109/GEOINFORMATICS.2009.5292881","DOIUrl":null,"url":null,"abstract":"Geographic masking displaces points to hide their identities. It has been used in health-related studies to protect patients' confidentialities. The main concern in this process is the balance between the protection of confidentiality and the preservation of the original spatial pattern. However, there is paucity in literature on quantification of this balance. We conducted a preliminary research on the most commonly used geographic masking method, the point dithering method, intending to quantify the original spatial pattern preserved under different dithering thresholds. We performed kernel density estimation (KDE) using a series of bandwidths to create density surfaces from both the original points and the dithered points, and then calculated Pearson's correlation coefficients between a density surface of the original points and its corresponding surfaces of the dithered points created with the same bandwidth. Using simulated data in New Hampshire, our test reveals a clear relationship between the bandwidth of KDE and the dithering threshold: When the bandwidth is 5 times the dithering threshold, the density surfaces of the original points and the dithered points are almost identical (in most cases the correlation coefficient ≫ 0.99). This relationship provides reference for choosing appropriate bandwidth in point pattern analysis, such as cluster detection, when working with dithered points.","PeriodicalId":121212,"journal":{"name":"2009 17th International Conference on Geoinformatics","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 17th International Conference on Geoinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GEOINFORMATICS.2009.5292881","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17
Abstract
Geographic masking displaces points to hide their identities. It has been used in health-related studies to protect patients' confidentialities. The main concern in this process is the balance between the protection of confidentiality and the preservation of the original spatial pattern. However, there is paucity in literature on quantification of this balance. We conducted a preliminary research on the most commonly used geographic masking method, the point dithering method, intending to quantify the original spatial pattern preserved under different dithering thresholds. We performed kernel density estimation (KDE) using a series of bandwidths to create density surfaces from both the original points and the dithered points, and then calculated Pearson's correlation coefficients between a density surface of the original points and its corresponding surfaces of the dithered points created with the same bandwidth. Using simulated data in New Hampshire, our test reveals a clear relationship between the bandwidth of KDE and the dithering threshold: When the bandwidth is 5 times the dithering threshold, the density surfaces of the original points and the dithered points are almost identical (in most cases the correlation coefficient ≫ 0.99). This relationship provides reference for choosing appropriate bandwidth in point pattern analysis, such as cluster detection, when working with dithered points.