{"title":"Differentially private geospatial data publication based on grid clustering","authors":"Dongni Yang, Songyan Li, Zhaobin Liu, Xinfeng Ye","doi":"10.1504/ijes.2019.102435","DOIUrl":null,"url":null,"abstract":"Collecting geospatial data from location-based services can provide location evidence while analysing spatial information. However, releasing location data may result in the disclosure of sensitive personal information. The adaptive grid method (AG) uses differential privacy to protect information. In AG, the algorithm uses two levels of grids over data domain. However, it does not take into account the data distribution. Usually, the accuracy will be reduced in response to long-range counting queries. In this paper, the adjacent grid cells with similar data density are clustered together. Laplace noise is added to the clusters created by the clustering of the grid cells. The noisy count obtained from the grid cells that form each cluster is evenly redistributed to the grid cells in the cluster. Extensive experiments on real-world datasets showed that the query accuracy of the proposed method is higher than the existing methods.","PeriodicalId":412308,"journal":{"name":"Int. J. Embed. Syst.","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Embed. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/ijes.2019.102435","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Collecting geospatial data from location-based services can provide location evidence while analysing spatial information. However, releasing location data may result in the disclosure of sensitive personal information. The adaptive grid method (AG) uses differential privacy to protect information. In AG, the algorithm uses two levels of grids over data domain. However, it does not take into account the data distribution. Usually, the accuracy will be reduced in response to long-range counting queries. In this paper, the adjacent grid cells with similar data density are clustered together. Laplace noise is added to the clusters created by the clustering of the grid cells. The noisy count obtained from the grid cells that form each cluster is evenly redistributed to the grid cells in the cluster. Extensive experiments on real-world datasets showed that the query accuracy of the proposed method is higher than the existing methods.