Preserving privacy for sensitive values of individuals in data publishing based on a new additive noise approach

2014 23rd International Conference on Computer Communication and Networks (ICCCN) Pub Date : 2014-09-29 DOI:10.1109/ICCCN.2014.6911855

Hong Zhu, Shengli Tian, Meiyi Xie, Mengyuan Yang

{"title":"Preserving privacy for sensitive values of individuals in data publishing based on a new additive noise approach","authors":"Hong Zhu, Shengli Tian, Meiyi Xie, Mengyuan Yang","doi":"10.1109/ICCCN.2014.6911855","DOIUrl":null,"url":null,"abstract":"The disclosure of individuals' sensitive values is a potentially serious problem in privacy preserving data publishing. The popular ℓ-diversity model is able to prevent victim' sensitive value from being revealed with a certainty higher than 1/ℓ. But, the anonymized tables with ℓ-diversity generated by the methods, such as generalization, anatomy and slicing, are all vulnerable to the attacker with some individuals' sensitive values. In this paper we propose a new approach to publish anonymized table with ℓ-diversity against the attacker having publishing algorithm and some individuals' sensitive values. Such an approach replaces the sensitive attribute (SA) value of each record with an values set consisted of the real SA value and several random noise values. The method can be applied to protect numerical and nominal SA, existing additive noise only is used to protect the numerical SA of anonymized table. Comparing with randomization, our method retains the frequency distribution of SA values of original data and retains more correlations (between SA and other attributes). Experimental evaluation shows that our method preserves more correlations than that of generalization, slicing and anatomy.","PeriodicalId":404048,"journal":{"name":"2014 23rd International Conference on Computer Communication and Networks (ICCCN)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 23rd International Conference on Computer Communication and Networks (ICCCN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCN.2014.6911855","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 14

Abstract

The disclosure of individuals' sensitive values is a potentially serious problem in privacy preserving data publishing. The popular ℓ-diversity model is able to prevent victim' sensitive value from being revealed with a certainty higher than 1/ℓ. But, the anonymized tables with ℓ-diversity generated by the methods, such as generalization, anatomy and slicing, are all vulnerable to the attacker with some individuals' sensitive values. In this paper we propose a new approach to publish anonymized table with ℓ-diversity against the attacker having publishing algorithm and some individuals' sensitive values. Such an approach replaces the sensitive attribute (SA) value of each record with an values set consisted of the real SA value and several random noise values. The method can be applied to protect numerical and nominal SA, existing additive noise only is used to protect the numerical SA of anonymized table. Comparing with randomization, our method retains the frequency distribution of SA values of original data and retains more correlations (between SA and other attributes). Experimental evaluation shows that our method preserves more correlations than that of generalization, slicing and anatomy.

查看原文本刊更多论文

基于新的加性噪声方法的数据发布中个人敏感值的隐私保护

个人敏感价值观的泄露是隐私保护数据发布中一个潜在的严重问题。常用的r -分集模型能够防止受害者敏感值以大于1/ r的确定性被泄露。但是，由泛化、解剖和切片等方法生成的具有n -多样性的匿名表都容易受到具有某些个体敏感值的攻击者的攻击。本文针对具有发布算法和某些个体敏感值的攻击者，提出了一种发布具有r -分集的匿名表的新方法。这种方法将每条记录的敏感属性(SA)值替换为一个由真实SA值和几个随机噪声值组成的值集。该方法可用于保护数字和名义SA，仅利用已有的加性噪声来保护匿名表的数字SA。与随机化相比，我们的方法保留了原始数据SA值的频率分布，并保留了更多的相关性(SA与其他属性之间)。实验结果表明，该方法比泛化、切片和解剖方法保留了更多的相关性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 23rd International Conference on Computer Communication and Networks (ICCCN)

自引率

0.00%

发文量