Research on Government Data Publishing Based on Differential Privacy Model

Chunhui Piao, Yajuan Shi, Yunzuo Zhang, Xuehong Jiang
Published in: 2017 IEEE 14th International Conference on e-Business Engineering (ICEBE), 2017-11-01. DOI: 10.1109/ICEBE.2017.21
Citations: 5

Abstract

As policies for opening and sharing information resources take effect, protecting citizens' privacy has become a key concern for both the government and the public. This paper discusses the risk of disclosing citizens' private information through government data publishing and analyzes the main privacy-preserving methods for data publishing. Because most existing privacy protection models for data publishing cannot resist attacks that exploit an attacker's growing background knowledge, the paper establishes a differential privacy framework for publishing governmental statistical data. Based on this framework, it proposes a data publishing algorithm that uses a MaxDiff histogram. Following the differential privacy method, Laplace noise is added to the original dataset, which prevents disclosure of citizens' private information even when attackers hold strong background knowledge. Adjacent data bins are then grouped according to the maximum frequency difference, so that a differential privacy histogram with minimum average error can be constructed. Theoretical analysis and experimental comparison show that the proposed algorithm not only protects citizens' privacy effectively but also reduces query sensitivity and improves the utility of the published data.
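The two steps named in the abstract, Laplace noise followed by MaxDiff grouping of adjacent bins, can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not specify in detail. It assumes that neighbouring datasets differ by one record (so a count histogram has sensitivity 1), that group boundaries are placed at the largest gaps between adjacent noisy counts, and that each group is published as its mean. All function names and the `num_groups` parameter are hypothetical.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponential variates with mean `scale`
    # follows a Laplace distribution with location 0 and that scale.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def add_laplace_noise(counts, epsilon, rng):
    # Assumption: adding or removing one record changes one bin by 1,
    # so the sensitivity is 1 and the noise scale is 1 / epsilon.
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    scale = 1.0 / epsilon
    return [c + laplace_noise(scale, rng) for c in counts]


def maxdiff_boundaries(values, num_groups):
    # Place (num_groups - 1) split points at the largest differences
    # between adjacent bins; a split at index i + 1 separates bins i and i + 1.
    if not 1 <= num_groups <= len(values):
        raise ValueError("num_groups must be between 1 and the number of bins")
    gaps = [abs(values[i + 1] - values[i]) for i in range(len(values) - 1)]
    largest = sorted(range(len(gaps)), key=lambda i: gaps[i], reverse=True)
    return sorted(i + 1 for i in largest[: num_groups - 1])


def smooth_groups(values, boundaries):
    # Replace every bin by the mean of the group it belongs to.
    edges = [0, *boundaries, len(values)]
    smoothed = []
    for start, stop in zip(edges[:-1], edges[1:]):
        group = values[start:stop]
        smoothed.extend([sum(group) / len(group)] * len(group))
    return smoothed


def publish_histogram(counts, epsilon, num_groups, seed=None):
    # Hypothetical pipeline: noise first, then group the noisy counts.
    rng = random.Random(seed)
    noisy = add_laplace_noise(counts, epsilon, rng)
    return smooth_groups(noisy, maxdiff_boundaries(noisy, num_groups))


if __name__ == "__main__":
    print(publish_histogram([10, 12, 30, 32, 5], epsilon=1.0, num_groups=3, seed=0))
```

In this sketch the grouping only ever sees the noisy counts, so it is post-processing and does not weaken the guarantee provided by the Laplace step. Averaging within a group reduces the noise variance for bins with similar counts, at the cost of approximation error when a group mixes dissimilar bins; how the paper balances the two to reach minimum average error is not stated in the abstract.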