Enhanced slicing models for preserving privacy in data publication

S. Kiruthika, M. Raseen
{"title":"Enhanced slicing models for preserving privacy in data publication","authors":"S. Kiruthika, M. Raseen","doi":"10.1109/ICCTET.2013.6675998","DOIUrl":null,"url":null,"abstract":"Privacy preservation in publishing of microdata has been studied extensively in recent years. Microdata contain records each of which contains information about an individual entity, such as a person, a household, or an organization. Several anonymization techniques, such as generalization, bucketization and slicing have been designed for privacy preserving microdata publishing. That generalization loses considerable amount of information, especially for high dimensional data. Bucketization does not prevent membership disclosure and does not apply for data that do not have a clear separation between quasi-identifying attributes and sensitive attributes. Slicing have a drawback when more number of similar attribute value and the sensitive value may present in the different tuples may give the original tuple while performing the random permutation. The utility of the dataset is lost by generation the fake tuples. Thus enhanced slicing models have designed to overcome the drawbacks of slicing. The suppression slicing is done by suppressing any one of the attribute value in the tuples and then perform the slicing. Thus utility is maintained with minimum loss by suppressing only very few values and privacy is maintained by random permutation. The next model is Mondrian slicing in this the random permutation is done with all the buckets not within the single bucket. Thus same utility of the original dataset is maintained.","PeriodicalId":242568,"journal":{"name":"2013 International Conference on Current Trends in Engineering and Technology (ICCTET)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Current Trends in Engineering and Technology (ICCTET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCTET.2013.6675998","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Privacy preservation in publishing of microdata has been studied extensively in recent years. Microdata contain records each of which contains information about an individual entity, such as a person, a household, or an organization. Several anonymization techniques, such as generalization, bucketization and slicing have been designed for privacy preserving microdata publishing. That generalization loses considerable amount of information, especially for high dimensional data. Bucketization does not prevent membership disclosure and does not apply for data that do not have a clear separation between quasi-identifying attributes and sensitive attributes. Slicing have a drawback when more number of similar attribute value and the sensitive value may present in the different tuples may give the original tuple while performing the random permutation. The utility of the dataset is lost by generation the fake tuples. Thus enhanced slicing models have designed to overcome the drawbacks of slicing. The suppression slicing is done by suppressing any one of the attribute value in the tuples and then perform the slicing. Thus utility is maintained with minimum loss by suppressing only very few values and privacy is maintained by random permutation. The next model is Mondrian slicing in this the random permutation is done with all the buckets not within the single bucket. Thus same utility of the original dataset is maintained.
增强切片模型,以保护数据发布中的隐私
近年来,微数据发布中的隐私保护问题得到了广泛的研究。微数据包含记录,每条记录都包含有关个体实体的信息,例如个人、家庭或组织。为了保护微数据发布的隐私,已经设计了几种匿名化技术,如泛化、桶化和切片。这种泛化丢失了相当多的信息,特别是对于高维数据。分类不会阻止成员披露,也不适用于准识别属性和敏感属性之间没有明确分离的数据。切片有一个缺点,当在不同的元组中可能存在较多的相似属性值和敏感值时,在执行随机排列时可能会给出原始元组。生成假元组会失去数据集的效用。因此,增强的切片模型被设计来克服切片的缺点。抑制切片通过抑制元组中的任何一个属性值,然后执行切片来完成。因此,通过只抑制很少的值,以最小的损失维持效用,并通过随机排列维护隐私。下一个模型是蒙德里安切片在这个模型中,所有的桶都是随机排列的,而不是在一个桶内。因此保持了原始数据集的相同效用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信