K-Anonymity through the Enhanced Clustering Method

2016 IEEE 13th International Conference on e-Business Engineering (ICEBE) Pub Date: 2016-11-01 DOI: 10.1109/ICEBE.2016.024

Md. Ileas Pramanik, Raymond Y. K. Lau, Wenping Zhang


Citations: 17

Abstract

With the rise of the Social Web, people are increasingly inclined to share personal records, and even to make them publicly available on the Internet. However, such widespread disclosure of personal data has raised serious privacy concerns. If a released dataset is not properly anonymized, individual privacy is at great risk. K-anonymity is a popular and practical approach to anonymizing datasets. In this study, we use a new clustering approach to achieve k-anonymity through enhanced data distortion that assures minimal information loss. During the clustering process, we include an additional constraint, minimal information loss, which is not incorporated into traditional clustering approaches. Our proposed algorithm supports a data release process in which data are not distorted more than is needed to achieve k-anonymity. We also develop more appropriate metrics for measuring the quality of generalization. The new metrics are suitable for both numeric and categorical attributes. Our experimental results show that the proposed algorithm causes significantly less information loss than existing clustering algorithms.
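The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea it describes: grouping records into clusters of at least k members while greedily keeping information loss low. It is not the authors' algorithm or their metric. The function names `cluster_loss` and `greedy_k_anonymize`, the greedy seeding strategy, and the loss definition (normalized range for numeric attributes, normalized distinct-value count for categorical attributes) are all assumptions made for illustration.

```python
from typing import List, Sequence, Union

Value = Union[int, float, str]
Record = Sequence[Value]


def cluster_loss(cluster: List[Record], domain: List[Record]) -> float:
    """Loss of generalizing `cluster` into one equivalence class.

    Hypothetical metric (not the paper's): a numeric attribute contributes
    (cluster range) / (domain range); a categorical attribute contributes
    (distinct values - 1) / (domain distinct values - 1). The per-attribute
    sum is multiplied by the cluster size.
    """
    total = 0.0
    for j in range(len(domain[0])):
        col = [r[j] for r in cluster]
        dom = [r[j] for r in domain]
        if isinstance(dom[0], str):  # categorical attribute
            num, denom = len(set(col)) - 1, len(set(dom)) - 1
        else:  # numeric attribute
            num, denom = max(col) - min(col), max(dom) - min(dom)
        total += num / denom if denom else 0.0
    return total * len(cluster)


def greedy_k_anonymize(records: List[Record], k: int) -> List[List[int]]:
    """Return clusters of record indices, each with at least k members."""
    if k < 1 or len(records) < k:
        raise ValueError("need at least k records and k >= 1")

    def loss_with(cluster: List[int], extra: int) -> float:
        rows = [records[i] for i in cluster] + [records[extra]]
        return cluster_loss(rows, records)

    remaining = list(range(len(records)))
    clusters: List[List[int]] = []
    while len(remaining) >= k:
        cluster = [remaining.pop(0)]  # assumed seeding: next unassigned record
        while len(cluster) < k:
            # Add the record that keeps the cluster's loss smallest.
            best = min(remaining, key=lambda i: loss_with(cluster, i))
            remaining.remove(best)
            cluster.append(best)
        clusters.append(cluster)

    # Fewer than k records are left: attach each to the cluster whose
    # loss increases the least.
    for i in remaining:
        target = min(
            clusters,
            key=lambda c: loss_with(c, i)
            - cluster_loss([records[x] for x in c], records),
        )
        target.append(i)
    return clusters


if __name__ == "__main__":
    data = [(25, "F"), (27, "F"), (26, "F"), (60, "M"), (62, "M"), (61, "M"), (40, "M")]
    for group in greedy_k_anonymize(data, k=3):
        print([data[i] for i in group])
```

Each returned cluster can then be generalized to a single range (numeric) or value set (categorical) per attribute, so that every released record is indistinguishable from at least k - 1 others on those attributes. A taxonomy-based categorical metric, as is common in the k-anonymity literature, would replace the distinct-value count used here.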
