增强隐私的大数据分析数据聚合

Q4 Decision Sciences

ECTI Transactions on Computer and Information Technology Pub Date : 2023-09-30 DOI:10.37936/ecti-cit.2023173.252952

Surapon Riyana, Kittikorn Sasujit, Nigran Homdoung

{"title":"增强隐私的大数据分析数据聚合","authors":"Surapon Riyana, Kittikorn Sasujit, Nigran Homdoung","doi":"10.37936/ecti-cit.2023173.252952","DOIUrl":null,"url":null,"abstract":"Data utility and data privacy are serious issues that must be considered when datasets are utilized in big data analytics such that they are traded off. That is, the datasets have high data utility and often have high risks in terms of privacy violation issues. To balance the data utility and the data privacy in datasets when they are provided to utilize in big data analytics, several privacy preservation models have been proposed, e.g., k-Anonymity, l-Diversity, t-Closeness, Anatomy, k-Likeness, and (lp1, . . . , lpn)-Privacy. Unfortunately, these privacy preservation models are highly complex data models and still have data utility issues that must be addressed. To rid these vulnerabilities of these models, a new privacy preservation model is proposed in this work. It is based on aggregate query answers that can guarantee the confidence of the range and the number of values that can be re-identified. Furthermore, we show that the proposed model is more effcient and effective in big data analytics by using extensive experiments.","PeriodicalId":37046,"journal":{"name":"ECTI Transactions on Computer and Information Technology","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Privacy-Enhancing Data Aggregation for Big Data Analytics\",\"authors\":\"Surapon Riyana, Kittikorn Sasujit, Nigran Homdoung\",\"doi\":\"10.37936/ecti-cit.2023173.252952\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data utility and data privacy are serious issues that must be considered when datasets are utilized in big data analytics such that they are traded off. That is, the datasets have high data utility and often have high risks in terms of privacy violation issues. To balance the data utility and the data privacy in datasets when they are provided to utilize in big data analytics, several privacy preservation models have been proposed, e.g., k-Anonymity, l-Diversity, t-Closeness, Anatomy, k-Likeness, and (lp1, . . . , lpn)-Privacy. Unfortunately, these privacy preservation models are highly complex data models and still have data utility issues that must be addressed. To rid these vulnerabilities of these models, a new privacy preservation model is proposed in this work. It is based on aggregate query answers that can guarantee the confidence of the range and the number of values that can be re-identified. Furthermore, we show that the proposed model is more effcient and effective in big data analytics by using extensive experiments.\",\"PeriodicalId\":37046,\"journal\":{\"name\":\"ECTI Transactions on Computer and Information Technology\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ECTI Transactions on Computer and Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.37936/ecti-cit.2023173.252952\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Decision Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ECTI Transactions on Computer and Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37936/ecti-cit.2023173.252952","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Decision Sciences","Score":null,"Total":0}

引用次数: 0

摘要

当数据集被用于大数据分析时，数据效用和数据隐私是必须考虑的严重问题。也就是说，数据集具有很高的数据效用，并且在隐私侵犯问题方面通常具有很高的风险。为了平衡数据集在大数据分析中使用时的数据效用和数据隐私，提出了几个隐私保护模型，如k-匿名、l-多样性、t-接近、解剖、k-相似和(lp1，…)。lpn)隐私。不幸的是，这些隐私保护模型是高度复杂的数据模型，仍然存在必须解决的数据实用问题。为了消除这些模型的漏洞，本文提出了一种新的隐私保护模型。它基于聚合查询答案，可以保证范围的置信度和可重新标识的值的数量。此外，我们通过大量的实验表明，所提出的模型在大数据分析中更加高效和有效。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Privacy-Enhancing Data Aggregation for Big Data Analytics

Data utility and data privacy are serious issues that must be considered when datasets are utilized in big data analytics such that they are traded off. That is, the datasets have high data utility and often have high risks in terms of privacy violation issues. To balance the data utility and the data privacy in datasets when they are provided to utilize in big data analytics, several privacy preservation models have been proposed, e.g., k-Anonymity, l-Diversity, t-Closeness, Anatomy, k-Likeness, and (lp1, . . . , lpn)-Privacy. Unfortunately, these privacy preservation models are highly complex data models and still have data utility issues that must be addressed. To rid these vulnerabilities of these models, a new privacy preservation model is proposed in this work. It is based on aggregate query answers that can guarantee the confidence of the range and the number of values that can be re-identified. Furthermore, we show that the proposed model is more effcient and effective in big data analytics by using extensive experiments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ECTI Transactions on Computer and Information Technology Engineering-Electrical and Electronic Engineering

CiteScore

1.20

自引率

0.00%

发文量

审稿时长

15 weeks