Title: Privacy-preserving quadratic truth discovery based on precision partitioning
Journal: Computers & Security (Q1, Computer Science, Information Systems; Impact Factor 4.8)
DOI: 10.1016/j.cose.2024.104039
Published: 2024-08-06 (Journal Article)
URL: https://www.sciencedirect.com/science/article/pii/S0167404824003444
Citations: 0
Abstract
Truth discovery is widely used for data collection in crowdsourcing; however, as users' privacy awareness deepens, ordinary truth discovery can no longer meet current demands for privacy protection, and solving the privacy problem is one of the critical challenges of truth discovery. A number of differentially private truth discovery mechanisms have been proposed. However, the privacy budget allocation in existing mechanisms considers neither the differences in data precision across multi-source datasets nor the anomalous results that small-value data produce after aggregation. We propose a precision-partitioning-based two-layer differentially private truth discovery framework that guarantees privacy while accounting for data precision and still achieving high-accuracy truth discovery. The main challenges of this paper are: (i) obtaining highly accurate truth values in sparse-data scenarios without exposing original values to the cloud server when correcting aggregation anomalies after noise addition, and (ii) improving the precision of truth estimation over perturbed streaming data by refining the privacy protection level according to the precision of the data. Specifically, we first design a data-sampling algorithm that determines the data precision of different users and sieves out anomalous and duplicate data, yielding a quality measure for user-uploaded data. We then design a new privacy budget allocation mechanism that incorporates the sampling results from preprocessing and fully accounts for data precision, quantifying each user's privacy as a concrete budget value. For small-value data, we provide a quadratic truth discovery mechanism based on a predictive interpolation algorithm, ensuring the reliability of small-data aggregation results. We prove that our framework achieves differential privacy for user-supplied data, and extensive experiments on three real-world datasets demonstrate the effectiveness of our system framework.
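To make the setting concrete, the sketch below shows a generic differentially private truth discovery loop of the kind the abstract builds on: each user perturbs their values locally with Laplace noise scaled to a per-user privacy budget (a higher budget, e.g. for higher-precision data, means less noise), and the server then iterates CRH-style weighted aggregation. This is a minimal illustration under assumed conventions, not the paper's actual algorithm; the precision partitioning, budget allocation rule, and predictive-interpolation correction for small-value data are all omitted.

```python
import math
import random

def laplace_noise(scale):
    # Sample Laplace(0, scale) via the inverse-CDF method
    u = random.random() - 0.5
    return -scale * math.copysign(1.0, u) * math.log(1.0 - 2.0 * abs(u))

def dp_truth_discovery(reports, epsilons, sensitivity=1.0, iters=10):
    """Illustrative CRH-style truth discovery over locally perturbed data.

    reports:  {user: {item: value}} raw user observations
    epsilons: {user: budget} per-user privacy budgets (hypothetical
              allocation; the paper derives these from data precision)
    """
    # Local perturbation: each user adds Laplace noise before upload,
    # with noise scale sensitivity / epsilon (standard Laplace mechanism)
    noisy = {u: {i: v + laplace_noise(sensitivity / epsilons[u])
                 for i, v in vals.items()}
             for u, vals in reports.items()}
    weights = {u: 1.0 for u in noisy}
    truths = {}
    items = {i for vals in noisy.values() for i in vals}
    for _ in range(iters):
        # Truth update: per-item weighted average of uploaded values
        truths = {
            i: sum(weights[u] * noisy[u][i] for u in noisy if i in noisy[u])
               / sum(weights[u] for u in noisy if i in noisy[u])
            for i in items
        }
        # Weight update: users whose reports sit closer to the current
        # truth estimates receive larger weights (log-ratio rule)
        errs = {u: sum((v - truths[i]) ** 2 for i, v in vals.items()) + 1e-12
                for u, vals in noisy.items()}
        total = sum(errs.values())
        weights = {u: max(math.log(total / e), 1e-6) for u, e in errs.items()}
    return truths
```

With a near-infinite budget the noise vanishes and the loop behaves like plain CRH, downweighting the outlying user; shrinking the budgets trades accuracy for privacy, which is exactly the tension the precision-aware budget allocation in the paper is designed to manage.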
About the journal:
Computers & Security is the most respected technical journal in the IT security field. With its high-profile editorial board and informative regular features and columns, the journal is essential reading for IT security professionals around the world.
Computers & Security provides you with a unique blend of leading-edge research and sound practical management advice. It is aimed at professionals involved with computer security, audit, control, and data integrity in all sectors: industry, commerce, and academia. Recognized worldwide as THE primary source of reference for applied research and technical expertise, it is your first step to fully secure systems.