Frequent Itemset Mining with Hadamard Response Under Local Differential Privacy
Haijiang Liu, Xiangyu Bai, Xuebin Ma, L. Cui
2020 IEEE 10th International Conference on Electronics Information and Emergency Communication (ICEIEC), published 2020-07-01
DOI: 10.1109/ICEIEC49280.2020.9152248 (https://doi.org/10.1109/ICEIEC49280.2020.9152248)
Citations: 1
Abstract
Frequent itemset mining is a basic data mining task that underpins many other data mining tasks. However, the mining process can leak users’ private information. In recent years, local differential privacy has emerged as a relatively reliable and secure way to protect users while mining frequent itemsets. Under local differential privacy, each user perturbs their original data before sending it to the aggregator, which prevents the aggregator from learning the user’s private information. Data mining under local differential privacy faces two major problems: the accuracy of the mined results is low, and each user transmits a large amount of data to the server, which raises communication costs. In this study, we demonstrate that the Hadamard response (HR) algorithm improves the accuracy of the results and reduces the communication cost from k to log k. Finally, we use the frequent pattern tree (FP-tree) algorithm for frequent itemset mining to compare HR against existing algorithms.
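To make the perturb-then-aggregate idea concrete, the following is a minimal Python sketch of a Hadamard-response-style mechanism for estimating the frequencies of k single items. It follows the commonly described high-privacy variant of HR and is not taken from the paper: the function names (`hadamard_size`, `in_set`, `hr_perturb`, `hr_estimate`), the use of a Sylvester Hadamard matrix, and the rejection-sampling step are all assumptions made for illustration, and the sketch covers single-item frequency estimation only, not the full itemset mining pipeline. Each user reports one index in a range of size K < 2k, which takes about log k bits.

```python
import math
import random


def hadamard_size(k):
    """Smallest power of two strictly greater than k (assumed sizing rule)."""
    K = 1
    while K <= k:
        K *= 2
    return K


def in_set(x, y):
    """True if entry (x + 1, y) of the Sylvester Hadamard matrix is +1.

    That entry equals (-1) ** popcount((x + 1) & y). Row 0 is skipped
    because it is all +1 and carries no information.
    """
    return bin((x + 1) & y).count("1") % 2 == 0


def hr_perturb(x, k, eps, rng):
    """User side: map item x in [0, k) to a noisy report y in [0, K)."""
    K = hadamard_size(k)
    # With probability e^eps / (1 + e^eps), report from the +1 set of row x + 1.
    want_inside = rng.random() < math.exp(eps) / (1.0 + math.exp(eps))
    # Rejection sampling: each half holds K / 2 indices, so about 2 tries.
    while True:
        y = rng.randrange(K)
        if in_set(x, y) == want_inside:
            return y


def hr_estimate(reports, k, eps):
    """Aggregator side: unbiased frequency estimates for items 0..k-1."""
    n = len(reports)
    scale = 2.0 * (math.exp(eps) + 1.0) / (math.exp(eps) - 1.0)
    estimates = []
    for x in range(k):
        hits = sum(1 for y in reports if in_set(x, y))
        # A report lands in item x's set with probability
        # 1/2 + p_x * (e^eps - 1) / (2 * (e^eps + 1)); invert that relation.
        estimates.append(scale * (hits / n - 0.5))
    return estimates
```

In this sketch, any two possible inputs produce a given report with probabilities that differ by a factor of at most e^eps, which is the local differential privacy guarantee. The estimates are noisy and can fall slightly below zero for rare items, so a real pipeline would likely clip or normalize them before passing candidate items to a miner such as FP-tree.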