Privacy Preserving Frequent Itemsets Mining Based on Database Reconstruction
Authors: Shaoxin Li, Nankun Mu, X. Liao
Published in: 2018 Eighth International Conference on Information Science and Technology (ICIST), 2018-06-01
DOI: https://doi.org/10.1109/ICIST.2018.8426074
Citation count: 0
Abstract
Privacy-preserving frequent itemset mining (PPFIM) aims to transform a database so that frequent itemsets can still be mined efficiently without revealing any sensitive knowledge. However, most existing PPFIM methods work by sanitizing the database, which makes the conflict between knowledge mining and privacy preservation hard to avoid. To address this, we propose DR-PPFIM, a novel PPFIM algorithm based on database reconstruction that offers both high data utility and a high degree of privacy. In DR-PPFIM, a sanitization algorithm is first applied to remove all sensitive knowledge. A novel database reconstruction scheme then builds a new database from the remaining non-sensitive frequent itemsets. In addition, we propose a further hiding strategy that reduces the importance of sensitive itemsets, lowering the threat of disclosing confidential knowledge. Experimental evaluations on real datasets are reported, showing that DR-PPFIM outperforms other state-of-the-art algorithms.
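The sanitize-then-reconstruct idea can be illustrated with a toy sketch. This is not the DR-PPFIM algorithm, whose details the abstract does not give. It is a minimal illustration under stated assumptions: brute-force mining, a sanitization step that simply drops every frequent itemset containing a sensitive one, and a naive reconstruction that emits each maximal remaining itemset as a transaction. The function names `mine_frequent_itemsets`, `drop_sensitive`, and `reconstruct` are hypothetical, and the further hiding strategy is not modeled.

```python
from itertools import combinations


def mine_frequent_itemsets(transactions, min_count):
    """Brute-force frequent itemset mining; only suitable for toy data."""
    items = sorted({item for t in transactions for item in t})
    frequent = {}
    for size in range(1, len(items) + 1):
        found = False
        for candidate in combinations(items, size):
            itemset = frozenset(candidate)
            count = sum(1 for t in transactions if itemset <= t)
            if count >= min_count:
                frequent[itemset] = count
                found = True
        if not found:
            # No frequent itemset of this size, so none of a larger size.
            break
    return frequent


def drop_sensitive(frequent, sensitive):
    """Assumed sanitization: discard itemsets that contain a sensitive one."""
    return {
        itemset: count
        for itemset, count in frequent.items()
        if not any(s <= itemset for s in sensitive)
    }


def reconstruct(non_sensitive):
    """Naive reconstruction (an assumption, not the paper's scheme):
    emit each maximal non-sensitive itemset once per unit of support."""
    maximal = [
        s for s in non_sensitive
        if not any(s < other for other in non_sensitive)
    ]
    new_db = []
    for itemset in maximal:
        new_db.extend([itemset] * non_sensitive[itemset])
    return new_db


# Toy database; the itemset {b, c} is treated as sensitive.
original_db = [frozenset(t) for t in ("abc", "abc", "ab", "ac", "bd", "bd")]
sensitive_itemsets = [frozenset("bc")]

frequent = mine_frequent_itemsets(original_db, min_count=2)
kept = drop_sensitive(frequent, sensitive_itemsets)
released_db = reconstruct(kept)

for transaction in released_db:
    print(sorted(transaction))
```

Because every released transaction is itself a non-sensitive itemset, no transaction in `released_db` contains a sensitive itemset. The supports of the remaining itemsets can drift from their original values under this naive scheme, which is one of the utility costs a real reconstruction method would need to control.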