{"title":"Privacy Preserving Mining Sequential Pattern based on Random Data Perturbation","authors":"Weimin Ouyang","doi":"10.1109/icsai53574.2021.9664200","DOIUrl":null,"url":null,"abstract":"Privacy preserving data mining is a hot research direction of data mining in the big data environment. If Data mining has been used properly, it will endanger the privacy and the information. Inspired by that, we researched a method via random data perturbation method to keep privacy preserving in sequential patterns mining. Its strategy is as follows: First, noisy events are added into each event sequence of the original database. Then, an algorithm PP-Span (Privacy-Preserving Sequential pattern mining) is proposed to reconstruct the frequent sequences from these noise-added data sequences. Our results showed that it can outperform a lot of traditional methods.","PeriodicalId":131284,"journal":{"name":"2021 7th International Conference on Systems and Informatics (ICSAI)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 7th International Conference on Systems and Informatics (ICSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icsai53574.2021.9664200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Privacy preserving data mining is a hot research direction of data mining in the big data environment. If Data mining has been used properly, it will endanger the privacy and the information. Inspired by that, we researched a method via random data perturbation method to keep privacy preserving in sequential patterns mining. Its strategy is as follows: First, noisy events are added into each event sequence of the original database. Then, an algorithm PP-Span (Privacy-Preserving Sequential pattern mining) is proposed to reconstruct the frequent sequences from these noise-added data sequences. Our results showed that it can outperform a lot of traditional methods.