{"title":"Privacy-Preserving SRS Data Anonymization by Incorporating Missing Values","authors":"Wen-Yang Lin, Kuang-Yung Hsu, Zih-Xun Shen","doi":"10.1109/TAAI.2018.00032","DOIUrl":null,"url":null,"abstract":"Spontaneous Reporting Systems (SRSs) refer to systems used to collect voluntary reporting of adverse drug events (ADEs), which usually contain sensitive personal privacy information. Although many scholars have proposed various privacy protection models, they overlooked characteristics of SRS data. We previously have proposed a feasible privacy model and anonymization method dedicate to SRS data. However, this method is only applicable to complete data, not considering the fact that SRS data contain a lot of missing data. In this paper, we propose a new privacy model Closed MS(k, θ*)-bounding and a new anonymization method, Closed-MSpartition, to process SRS data with missing values. We used US FDA's FAERS data to evaluate our proposed method from the aspects of information loss, privacy risk, and data utility. The results show that our proposed new method can effectively prevent attackers from learning personal privacy without sacrificing data quality and utility.","PeriodicalId":211734,"journal":{"name":"2018 Conference on Technologies and Applications of Artificial Intelligence (TAAI)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Conference on Technologies and Applications of Artificial Intelligence (TAAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TAAI.2018.00032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Spontaneous Reporting Systems (SRSs) refer to systems used to collect voluntary reporting of adverse drug events (ADEs), which usually contain sensitive personal privacy information. Although many scholars have proposed various privacy protection models, they overlooked characteristics of SRS data. We previously have proposed a feasible privacy model and anonymization method dedicate to SRS data. However, this method is only applicable to complete data, not considering the fact that SRS data contain a lot of missing data. In this paper, we propose a new privacy model Closed MS(k, θ*)-bounding and a new anonymization method, Closed-MSpartition, to process SRS data with missing values. We used US FDA's FAERS data to evaluate our proposed method from the aspects of information loss, privacy risk, and data utility. The results show that our proposed new method can effectively prevent attackers from learning personal privacy without sacrificing data quality and utility.