{"title":"An Enhanced Approach for Privacy Preserving Record Linkage during Data Integration","authors":"N. Shekokar, V. Shelake","doi":"10.1109/ICIM49319.2020.244689","DOIUrl":null,"url":null,"abstract":"Today collecting and integrating data from multiple datasets has become a vital part to perform various analysis tasks. Record linkage plays an important component during data integration to detect and link similar data instances. However, personal and sensitive data need to be protected in a manner so that there is no re-identification of original attribute values by the party performing record linkage. Now-a-days, the Bloom filter encoding has utilized across many countries for privacy preserving record linkage. Moreover, the hardened approaches of Bloom filter encoding enhance privacy at the cost of reduced linkage accuracy. Still the security concerns remain with the Bloom filter encoding techniques because attackers can re-identify the obfuscated data with the use of available public resources. We propose an enhanced approach for privacy preserving record linkage (EPPRL) during data integration to achieve better privacy with acceptable linkage accuracy. The results show that the proposed approach EPPRL outperforms in comparison with Balanced Bloom filter encoding technique in terms of precision, recall, f-measure and re-identification of attribute values.","PeriodicalId":129517,"journal":{"name":"2020 6th International Conference on Information Management (ICIM)","volume":"143 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 6th International Conference on Information Management (ICIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIM49319.2020.244689","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Today collecting and integrating data from multiple datasets has become a vital part to perform various analysis tasks. Record linkage plays an important component during data integration to detect and link similar data instances. However, personal and sensitive data need to be protected in a manner so that there is no re-identification of original attribute values by the party performing record linkage. Now-a-days, the Bloom filter encoding has utilized across many countries for privacy preserving record linkage. Moreover, the hardened approaches of Bloom filter encoding enhance privacy at the cost of reduced linkage accuracy. Still the security concerns remain with the Bloom filter encoding techniques because attackers can re-identify the obfuscated data with the use of available public resources. We propose an enhanced approach for privacy preserving record linkage (EPPRL) during data integration to achieve better privacy with acceptable linkage accuracy. The results show that the proposed approach EPPRL outperforms in comparison with Balanced Bloom filter encoding technique in terms of precision, recall, f-measure and re-identification of attribute values.