What are the optimum quasi-identifiers to re-identify medical records?

2018 20th International Conference on Advanced Communication Technology (ICACT) Pub Date : 2018-03-23 DOI:10.23919/ICACT.2018.8323926

Yong Ju Lee, K. Lee

{"title":"What are the optimum quasi-identifiers to re-identify medical records?","authors":"Yong Ju Lee, K. Lee","doi":"10.23919/ICACT.2018.8323926","DOIUrl":null,"url":null,"abstract":"Recently, medical records are shared to online for a purpose of medical research and expert opinion. There is a problem with sharing the medical records. If someone knows the subject of the record by using various methods, it can result in an invasion of the patient's privacy. To solve the problem, it is important to carefully address the tradeoff between data sharing and privacy. For this reason, de-identification techniques are applicable to address the problem. However, de-identified data has a risk of re-identification. There are two problems with using de-identification techniques. First, de-identification techniques may damage data utility although it may decrease a risk of re-identification. Second, de-identified data can be re-identified from inference using background knowledge. The objective of this paper is to analyze the probability of re-identification according to inferable quasi-identifiers. We analyzed factors, inferable quasi-identifiers, which can be inferred from background knowledge. Then, we estimated the probability of re-identification from taking advantage of the factors. As a result, we determined the effect of the re-identification according to the type and the range of inferable quasi-identifiers. This paper contributes to a decision on de-identification target and level for protecting patient's privacy through a comparative analysis of the probability of re-identification according to the type and the range of inference.","PeriodicalId":228625,"journal":{"name":"2018 20th International Conference on Advanced Communication Technology (ICACT)","volume":"88 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 20th International Conference on Advanced Communication Technology (ICACT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ICACT.2018.8323926","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

Recently, medical records are shared to online for a purpose of medical research and expert opinion. There is a problem with sharing the medical records. If someone knows the subject of the record by using various methods, it can result in an invasion of the patient's privacy. To solve the problem, it is important to carefully address the tradeoff between data sharing and privacy. For this reason, de-identification techniques are applicable to address the problem. However, de-identified data has a risk of re-identification. There are two problems with using de-identification techniques. First, de-identification techniques may damage data utility although it may decrease a risk of re-identification. Second, de-identified data can be re-identified from inference using background knowledge. The objective of this paper is to analyze the probability of re-identification according to inferable quasi-identifiers. We analyzed factors, inferable quasi-identifiers, which can be inferred from background knowledge. Then, we estimated the probability of re-identification from taking advantage of the factors. As a result, we determined the effect of the re-identification according to the type and the range of inferable quasi-identifiers. This paper contributes to a decision on de-identification target and level for protecting patient's privacy through a comparative analysis of the probability of re-identification according to the type and the range of inference.

查看原文本刊更多论文

重新识别医疗记录的最佳准标识符是什么?

最近，出于医学研究和专家意见的目的，医疗记录在网上共享。共享医疗记录有个问题。如果有人通过各种方法知道记录的主题，可能会导致侵犯患者的隐私。要解决这个问题，重要的是仔细处理数据共享和隐私之间的权衡。出于这个原因，去识别技术适用于解决这个问题。然而，去识别的数据有被重新识别的风险。使用去识别技术有两个问题。首先，去识别技术可能会损害数据效用，尽管它可能会降低重新识别的风险。其次，可以使用背景知识从推理中重新识别去识别的数据。本文的目的是分析可推理准标识符的再识别概率。我们分析了可以从背景知识中推断出的因素，可推断的准标识符。然后，我们利用这些因素估计了重新识别的概率。因此，我们根据可推理准标识符的类型和范围来确定再识别的效果。本文根据推理的类型和范围，通过对再识别概率的比较分析，有助于决定患者隐私的去识别目标和保护水平。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 20th International Conference on Advanced Communication Technology (ICACT)

自引率

0.00%

发文量