Privacy-Preservation for Publishing Sample Availability Data with Personal Identifiers

A. Gholami, E. Laure, P. Somogyi, O. Spjuth, Salman Niazi, J. Dowling
{"title":"Privacy-Preservation for Publishing Sample Availability Data with Personal Identifiers","authors":"A. Gholami, E. Laure, P. Somogyi, O. Spjuth, Salman Niazi, J. Dowling","doi":"10.12720/JOMB.4.2.117-125","DOIUrl":null,"url":null,"abstract":"Medical organizations collect, store and process vast amounts of sensitive information about patients. Easy access to this information by researchers is crucial to improving medical research, but in many institutions, cumbersome security measures and walled-gardens have created a situation where even information about what medical data is out there is not available. One of the main security challenges in this area, is enabling researchers to cross-link different medical studies, while preserving the privacy of the patients involved. In this paper, we introduce a privacy-preserving system for publishing sample availability data that allows researchers to make queries that crosscut different studies. That is, researchers can ask questions such as how many patients have had both diabetes and prostate cancer, where the diabetes and prostate cancer information originates from different clinical registries. We realize our solution by having a two-level anonymiziation mechanism, where our toolkit for publishing availability data first pseudonymizes personal identifiers and then anonymizes sensitive attributes. Our toolkit also includes a web-based server that stores the encrypted pseudonymized sample data and allows researchers to execute cross-linked queries across different study data. We believe that our toolkit contributes a first step to support the privacy preserving publication of data containing personal identifiers.","PeriodicalId":437476,"journal":{"name":"Journal of medical and bioengineering","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of medical and bioengineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12720/JOMB.4.2.117-125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Medical organizations collect, store and process vast amounts of sensitive information about patients. Easy access to this information by researchers is crucial to improving medical research, but in many institutions, cumbersome security measures and walled-gardens have created a situation where even information about what medical data is out there is not available. One of the main security challenges in this area, is enabling researchers to cross-link different medical studies, while preserving the privacy of the patients involved. In this paper, we introduce a privacy-preserving system for publishing sample availability data that allows researchers to make queries that crosscut different studies. That is, researchers can ask questions such as how many patients have had both diabetes and prostate cancer, where the diabetes and prostate cancer information originates from different clinical registries. We realize our solution by having a two-level anonymiziation mechanism, where our toolkit for publishing availability data first pseudonymizes personal identifiers and then anonymizes sensitive attributes. Our toolkit also includes a web-based server that stores the encrypted pseudonymized sample data and allows researchers to execute cross-linked queries across different study data. We believe that our toolkit contributes a first step to support the privacy preserving publication of data containing personal identifiers.
个人标识符发布样本可用性数据的隐私保护
医疗机构收集、存储和处理大量关于患者的敏感信息。研究人员方便地获取这些信息对于改善医学研究至关重要,但在许多机构中,繁琐的安全措施和封闭的花园造成了一种情况,即甚至无法获得有关医疗数据的信息。这一领域的主要安全挑战之一是使研究人员能够交叉链接不同的医学研究,同时保护所涉及患者的隐私。在本文中,我们介绍了一个用于发布样本可用性数据的隐私保护系统,该系统允许研究人员进行横切不同研究的查询。也就是说,研究人员可以询问诸如有多少患者同时患有糖尿病和前列腺癌之类的问题,糖尿病和前列腺癌的信息来自不同的临床登记处。我们通过采用两级匿名化机制来实现我们的解决方案,我们发布可用性数据的工具包首先对个人标识符进行假名处理,然后对敏感属性进行匿名处理。我们的工具包还包括一个基于web的服务器,该服务器存储加密的假名化样本数据,并允许研究人员跨不同的研究数据执行交叉链接查询。我们相信,我们的工具包为支持包含个人标识符的数据的隐私保护发布迈出了第一步。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信