个人标识符发布样本可用性数据的隐私保护

Journal of medical and bioengineering Pub Date : 1900-01-01 DOI:10.12720/JOMB.4.2.117-125

A. Gholami, E. Laure, P. Somogyi, O. Spjuth, Salman Niazi, J. Dowling

{"title":"个人标识符发布样本可用性数据的隐私保护","authors":"A. Gholami, E. Laure, P. Somogyi, O. Spjuth, Salman Niazi, J. Dowling","doi":"10.12720/JOMB.4.2.117-125","DOIUrl":null,"url":null,"abstract":"Medical organizations collect, store and process vast amounts of sensitive information about patients. Easy access to this information by researchers is crucial to improving medical research, but in many institutions, cumbersome security measures and walled-gardens have created a situation where even information about what medical data is out there is not available. One of the main security challenges in this area, is enabling researchers to cross-link different medical studies, while preserving the privacy of the patients involved. In this paper, we introduce a privacy-preserving system for publishing sample availability data that allows researchers to make queries that crosscut different studies. That is, researchers can ask questions such as how many patients have had both diabetes and prostate cancer, where the diabetes and prostate cancer information originates from different clinical registries. We realize our solution by having a two-level anonymiziation mechanism, where our toolkit for publishing availability data first pseudonymizes personal identifiers and then anonymizes sensitive attributes. Our toolkit also includes a web-based server that stores the encrypted pseudonymized sample data and allows researchers to execute cross-linked queries across different study data. We believe that our toolkit contributes a first step to support the privacy preserving publication of data containing personal identifiers.","PeriodicalId":437476,"journal":{"name":"Journal of medical and bioengineering","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Privacy-Preservation for Publishing Sample Availability Data with Personal Identifiers\",\"authors\":\"A. Gholami, E. Laure, P. Somogyi, O. Spjuth, Salman Niazi, J. Dowling\",\"doi\":\"10.12720/JOMB.4.2.117-125\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Medical organizations collect, store and process vast amounts of sensitive information about patients. Easy access to this information by researchers is crucial to improving medical research, but in many institutions, cumbersome security measures and walled-gardens have created a situation where even information about what medical data is out there is not available. One of the main security challenges in this area, is enabling researchers to cross-link different medical studies, while preserving the privacy of the patients involved. In this paper, we introduce a privacy-preserving system for publishing sample availability data that allows researchers to make queries that crosscut different studies. That is, researchers can ask questions such as how many patients have had both diabetes and prostate cancer, where the diabetes and prostate cancer information originates from different clinical registries. We realize our solution by having a two-level anonymiziation mechanism, where our toolkit for publishing availability data first pseudonymizes personal identifiers and then anonymizes sensitive attributes. Our toolkit also includes a web-based server that stores the encrypted pseudonymized sample data and allows researchers to execute cross-linked queries across different study data. We believe that our toolkit contributes a first step to support the privacy preserving publication of data containing personal identifiers.\",\"PeriodicalId\":437476,\"journal\":{\"name\":\"Journal of medical and bioengineering\",\"volume\":\"73 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of medical and bioengineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12720/JOMB.4.2.117-125\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of medical and bioengineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12720/JOMB.4.2.117-125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

摘要

医疗机构收集、存储和处理大量关于患者的敏感信息。研究人员方便地获取这些信息对于改善医学研究至关重要，但在许多机构中，繁琐的安全措施和封闭的花园造成了一种情况，即甚至无法获得有关医疗数据的信息。这一领域的主要安全挑战之一是使研究人员能够交叉链接不同的医学研究，同时保护所涉及患者的隐私。在本文中，我们介绍了一个用于发布样本可用性数据的隐私保护系统，该系统允许研究人员进行横切不同研究的查询。也就是说，研究人员可以询问诸如有多少患者同时患有糖尿病和前列腺癌之类的问题，糖尿病和前列腺癌的信息来自不同的临床登记处。我们通过采用两级匿名化机制来实现我们的解决方案，我们发布可用性数据的工具包首先对个人标识符进行假名处理，然后对敏感属性进行匿名处理。我们的工具包还包括一个基于web的服务器，该服务器存储加密的假名化样本数据，并允许研究人员跨不同的研究数据执行交叉链接查询。我们相信，我们的工具包为支持包含个人标识符的数据的隐私保护发布迈出了第一步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Privacy-Preservation for Publishing Sample Availability Data with Personal Identifiers

Medical organizations collect, store and process vast amounts of sensitive information about patients. Easy access to this information by researchers is crucial to improving medical research, but in many institutions, cumbersome security measures and walled-gardens have created a situation where even information about what medical data is out there is not available. One of the main security challenges in this area, is enabling researchers to cross-link different medical studies, while preserving the privacy of the patients involved. In this paper, we introduce a privacy-preserving system for publishing sample availability data that allows researchers to make queries that crosscut different studies. That is, researchers can ask questions such as how many patients have had both diabetes and prostate cancer, where the diabetes and prostate cancer information originates from different clinical registries. We realize our solution by having a two-level anonymiziation mechanism, where our toolkit for publishing availability data first pseudonymizes personal identifiers and then anonymizes sensitive attributes. Our toolkit also includes a web-based server that stores the encrypted pseudonymized sample data and allows researchers to execute cross-linked queries across different study data. We believe that our toolkit contributes a first step to support the privacy preserving publication of data containing personal identifiers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of medical and bioengineering

自引率

0.00%

发文量