MAGICPL: A Generic Process Description Language for Distributed Pseudonymization Scenarios.

IF 1.3 4区医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Methods of Information in Medicine Pub Date : 2021-05-01 Epub Date: 2021-07-05 DOI:10.1055/s-0041-1731387

Galina Tremper, Torben Brenner, Florian Stampe, Andreas Borg, Martin Bialke, David Croft, Esther Schmidt, Martin Lablans

{"title":"MAGICPL: A Generic Process Description Language for Distributed Pseudonymization Scenarios.","authors":"Galina Tremper, Torben Brenner, Florian Stampe, Andreas Borg, Martin Bialke, David Croft, Esther Schmidt, Martin Lablans","doi":"10.1055/s-0041-1731387","DOIUrl":null,"url":null,"abstract":"Objectives: Pseudonymization is an important aspect of projects dealing with sensitive patient data. Most projects build their own specialized, hard-coded, solutions. However, these overlap in many aspects of their functionality. As any re-implementation binds resources, we would like to propose a solution that facilitates and encourages the reuse of existing components.Methods: We analyzed already-established data protection concepts to gain an insight into their common features and the ways in which their components were linked together. We found that we could represent these pseudonymization processes with a simple descriptive language, which we have called MAGICPL, plus a relatively small set of components. We designed MAGICPL as an XML-based language, to make it human-readable and accessible to nonprogrammers. Additionally, a prototype implementation of the components was written in Java. MAGICPL makes it possible to reference the components using their class names, making it easy to extend or exchange the component set. Furthermore, there is a simple HTTP application programming interface (API) that runs the tasks and allows other systems to communicate with the pseudonymization process.Results: MAGICPL has been used in at least three projects, including the re-implementation of the pseudonymization process of the German Cancer Consortium, clinical data flows in a large-scale translational research network (National Network Genomic Medicine), and for our own institute's pseudonymization service.Conclusions: Putting our solution into productive use at both our own institute and at our partner sites facilitated a reduction in the time and effort required to build pseudonymization pipelines in medical research.","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"60 1-02","pages":"21-31"},"PeriodicalIF":1.3000,"publicationDate":"2021-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods of Information in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1055/s-0041-1731387","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/7/5 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 2

Abstract

Objectives: Pseudonymization is an important aspect of projects dealing with sensitive patient data. Most projects build their own specialized, hard-coded, solutions. However, these overlap in many aspects of their functionality. As any re-implementation binds resources, we would like to propose a solution that facilitates and encourages the reuse of existing components.

Methods: We analyzed already-established data protection concepts to gain an insight into their common features and the ways in which their components were linked together. We found that we could represent these pseudonymization processes with a simple descriptive language, which we have called MAGICPL, plus a relatively small set of components. We designed MAGICPL as an XML-based language, to make it human-readable and accessible to nonprogrammers. Additionally, a prototype implementation of the components was written in Java. MAGICPL makes it possible to reference the components using their class names, making it easy to extend or exchange the component set. Furthermore, there is a simple HTTP application programming interface (API) that runs the tasks and allows other systems to communicate with the pseudonymization process.

Results: MAGICPL has been used in at least three projects, including the re-implementation of the pseudonymization process of the German Cancer Consortium, clinical data flows in a large-scale translational research network (National Network Genomic Medicine), and for our own institute's pseudonymization service.

Conclusions: Putting our solution into productive use at both our own institute and at our partner sites facilitated a reduction in the time and effort required to build pseudonymization pipelines in medical research.

查看原文本刊更多论文

MAGICPL:分布式假名场景的通用进程描述语言。

目的:假名化是处理敏感患者数据项目的一个重要方面。大多数项目都构建自己专门的、硬编码的解决方案。然而，它们在功能的许多方面是重叠的。由于任何重新实现都会绑定资源，我们希望提出一种促进并鼓励重用现有组件的解决方案。方法:我们分析了已经建立的数据保护概念，以深入了解它们的共同特征以及它们的组成部分联系在一起的方式。我们发现，我们可以用一种简单的描述性语言来表示这些假名化过程，我们称之为MAGICPL，再加上一组相对较小的组件。我们将MAGICPL设计为一种基于xml的语言，使其易于人类阅读，非程序员也可以访问。此外，组件的原型实现是用Java编写的。MAGICPL允许使用组件的类名来引用组件，从而很容易扩展或交换组件集。此外，还有一个简单的HTTP应用程序编程接口(API)来运行任务，并允许其他系统与假名化过程通信。结果:MAGICPL已在至少三个项目中使用，包括德国癌症联盟的假名过程的重新实施，大规模转化研究网络(国家网络基因组医学)的临床数据流，以及我们自己研究所的假名服务。结论:将我们的解决方案在我们自己的研究所和我们的合作伙伴站点投入生产使用，有助于减少在医学研究中建立假名管道所需的时间和精力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Methods of Information in Medicine 医学-计算机：信息系统

CiteScore

3.70

自引率

11.80%

发文量

审稿时长

6-12 weeks

期刊介绍： Good medicine and good healthcare demand good information. Since the journal''s founding in 1962, Methods of Information in Medicine has stressed the methodology and scientific fundamentals of organizing, representing and analyzing data, information and knowledge in biomedicine and health care. Covering publications in the fields of biomedical and health informatics, medical biometry, and epidemiology, the journal publishes original papers, reviews, reports, opinion papers, editorials, and letters to the editor. From time to time, the journal publishes articles on particular focus themes as part of a journal''s issue.