N. William Rayner, Young-Chan Park, Christian Fuchsberger, Andrei Barysenka, Eleftheria Zeggini
{"title":"Toward GDPR compliance with the Helmholtz Munich genotype imputation server","authors":"N. William Rayner, Young-Chan Park, Christian Fuchsberger, Andrei Barysenka, Eleftheria Zeggini","doi":"10.1038/s41588-024-02012-1","DOIUrl":null,"url":null,"abstract":"<p>Genomics has the potential to revolutionize healthcare, empowering personalized disease management, including precision prevention. Genome-wide association studies (GWAS) have been instrumental in generating new biological insights into complex human diseases<sup>1</sup>. The power of GWAS can be increased by increasing sample size through meta-analysis, which requires the imputation and analysis of genotypes that may be untyped across some studies. Imputation relies on the availability of phased haplotype reference panels of whole-genome-sequenced individuals<sup>2</sup>. These are not amenable to sharing with researchers who need to impute their GWAS data, primarily for reasons of data access and security, dataset size, and scale of computing resources required to enable imputation. Imputation servers have, therefore, been developed to provide a solution: researchers upload their genotyped dataset to the imputation server that hosts the reference panels and imputation machinery, where the data are imputed, and then downloaded back to the researchers’ individual local computing environment. There are a number of imputation servers that serve the global community of researchers, including two based in the USA (University of Michigan, https://imputationserver.sph.umich.edu/index.html and TOPMed, https://imputation.biodatacatalyst.nhlbi.nih.gov/), one based in the UK (Wellcome Sanger Institute, https://imputation.sanger.ac.uk/?about=1) and one based at Kiel University in Germany (https://hybridcomputing.ikmb.uni-kiel.de). Here, we have developed a European Union (EU)-based imputation server serving the community at large, based in Munich, Germany (https://imputationserver.helmholtz-munich.de/), to assist users in complying with their General Data Protection Regulation (GDPR) requirements.</p><p>The need for EU-based imputation servers arises from restrictions imposed by GDPR law<sup>3</sup>, a comprehensive data privacy law in the EU. Genetic data are considered a special category of personal data under GDPR, and hence they are subject to strict data sharing rules and safeguards<sup>4</sup>. Uploading of genotype data to imputation servers not residing within the EU or covered by an adequacy agreement constitutes a breach of GDPR, unless explicitly covered in informed consent forms for the respective study. Here, we introduce the Helmholtz Munich Imputation Server, designed to provide a cost-free genotype imputation service in a GDPR-compliant manner for EU-based researchers, as well as for researchers globally.</p>","PeriodicalId":18985,"journal":{"name":"Nature genetics","volume":null,"pages":null},"PeriodicalIF":31.7000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s41588-024-02012-1","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Genomics has the potential to revolutionize healthcare, empowering personalized disease management, including precision prevention. Genome-wide association studies (GWAS) have been instrumental in generating new biological insights into complex human diseases1. The power of GWAS can be increased by increasing sample size through meta-analysis, which requires the imputation and analysis of genotypes that may be untyped across some studies. Imputation relies on the availability of phased haplotype reference panels of whole-genome-sequenced individuals2. These are not amenable to sharing with researchers who need to impute their GWAS data, primarily for reasons of data access and security, dataset size, and scale of computing resources required to enable imputation. Imputation servers have, therefore, been developed to provide a solution: researchers upload their genotyped dataset to the imputation server that hosts the reference panels and imputation machinery, where the data are imputed, and then downloaded back to the researchers’ individual local computing environment. There are a number of imputation servers that serve the global community of researchers, including two based in the USA (University of Michigan, https://imputationserver.sph.umich.edu/index.html and TOPMed, https://imputation.biodatacatalyst.nhlbi.nih.gov/), one based in the UK (Wellcome Sanger Institute, https://imputation.sanger.ac.uk/?about=1) and one based at Kiel University in Germany (https://hybridcomputing.ikmb.uni-kiel.de). Here, we have developed a European Union (EU)-based imputation server serving the community at large, based in Munich, Germany (https://imputationserver.helmholtz-munich.de/), to assist users in complying with their General Data Protection Regulation (GDPR) requirements.
The need for EU-based imputation servers arises from restrictions imposed by GDPR law3, a comprehensive data privacy law in the EU. Genetic data are considered a special category of personal data under GDPR, and hence they are subject to strict data sharing rules and safeguards4. Uploading of genotype data to imputation servers not residing within the EU or covered by an adequacy agreement constitutes a breach of GDPR, unless explicitly covered in informed consent forms for the respective study. Here, we introduce the Helmholtz Munich Imputation Server, designed to provide a cost-free genotype imputation service in a GDPR-compliant manner for EU-based researchers, as well as for researchers globally.
期刊介绍:
Nature Genetics publishes the very highest quality research in genetics. It encompasses genetic and functional genomic studies on human and plant traits and on other model organisms. Current emphasis is on the genetic basis for common and complex diseases and on the functional mechanism, architecture and evolution of gene networks, studied by experimental perturbation.
Integrative genetic topics comprise, but are not limited to:
-Genes in the pathology of human disease
-Molecular analysis of simple and complex genetic traits
-Cancer genetics
-Agricultural genomics
-Developmental genetics
-Regulatory variation in gene expression
-Strategies and technologies for extracting function from genomic data
-Pharmacological genomics
-Genome evolution