{"title":"Beacon Reconstruction Attack: Reconstruction of genomes in genomic data-sharing beacons using summary statistics.","authors":"Kousar Saleem, A Ercument Cicek, Sinem Sav","doi":"10.1093/bioinformatics/btaf273","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Genomic data sharing beacon protocol, developed by the Global Alliance for Genomics and Health (GA4GH), offers a privacy-preserving mechanism for querying genomic datasets while restricting direct data access. Despite their design, beacons remain vulnerable to privacy attacks. This study introduces a novel privacy vulnerability of the protocol: One can reconstruct large portions of the genomes of all beacon participants by only using the summary statistics reported by the protocol.</p><p><strong>Results: </strong>We introduce a novel optimization-based algorithm that leverages beacon responses and single nucleotide polymorphism (SNP) correlations for reconstruction. By optimizing for the SNP correlations and allele frequencies, the proposed approach achieves genome reconstruction with a substantially higher F1-score (70%) compared to baseline methods (45%) on beacons generated using individuals from the HapMap and OpenSNP datasets. We show that reconstructed genomes can be used by downstream applications such as in membership inference attacks against other beacons. Our findings reveal that beacons releasing allele frequencies substantially increases the reconstruction risk, underscoring the need for enhanced privacy-preserving mechanisms to protect genomic data.</p><p><strong>Availability and implementation: </strong>Our implementation is available at https://github.com/ASAP-Bilkent/Beacon-Reconstruction-Attack.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Motivation: Genomic data sharing beacon protocol, developed by the Global Alliance for Genomics and Health (GA4GH), offers a privacy-preserving mechanism for querying genomic datasets while restricting direct data access. Despite their design, beacons remain vulnerable to privacy attacks. This study introduces a novel privacy vulnerability of the protocol: One can reconstruct large portions of the genomes of all beacon participants by only using the summary statistics reported by the protocol.
Results: We introduce a novel optimization-based algorithm that leverages beacon responses and single nucleotide polymorphism (SNP) correlations for reconstruction. By optimizing for the SNP correlations and allele frequencies, the proposed approach achieves genome reconstruction with a substantially higher F1-score (70%) compared to baseline methods (45%) on beacons generated using individuals from the HapMap and OpenSNP datasets. We show that reconstructed genomes can be used by downstream applications such as in membership inference attacks against other beacons. Our findings reveal that beacons releasing allele frequencies substantially increases the reconstruction risk, underscoring the need for enhanced privacy-preserving mechanisms to protect genomic data.
Availability and implementation: Our implementation is available at https://github.com/ASAP-Bilkent/Beacon-Reconstruction-Attack.
Supplementary information: Supplementary data are available at Bioinformatics online.