PGB: Benchmarking Differentially Private Synthetic Graph Generation Algorithms

arXiv - CS - Databases Pub Date : 2024-08-06 DOI:arxiv-2408.02928

Shang Liu, Hao Du, Yang Cao, Bo Yan, Jinfei Liu, Masatoshi Yoshikawa

{"title":"PGB: Benchmarking Differentially Private Synthetic Graph Generation Algorithms","authors":"Shang Liu, Hao Du, Yang Cao, Bo Yan, Jinfei Liu, Masatoshi Yoshikawa","doi":"arxiv-2408.02928","DOIUrl":null,"url":null,"abstract":"Differentially private graph analysis is a powerful tool for deriving\ninsights from diverse graph data while protecting individual information.\nDesigning private analytic algorithms for different graph queries often\nrequires starting from scratch. In contrast, differentially private synthetic\ngraph generation offers a general paradigm that supports one-time generation\nfor multiple queries. Although a rich set of differentially private graph\ngeneration algorithms has been proposed, comparing them effectively remains\nchallenging due to various factors, including differing privacy definitions,\ndiverse graph datasets, varied privacy requirements, and multiple utility\nmetrics. To this end, we propose PGB (Private Graph Benchmark), a comprehensive\nbenchmark designed to enable researchers to compare differentially private\ngraph generation algorithms fairly. We begin by identifying four essential\nelements of existing works as a 4-tuple: mechanisms, graph datasets, privacy\nrequirements, and utility metrics. We discuss principles regarding these\nelements to ensure the comprehensiveness of a benchmark. Next, we present a\nbenchmark instantiation that adheres to all principles, establishing a new\nmethod to evaluate existing and newly proposed graph generation algorithms.\nThrough extensive theoretical and empirical analysis, we gain valuable insights\ninto the strengths and weaknesses of prior algorithms. Our results indicate\nthat there is no universal solution for all possible cases. Finally, we provide\nguidelines to help researchers select appropriate mechanisms for various\nscenarios.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":"14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02928","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Differentially private graph analysis is a powerful tool for deriving insights from diverse graph data while protecting individual information. Designing private analytic algorithms for different graph queries often requires starting from scratch. In contrast, differentially private synthetic graph generation offers a general paradigm that supports one-time generation for multiple queries. Although a rich set of differentially private graph generation algorithms has been proposed, comparing them effectively remains challenging due to various factors, including differing privacy definitions, diverse graph datasets, varied privacy requirements, and multiple utility metrics. To this end, we propose PGB (Private Graph Benchmark), a comprehensive benchmark designed to enable researchers to compare differentially private graph generation algorithms fairly. We begin by identifying four essential elements of existing works as a 4-tuple: mechanisms, graph datasets, privacy requirements, and utility metrics. We discuss principles regarding these elements to ensure the comprehensiveness of a benchmark. Next, we present a benchmark instantiation that adheres to all principles, establishing a new method to evaluate existing and newly proposed graph generation algorithms. Through extensive theoretical and empirical analysis, we gain valuable insights into the strengths and weaknesses of prior algorithms. Our results indicate that there is no universal solution for all possible cases. Finally, we provide guidelines to help researchers select appropriate mechanisms for various scenarios.

查看原文本刊更多论文

PGB：差异化私有合成图生成算法基准测试

为不同的图查询设计私有分析算法往往需要从头开始。相比之下，差异化私有合成图生成提供了一种通用范式，支持一次性生成多个查询。虽然已经提出了丰富的差异化私有图生成算法，但由于隐私定义不同、图数据集不同、隐私要求不同以及实用性指标不同等各种因素，对这些算法进行有效比较仍然是一项挑战。为此，我们提出了 PGB（隐私图基准），这是一个综合性基准，旨在帮助研究人员公平地比较不同的隐私图生成算法。我们首先确定了现有工作的四个基本要素：机制、图数据集、隐私要求和效用度量。我们讨论了有关这些要素的原则，以确保基准的全面性。通过广泛的理论和实证分析，我们对现有算法的优缺点有了宝贵的认识。我们的结果表明，并不存在适用于所有可能情况的通用解决方案。最后，我们提供了指导原则，帮助研究人员针对各种情况选择合适的机制。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Databases

自引率

0.00%

发文量