Cross-silo Federated Learning with Record-level Personalized Differential Privacy.

Conference on Computer and Communications Security : proceedings of the ... conference on computer and communications security. ACM Conference on Computer and Communications Security Pub Date : 2024-10-01 Epub Date: 2024-12-09 DOI:10.1145/3658644.3670351

Junxu Liu, Jian Lou, Li Xiong, Jinfei Liu, Xiaofeng Meng

{"title":"Cross-silo Federated Learning with Record-level Personalized Differential Privacy.","authors":"Junxu Liu, Jian Lou, Li Xiong, Jinfei Liu, Xiaofeng Meng","doi":"10.1145/3658644.3670351","DOIUrl":null,"url":null,"abstract":"Federated learning (FL) enhanced by differential privacy has emerged as a popular approach to better safeguard the privacy of client-side data by protecting clients' contributions during the training process. Existing solutions typically assume a uniform privacy budget for all records and provide one-size-fits-all solutions that may not be adequate to meet each record's privacy requirement. In this paper, we explore the uncharted territory of cross-silo FL with record-level personalized differential privacy. We devise a novel framework named rPDP-FL, employing a two-stage hybrid sampling scheme with both uniform client-level sampling and non-uniform record-level sampling to accommodate varying privacy requirements. A critical and non-trivial problem is how to determine the ideal per-record sampling probability <math><mi>q</mi></math> given the personalized privacy budget <math><mi>ε</mi></math> . We introduce a versatile solution named Simulation-CurveFitting, allowing us to uncover a significant insight into the nonlinear correlation between <math><mi>q</mi></math> and <math><mi>ε</mi></math> and derive an elegant mathematical model to tackle the problem. Our evaluation demonstrates that our solution can provide significant performance gains over the baselines that do not consider personalized privacy preservation.","PeriodicalId":72687,"journal":{"name":"Conference on Computer and Communications Security : proceedings of the ... conference on computer and communications security. ACM Conference on Computer and Communications Security","volume":"2024 ","pages":"303-317"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12241667/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Computer and Communications Security : proceedings of the ... conference on computer and communications security. ACM Conference on Computer and Communications Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3658644.3670351","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/9 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Federated learning (FL) enhanced by differential privacy has emerged as a popular approach to better safeguard the privacy of client-side data by protecting clients' contributions during the training process. Existing solutions typically assume a uniform privacy budget for all records and provide one-size-fits-all solutions that may not be adequate to meet each record's privacy requirement. In this paper, we explore the uncharted territory of cross-silo FL with record-level personalized differential privacy. We devise a novel framework named rPDP-FL, employing a two-stage hybrid sampling scheme with both uniform client-level sampling and non-uniform record-level sampling to accommodate varying privacy requirements. A critical and non-trivial problem is how to determine the ideal per-record sampling probability $q$ given the personalized privacy budget $ε$ . We introduce a versatile solution named Simulation-CurveFitting, allowing us to uncover a significant insight into the nonlinear correlation between $q$ and $ε$ and derive an elegant mathematical model to tackle the problem. Our evaluation demonstrates that our solution can provide significant performance gains over the baselines that do not consider personalized privacy preservation.

查看原文本刊更多论文

具有记录级个性化差异隐私的跨竖井联合学习。

通过差分隐私增强的联邦学习（FL）已经成为一种流行的方法，通过在训练过程中保护客户的贡献来更好地保护客户端数据的隐私。现有的解决方案通常为所有记录假定统一的隐私预算，并提供可能不足以满足每个记录隐私需求的“一刀切”解决方案。在本文中，我们探索了具有记录级个性化差异隐私的跨筒仓FL的未知领域。我们设计了一个名为rPDP-FL的新框架，采用两阶段混合采样方案，包括统一的客户端级采样和非统一的记录级采样，以适应不同的隐私需求。在给定个性化隐私预算ε的情况下，如何确定理想的每条记录抽样概率q是一个重要的问题。我们介绍了一个名为Simulation-CurveFitting的通用解决方案，使我们能够揭示q和ε之间非线性相关性的重要见解，并推导出一个优雅的数学模型来解决这个问题。我们的评估表明，与不考虑个性化隐私保护的基线相比，我们的解决方案可以提供显著的性能提升。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Conference on Computer and Communications Security : proceedings of the ... conference on computer and communications security. ACM Conference on Computer and Communications Security

CiteScore

9.20

自引率

0.00%

发文量