Cluster Based Secure Multi-Party Computation in Federated Learning for Histopathology Images

S. M. Hosseini, Milad Sikaroudi, Morteza Babaie, H. Tizhoosh
{"title":"Cluster Based Secure Multi-Party Computation in Federated Learning for Histopathology Images","authors":"S. M. Hosseini, Milad Sikaroudi, Morteza Babaie, H. Tizhoosh","doi":"10.48550/arXiv.2208.10919","DOIUrl":null,"url":null,"abstract":". Federated learning (FL) is a decentralized method enabling hospitals to collaboratively learn a model without sharing private pa-tient data for training. In FL, participant hospitals periodically exchange training results rather than training samples with a central server. How-ever, having access to model parameters or gradients can expose private training data samples. To address this challenge, we adopt secure multiparty computation (SMC) to establish a privacy-preserving federated learning framework. In our proposed method, the hospitals are divided into clusters. After local training, each hospital splits its model weights among other hospitals in the same cluster such that no single hospital can retrieve other hospitals’ weights on its own. Then, all hospitals sum up the received weights, sending the results to the central server. Fi-nally, the central server aggregates the results, retrieving the average of models’ weights and updating the model without having access to individual hospitals’ weights. We conduct experiments on a publicly available repository, The Cancer Genome Atlas (TCGA). We compare the performance of the proposed framework with differential privacy and federated averaging as the baseline. The results reveal that compared to differential privacy, our framework can achieve higher accuracy with no privacy leakage risk at a cost of higher communication overhead.","PeriodicalId":347091,"journal":{"name":"DeCaF/FAIR@MICCAI","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DeCaF/FAIR@MICCAI","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2208.10919","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

. Federated learning (FL) is a decentralized method enabling hospitals to collaboratively learn a model without sharing private pa-tient data for training. In FL, participant hospitals periodically exchange training results rather than training samples with a central server. How-ever, having access to model parameters or gradients can expose private training data samples. To address this challenge, we adopt secure multiparty computation (SMC) to establish a privacy-preserving federated learning framework. In our proposed method, the hospitals are divided into clusters. After local training, each hospital splits its model weights among other hospitals in the same cluster such that no single hospital can retrieve other hospitals’ weights on its own. Then, all hospitals sum up the received weights, sending the results to the central server. Fi-nally, the central server aggregates the results, retrieving the average of models’ weights and updating the model without having access to individual hospitals’ weights. We conduct experiments on a publicly available repository, The Cancer Genome Atlas (TCGA). We compare the performance of the proposed framework with differential privacy and federated averaging as the baseline. The results reveal that compared to differential privacy, our framework can achieve higher accuracy with no privacy leakage risk at a cost of higher communication overhead.
组织病理学图像联邦学习中基于聚类的安全多方计算
. 联邦学习(FL)是一种分散的方法,使医院能够协作学习模型,而无需共享用于训练的私人患者数据。在FL中,参与医院定期交换训练结果,而不是与中央服务器交换训练样本。然而,访问模型参数或梯度可能会暴露私有训练数据样本。为了解决这一挑战,我们采用安全多方计算(SMC)来建立一个保护隐私的联邦学习框架。在我们提出的方法中,医院被划分为集群。经过局部训练后,每个医院将自己的模型权值在同一集群的其他医院之间进行分割,这样就没有一家医院可以单独检索其他医院的权值。然后,所有医院汇总接收到的权重,将结果发送到中央服务器。最后,中央服务器汇总结果,检索模型权重的平均值并更新模型,而无需访问各个医院的权重。我们在一个公开可用的存储库——癌症基因组图谱(TCGA)上进行实验。我们将所提出的框架的性能与差分隐私和联邦平均作为基线进行比较。结果表明,与差分隐私相比,我们的框架可以在没有隐私泄露风险的情况下实现更高的准确性,但代价是更高的通信开销。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信