On the Trade-off Between Benefit and Contribution for Clients in Federated Learning in Healthcare

Christoph Düsing, P. Cimiano
{"title":"On the Trade-off Between Benefit and Contribution for Clients in Federated Learning in Healthcare","authors":"Christoph Düsing, P. Cimiano","doi":"10.1109/ICMLA55696.2022.00257","DOIUrl":null,"url":null,"abstract":"Federated Learning (FL) is a learning paradigm that allows clients to profit from the data that is available across multiple clients to train a joint model. As FL allows to train such a joint model without explicitly sharing data, but only sharing model updates, it has attained popularity in healthcare settings where patient data is subject to strict privacy policies and needs to be locally stored at each hospital or healthcare provider. A particular challenge for FL settings is data imbalance across clients, as it has been found to be detrimental to model performance and impact the influence of each client on the learning process. Unfortunately, the healthcare domain is particularly prone to such imbalanced data due to regional differences in disease management, prescription behavior etc. In this paper, we introduce the two novel metrics Benefit and Contribution to quantify to which degree individual clients benefit from participation in FL and how they contribute to its success, respectively. Therefore, we measure Benefit and Contribution with respect to four types of imbalances present in data at each client side. Our results show that both client Benefit and Contribution are influenced by data imbalance in such a way that high imbalance in data quantity, label distribution and feature distribution reduces or nullifies clients’ Benefit while increasing their Contribution. Thus, the most valuable clients within a cohort benefit the least from their participation, exposing a critical thread to the success of clinical FL cohorts by withdrawing participation.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"263 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA55696.2022.00257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Federated Learning (FL) is a learning paradigm that allows clients to profit from the data that is available across multiple clients to train a joint model. As FL allows to train such a joint model without explicitly sharing data, but only sharing model updates, it has attained popularity in healthcare settings where patient data is subject to strict privacy policies and needs to be locally stored at each hospital or healthcare provider. A particular challenge for FL settings is data imbalance across clients, as it has been found to be detrimental to model performance and impact the influence of each client on the learning process. Unfortunately, the healthcare domain is particularly prone to such imbalanced data due to regional differences in disease management, prescription behavior etc. In this paper, we introduce the two novel metrics Benefit and Contribution to quantify to which degree individual clients benefit from participation in FL and how they contribute to its success, respectively. Therefore, we measure Benefit and Contribution with respect to four types of imbalances present in data at each client side. Our results show that both client Benefit and Contribution are influenced by data imbalance in such a way that high imbalance in data quantity, label distribution and feature distribution reduces or nullifies clients’ Benefit while increasing their Contribution. Thus, the most valuable clients within a cohort benefit the least from their participation, exposing a critical thread to the success of clinical FL cohorts by withdrawing participation.
医疗保健联合学习中客户利益与贡献的权衡
联邦学习(FL)是一种学习范例,它允许客户从跨多个客户端可用的数据中获利,以训练联合模型。由于FL允许在不显式共享数据的情况下训练这样的联合模型,而只共享模型更新,因此它在患者数据受严格隐私政策约束并且需要在每个医院或医疗保健提供商本地存储的医疗保健环境中得到了普及。FL设置的一个特殊挑战是客户机之间的数据不平衡,因为它已被发现对模型性能有害,并影响每个客户机对学习过程的影响。不幸的是,由于疾病管理、处方行为等方面的地区差异,医疗保健领域特别容易出现这种数据不平衡。在本文中,我们引入了两个新的指标,分别量化个人客户从参与FL中受益的程度以及他们如何为其成功做出贡献。因此,我们根据每个客户端数据中存在的四种失衡类型来衡量收益和贡献。我们的研究结果表明,客户的利益和贡献都受到数据不平衡的影响,数据数量、标签分布和特征分布的高度不平衡会降低或抵消客户的利益,同时增加客户的贡献。因此,队列中最有价值的客户从他们的参与中获益最少,通过退出参与暴露了临床FL队列成功的关键线索。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信