AggEnhance: Aggregation Enhancement by Class Interior Points in Federated Learning with Non-IID Data

ACM Transactions on Intelligent Systems and Technology (TIST) Pub Date : 2022-06-16 DOI:10.1145/3544495

Jinxiang Ou, Yunheng Shen, Feng Wang, Qiao Liu, Xuegong Zhang, Hairong Lv

{"title":"AggEnhance: Aggregation Enhancement by Class Interior Points in Federated Learning with Non-IID Data","authors":"Jinxiang Ou, Yunheng Shen, Feng Wang, Qiao Liu, Xuegong Zhang, Hairong Lv","doi":"10.1145/3544495","DOIUrl":null,"url":null,"abstract":"Federated learning (FL) is a privacy-preserving paradigm for multi-institutional collaborations, where the aggregation is an essential procedure after training on the local datasets. Conventional aggregation algorithms often apply a weighted averaging of the updates generated from distributed machines to update the global model. However, while the data distributions are non-IID, the large discrepancy between the local updates might lead to a poor averaged result and a lower convergence speed, i.e., more iterations required to achieve a certain performance. To solve this problem, this article proposes a novel method named AggEnhance for enhancing the aggregation, where we synthesize a group of reliable samples from the local models and tune the aggregated result on them. These samples, named class interior points (CIPs) in this work, bound the relevant decision boundaries that ensure the performance of aggregated result. To the best of our knowledge, this is the first work to explicitly design an enhancing method for the aggregation in prevailing FL pipelines. A series of experiments on real data demonstrate that our method has noticeable improvements of the convergence in non-IID scenarios. In particular, our approach reduces the iterations by 31.87% on average for the CIFAR10 dataset and 43.90% for the PASCAL VOC dataset. Since our method does not modify other procedures of FL pipelines, it is easy to apply to most existing FL frameworks. Furthermore, it does not require additional data transmitted from the local clients to the global server, thus holding the same security level as the original FL algorithms.","PeriodicalId":123526,"journal":{"name":"ACM Transactions on Intelligent Systems and Technology (TIST)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Intelligent Systems and Technology (TIST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3544495","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Federated learning (FL) is a privacy-preserving paradigm for multi-institutional collaborations, where the aggregation is an essential procedure after training on the local datasets. Conventional aggregation algorithms often apply a weighted averaging of the updates generated from distributed machines to update the global model. However, while the data distributions are non-IID, the large discrepancy between the local updates might lead to a poor averaged result and a lower convergence speed, i.e., more iterations required to achieve a certain performance. To solve this problem, this article proposes a novel method named AggEnhance for enhancing the aggregation, where we synthesize a group of reliable samples from the local models and tune the aggregated result on them. These samples, named class interior points (CIPs) in this work, bound the relevant decision boundaries that ensure the performance of aggregated result. To the best of our knowledge, this is the first work to explicitly design an enhancing method for the aggregation in prevailing FL pipelines. A series of experiments on real data demonstrate that our method has noticeable improvements of the convergence in non-IID scenarios. In particular, our approach reduces the iterations by 31.87% on average for the CIFAR10 dataset and 43.90% for the PASCAL VOC dataset. Since our method does not modify other procedures of FL pipelines, it is easy to apply to most existing FL frameworks. Furthermore, it does not require additional data transmitted from the local clients to the global server, thus holding the same security level as the original FL algorithms.

查看原文本刊更多论文

AggEnhance:非iid数据联邦学习中类内点的聚合增强

联邦学习(FL)是一种多机构协作的隐私保护范例，其中聚合是在本地数据集上训练后的基本过程。传统的聚合算法通常对分布式机器生成的更新进行加权平均，以更新全局模型。然而，当数据分布是非iid时，本地更新之间的较大差异可能导致平均结果较差，收敛速度较慢，即需要更多的迭代才能达到一定的性能。为了解决这个问题，本文提出了一种名为AggEnhance的新方法来增强聚合，我们从局部模型中合成一组可靠的样本，并对它们的聚合结果进行调优。这些样本在本文中被称为类内点(class interior points, cip)，它们约束了相关的决策边界，从而保证了聚合结果的性能。据我们所知，这是第一次明确设计一种增强方法来增强当前FL管道中的聚合。在实际数据上的一系列实验表明，我们的方法在非iid场景下的收敛性有明显的提高。特别是，我们的方法平均减少了CIFAR10数据集的31.87%和PASCAL VOC数据集的43.90%的迭代。由于我们的方法不需要修改FL管道的其他程序，因此很容易适用于大多数现有的FL框架。此外，它不需要从本地客户端向全局服务器传输额外的数据，因此具有与原始FL算法相同的安全级别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Intelligent Systems and Technology (TIST)

自引率

0.00%

发文量