面向基于原型的分散学习的有效压缩和通信

IF 7.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Soft Computing Pub Date : 2025-05-28 DOI:10.1016/j.asoc.2025.113270

Pablo Fernández-Piñeiro , Manuel Fernández-Veiga , Rebeca P. Díaz-Redondo , Ana Fernández-Vilas , Martín González Soto

{"title":"面向基于原型的分散学习的有效压缩和通信","authors":"Pablo Fernández-Piñeiro , Manuel Fernández-Veiga , Rebeca P. Díaz-Redondo , Ana Fernández-Vilas , Martín González Soto","doi":"10.1016/j.asoc.2025.113270","DOIUrl":null,"url":null,"abstract":"<div><div>In prototype-based federated learning, the exchange of model parameters between clients and the master server is replaced by transmission of prototypes or quantized versions of the data samples to the aggregation server. A fully decentralized deployment of prototype-based learning, without a central aggregator of prototypes, is more robust upon network failures and reacts faster to changes in the statistical distribution of the data, suggesting potential advantages and quick adaptation in dynamic learning tasks, e.g., when the data sources are IoT devices or when data is non-iid. In this paper, we face the challenge of designing an efficient prototype-based decentralized learning network by reducing the overheads in communication and computation. This allows enhancing the scalability of the global system, specially for IoT settings with resource-limited devices. First, we compress the prototype size by applying a clustering algorithm. After that, we filter the prototypes to be disseminate using an information-theoretic measure to share only relevant models or models that provide new knowledge to their neighbors. Then, we define a parallel gossip algorithm to disseminate these models within the learning network. Finally, we define a suitable scheduler able to manage the set of prototypes received to optimize the aggregation phase. In order to validate our proposal we present an analysis of the parallel gossip algorithm regarding the age-of-information (AoI). Our experimental results show the communications load can be substantially reduced without decreasing the convergence rate of the learning algorithm.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"178 ","pages":"Article 113270"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards efficient compression and communication for prototype-based decentralized learning\",\"authors\":\"Pablo Fernández-Piñeiro , Manuel Fernández-Veiga , Rebeca P. Díaz-Redondo , Ana Fernández-Vilas , Martín González Soto\",\"doi\":\"10.1016/j.asoc.2025.113270\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In prototype-based federated learning, the exchange of model parameters between clients and the master server is replaced by transmission of prototypes or quantized versions of the data samples to the aggregation server. A fully decentralized deployment of prototype-based learning, without a central aggregator of prototypes, is more robust upon network failures and reacts faster to changes in the statistical distribution of the data, suggesting potential advantages and quick adaptation in dynamic learning tasks, e.g., when the data sources are IoT devices or when data is non-iid. In this paper, we face the challenge of designing an efficient prototype-based decentralized learning network by reducing the overheads in communication and computation. This allows enhancing the scalability of the global system, specially for IoT settings with resource-limited devices. First, we compress the prototype size by applying a clustering algorithm. After that, we filter the prototypes to be disseminate using an information-theoretic measure to share only relevant models or models that provide new knowledge to their neighbors. Then, we define a parallel gossip algorithm to disseminate these models within the learning network. Finally, we define a suitable scheduler able to manage the set of prototypes received to optimize the aggregation phase. In order to validate our proposal we present an analysis of the parallel gossip algorithm regarding the age-of-information (AoI). Our experimental results show the communications load can be substantially reduced without decreasing the convergence rate of the learning algorithm.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"178 \",\"pages\":\"Article 113270\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625005812\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625005812","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

在基于原型的联邦学习中，客户端和主服务器之间模型参数的交换被传输原型或数据样本的量化版本到聚合服务器所取代。基于原型的学习的完全分散部署，没有原型的中央聚合器，在网络故障时更健壮，对数据统计分布的变化反应更快，这表明了动态学习任务的潜在优势和快速适应，例如，当数据源是物联网设备或数据是非iid时。在本文中，我们面临的挑战是通过减少通信和计算开销来设计一个高效的基于原型的分散学习网络。这可以增强全局系统的可扩展性，特别是对于资源有限设备的物联网设置。首先，采用聚类算法压缩原型尺寸。在此之后，我们使用信息论方法过滤要传播的原型，只共享相关模型或向其邻居提供新知识的模型。然后，我们定义了一种并行八卦算法来在学习网络中传播这些模型。最后，我们定义了一个合适的调度器，该调度器能够管理接收到的原型集，以优化聚合阶段。为了验证我们的建议，我们提出了一个关于信息年龄（AoI）的并行八卦算法的分析。实验结果表明，在不降低学习算法收敛速度的前提下，可以大大降低通信负荷。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Towards efficient compression and communication for prototype-based decentralized learning

In prototype-based federated learning, the exchange of model parameters between clients and the master server is replaced by transmission of prototypes or quantized versions of the data samples to the aggregation server. A fully decentralized deployment of prototype-based learning, without a central aggregator of prototypes, is more robust upon network failures and reacts faster to changes in the statistical distribution of the data, suggesting potential advantages and quick adaptation in dynamic learning tasks, e.g., when the data sources are IoT devices or when data is non-iid. In this paper, we face the challenge of designing an efficient prototype-based decentralized learning network by reducing the overheads in communication and computation. This allows enhancing the scalability of the global system, specially for IoT settings with resource-limited devices. First, we compress the prototype size by applying a clustering algorithm. After that, we filter the prototypes to be disseminate using an information-theoretic measure to share only relevant models or models that provide new knowledge to their neighbors. Then, we define a parallel gossip algorithm to disseminate these models within the learning network. Finally, we define a suitable scheduler able to manage the set of prototypes received to optimize the aggregation phase. In order to validate our proposal we present an analysis of the parallel gossip algorithm regarding the age-of-information (AoI). Our experimental results show the communications load can be substantially reduced without decreasing the convergence rate of the learning algorithm.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.