A comprehensive survey of federated transfer learning: challenges, methods and applications

IF 4.6 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science Pub Date : 2024-07-23 DOI:10.1007/s11704-024-40065-x

Wei Guo, Fuzhen Zhuang, Xiao Zhang, Yiqi Tong, Jin Dong

{"title":"A comprehensive survey of federated transfer learning: challenges, methods and applications","authors":"Wei Guo, Fuzhen Zhuang, Xiao Zhang, Yiqi Tong, Jin Dong","doi":"10.1007/s11704-024-40065-x","DOIUrl":null,"url":null,"abstract":"<p>Federated learning (FL) is a novel distributed machine learning paradigm that enables participants to collaboratively train a centralized model with privacy preservation by eliminating the requirement of data sharing. In practice, FL often involves multiple participants and requires the third party to aggregate global information to guide the update of the target participant. Therefore, many FL methods do not work well due to the training and test data of each participant may not be sampled from the same feature space and the same underlying distribution. Meanwhile, the differences in their local devices (system heterogeneity), the continuous influx of online data (incremental data), and labeled data scarcity may further influence the performance of these methods. To solve this problem, federated transfer learning (FTL), which integrates transfer learning (TL) into FL, has attracted the attention of numerous researchers. However, since FL enables a continuous share of knowledge among participants with each communication round while not allowing local data to be accessed by other participants, FTL faces many unique challenges that are not present in TL. In this survey, we focus on categorizing and reviewing the current progress on federated transfer learning, and outlining corresponding solutions and applications. Furthermore, the common setting of FTL scenarios, available datasets, and significant related research are summarized in this survey.</p>","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"93 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers of Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11704-024-40065-x","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Federated learning (FL) is a novel distributed machine learning paradigm that enables participants to collaboratively train a centralized model with privacy preservation by eliminating the requirement of data sharing. In practice, FL often involves multiple participants and requires the third party to aggregate global information to guide the update of the target participant. Therefore, many FL methods do not work well due to the training and test data of each participant may not be sampled from the same feature space and the same underlying distribution. Meanwhile, the differences in their local devices (system heterogeneity), the continuous influx of online data (incremental data), and labeled data scarcity may further influence the performance of these methods. To solve this problem, federated transfer learning (FTL), which integrates transfer learning (TL) into FL, has attracted the attention of numerous researchers. However, since FL enables a continuous share of knowledge among participants with each communication round while not allowing local data to be accessed by other participants, FTL faces many unique challenges that are not present in TL. In this survey, we focus on categorizing and reviewing the current progress on federated transfer learning, and outlining corresponding solutions and applications. Furthermore, the common setting of FTL scenarios, available datasets, and significant related research are summarized in this survey.

查看原文本刊更多论文

联合转移学习综合调查：挑战、方法和应用

联合学习（FL）是一种新颖的分布式机器学习范式，它通过消除数据共享要求，使参与者能够在保护隐私的前提下协作训练一个集中模型。在实践中，FL 通常涉及多个参与者，需要第三方汇总全局信息来指导目标参与者的更新。因此，由于每个参与者的训练数据和测试数据可能不是从相同的特征空间和相同的底层分布中采样，许多 FL 方法都不能很好地发挥作用。同时，他们本地设备的差异（系统异质性）、在线数据（增量数据）的不断涌入以及标记数据的稀缺性可能会进一步影响这些方法的性能。为解决这一问题，将迁移学习（TL）集成到 FL 中的联合迁移学习（FTL）吸引了众多研究人员的关注。然而，由于 FL 能够让参与者在每一轮交流中持续共享知识，同时又不允许其他参与者访问本地数据，因此 FTL 面临着许多 TL 中不存在的独特挑战。在本调查报告中，我们将重点对当前联合迁移学习的进展进行分类和回顾，并概述相应的解决方案和应用。此外，本调查还总结了 FTL 的常见场景设置、可用数据集和重要的相关研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Frontiers of Computer Science COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

8.60

自引率

2.40%

发文量

799

审稿时长

6-12 weeks

期刊介绍： Frontiers of Computer Science aims to provide a forum for the publication of peer-reviewed papers to promote rapid communication and exchange between computer scientists. The journal publishes research papers and review articles in a wide range of topics, including: architecture, software, artificial intelligence, theoretical computer science, networks and communication, information systems, multimedia and graphics, information security, interdisciplinary, etc. The journal especially encourages papers from new emerging and multidisciplinary areas, as well as papers reflecting the international trends of research and development and on special topics reporting progress made by Chinese computer scientists.