Privacy and Efficiency of Communications in Federated Split Learning

IF 7.5 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Transactions on Big Data Pub Date : 2023-03-29 DOI:10.1109/TBDATA.2023.3280405

Zongshun Zhang;Andrea Pinto;Valeria Turina;Flavio Esposito;Ibrahim Matta

{"title":"Privacy and Efficiency of Communications in Federated Split Learning","authors":"Zongshun Zhang;Andrea Pinto;Valeria Turina;Flavio Esposito;Ibrahim Matta","doi":"10.1109/TBDATA.2023.3280405","DOIUrl":null,"url":null,"abstract":"Every day, large amounts of sensitive data are distributed across mobile phones, wearable devices, and other sensors. Traditionally, these enormous datasets have been processed on a single system, with complex models being trained to make valuable predictions. Distributed machine learning techniques such as Federated and Split Learning have recently been developed to protect user data and privacy better while ensuring high performance. Both of these distributed learning architectures have advantages and disadvantages. In this article, we examine these tradeoffs and suggest a new hybrid Federated Split Learning architecture that combines the efficiency and privacy benefits of both. Our evaluation demonstrates how our hybrid Federated Split Learning approach can lower the amount of processing power required by each client running a distributed learning system, and reduce training and inference time while keeping a similar accuracy. We also discuss the resiliency of our approach to deep learning privacy inference attacks and compare our solution to other recently proposed benchmarks.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 5","pages":"1380-1391"},"PeriodicalIF":7.5000,"publicationDate":"2023-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Big Data","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10138067/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 6

Abstract

Every day, large amounts of sensitive data are distributed across mobile phones, wearable devices, and other sensors. Traditionally, these enormous datasets have been processed on a single system, with complex models being trained to make valuable predictions. Distributed machine learning techniques such as Federated and Split Learning have recently been developed to protect user data and privacy better while ensuring high performance. Both of these distributed learning architectures have advantages and disadvantages. In this article, we examine these tradeoffs and suggest a new hybrid Federated Split Learning architecture that combines the efficiency and privacy benefits of both. Our evaluation demonstrates how our hybrid Federated Split Learning approach can lower the amount of processing power required by each client running a distributed learning system, and reduce training and inference time while keeping a similar accuracy. We also discuss the resiliency of our approach to deep learning privacy inference attacks and compare our solution to other recently proposed benchmarks.

查看原文本刊更多论文

联邦分裂学习中通信的保密性和效率

每天，大量敏感数据分布在手机、可穿戴设备和其他传感器上。传统上，这些庞大的数据集是在一个系统上处理的，复杂的模型被训练来做出有价值的预测。最近开发了分布式机器学习技术，如联合学习和分割学习，以更好地保护用户数据和隐私，同时确保高性能。这两种分布式学习体系结构都有优点和缺点。在本文中，我们研究了这些权衡，并提出了一种新的混合联合拆分学习架构，该架构结合了两者的效率和隐私优势。我们的评估表明，我们的混合联合分割学习方法可以降低运行分布式学习系统的每个客户端所需的处理能力，减少训练和推理时间，同时保持类似的准确性。我们还讨论了我们的方法对深度学习隐私推断攻击的弹性，并将我们的解决方案与最近提出的其他基准进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Big Data Multiple-

CiteScore

11.80

自引率

2.80%

发文量

114

期刊介绍： The IEEE Transactions on Big Data publishes peer-reviewed articles focusing on big data. These articles present innovative research ideas and application results across disciplines, including novel theories, algorithms, and applications. Research areas cover a wide range, such as big data analytics, visualization, curation, management, semantics, infrastructure, standards, performance analysis, intelligence extraction, scientific discovery, security, privacy, and legal issues specific to big data. The journal also prioritizes applications of big data in fields generating massive datasets.