FedDBL: Communication and Data Efficient Federated Deep-Broad Learning for Histopathological Tissue Classification.

IF 9.4 · Zone 1 (Computer Science) · Q1 AUTOMATION & CONTROL SYSTEMS

IEEE Transactions on Cybernetics Pub Date : 2024-06-26 DOI:10.1109/TCYB.2024.3403927

Tianpeng Deng, Yanqi Huang, Guoqiang Han, Zhenwei Shi, Jiatai Lin, Qi Dou, Zaiyi Liu, Xiao-Jing Guo, C L Philip Chen, Chu Han

Citations: 0

Abstract

Histopathological tissue classification is a fundamental task in computational pathology. Deep learning (DL)-based models have achieved superior performance, but centralized training suffers from the privacy leakage problem. Federated learning (FL) can safeguard privacy by keeping training samples local, but existing FL-based frameworks require a large number of well-annotated training samples and numerous rounds of communication, which hinders their viability in real-world clinical scenarios. In this article, we propose a lightweight and universal FL framework, named federated deep-broad learning (FedDBL), to achieve superior classification performance with limited training samples and only one round of communication. By simply integrating a pretrained DL feature extractor and a fast, lightweight broad learning inference system with a classical federated aggregation approach, FedDBL can dramatically reduce data dependency and improve communication efficiency. Five-fold cross-validation demonstrates that FedDBL greatly outperforms its competitors with only one round of communication and limited training samples, and even achieves performance comparable to that of competitors trained with multiple rounds of communication. Furthermore, owing to the lightweight design and one-round communication, FedDBL reduces the communication burden from 4.6 GB to only 138.4 KB per client using the ResNet-50 backbone at 50-round training. Extensive experiments also show the scalability of FedDBL with respect to model generalization to an unseen dataset, various client numbers, model personalization, and other image modalities. Since neither data nor deep models are shared across clients, the privacy issue is well addressed and model security is guaranteed, with no risk of model inversion attacks. Code is available at https://github.com/tianpeng-deng/FedDBL.
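
The abstract names three ingredients (a pretrained feature extractor, a broad learning head, and a classical federated aggregation step) without giving implementation details. The sketch below illustrates how such a one-round pipeline could fit together; it is not the authors' code, which is available at the repository above. It rests on several assumptions: synthetic Gaussian vectors stand in for features from a frozen backbone, the broad learning head is simplified to fixed random mappings followed by a closed-form ridge regression, all clients share the random mappings through a common seed, and aggregation is a sample-weighted average of the client weights. All function names are hypothetical.

```python
import numpy as np

def make_random_maps(feat_dim, n_feature_nodes, n_enhance_nodes, seed=0):
    # Fixed random mappings. Assumption: every client builds them from the
    # same seed, so the output weights of different clients are comparable.
    rng = np.random.default_rng(seed)
    return {
        "We": rng.standard_normal((feat_dim, n_feature_nodes)) / np.sqrt(feat_dim),
        "Wh": rng.standard_normal((n_feature_nodes, n_enhance_nodes)) / np.sqrt(n_feature_nodes),
    }

def broad_nodes(features, maps):
    Z = features @ maps["We"]        # linear feature nodes
    H = np.tanh(Z @ maps["Wh"])      # nonlinear enhancement nodes
    return np.hstack([Z, H])

def fit_client(features, labels, n_classes, maps, ridge=1.0):
    # Closed-form ridge regression for the output weights: no backpropagation.
    A = broad_nodes(features, maps)
    Y = np.eye(n_classes)[labels]    # one-hot targets
    return np.linalg.solve(A.T @ A + ridge * np.eye(A.shape[1]), A.T @ Y)

def aggregate(client_weights, client_sample_counts):
    # Sample-weighted average of the client weight matrices (FedAvg-style).
    counts = np.asarray(client_sample_counts, dtype=float)
    return np.tensordot(counts / counts.sum(), np.stack(client_weights), axes=1)

def predict(features, maps, W):
    return np.argmax(broad_nodes(features, maps) @ W, axis=1)

def synthetic_features(rng, centers, n_per_class):
    # Stand-in for embeddings produced by a frozen pretrained backbone.
    labels = np.repeat(np.arange(len(centers)), n_per_class)
    noise = 0.5 * rng.standard_normal((labels.size, centers.shape[1]))
    return centers[labels] + noise, labels

rng = np.random.default_rng(42)
n_classes, feat_dim = 3, 16
centers = 3.0 * rng.standard_normal((n_classes, feat_dim))
maps = make_random_maps(feat_dim, n_feature_nodes=32, n_enhance_nodes=64)

# Each client fits locally and uploads only its small weight matrix, once.
client_weights, client_sample_counts = [], []
for n_per_class in (20, 30, 40):
    X, y = synthetic_features(rng, centers, n_per_class)
    client_weights.append(fit_client(X, y, n_classes, maps))
    client_sample_counts.append(len(y))

global_W = aggregate(client_weights, client_sample_counts)

X_test, y_test = synthetic_features(rng, centers, 50)
accuracy = float(np.mean(predict(X_test, maps, global_W) == y_test))
upload_bytes = client_weights[0].nbytes  # size of one client's upload
print(f"accuracy={accuracy:.3f}, upload per client={upload_bytes} bytes")
```

In this toy setup the only payload a client sends is a matrix with one row per broad node and one column per class, which illustrates why a single-round scheme of this kind can be far cheaper than repeatedly exchanging a full deep model. The sizes here are illustrative and do not reproduce the 138.4 KB figure reported in the abstract.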

Source journal

IEEE Transactions on Cybernetics (COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE; COMPUTER SCIENCE, CYBERNETICS)

CiteScore: 25.40

Self-citation rate: 11.00%

Publications: 1869

Journal description: The scope of the IEEE Transactions on Cybernetics includes computational approaches to the field of cybernetics. Specifically, the Transactions welcomes papers on communication and control across machines, or among machines, humans, and organizations. The scope includes such areas as computational intelligence, computer vision, neural networks, genetic algorithms, machine learning, fuzzy systems, cognitive systems, decision making, and robotics, to the extent that they contribute to the theme of cybernetics or demonstrate an application of cybernetics principles.