Zhiheng Liu;Hongtao Lv;Xiangyu Liu;Chenhao Ma;Fan Wu;Lei Liu;Lizhen Cui
{"title":"Similarity and Diversity: PCA-Based Contribution Evaluation in Federated Learning","authors":"Zhiheng Liu;Hongtao Lv;Xiangyu Liu;Chenhao Ma;Fan Wu;Lei Liu;Lizhen Cui","doi":"10.1109/JIOT.2025.3546679","DOIUrl":null,"url":null,"abstract":"Federated learning (FL) is a rapidly evolving paradigm that facilitates distributed training of large-scale deep neural networks (DNNs). However, the distributed nature exposes the system to threats from potentially malicious or low-quality participants, which can significantly degrade the overall performance of FL. Existing contribution evaluation approaches in previous FL studies are vulnerable when there exist complicated types of malicious or low-quality clients. In this article, we propose to assess the clients’ contributions by treating their model parameters as data. By extracting information from the statistical properties of model parameters using principal component analysis-based data mining techniques, we quantitatively estimate the similarity and diversity between different clients. Furthermore, we analyze the convergence of our proposed method and establish a convergence rate of <inline-formula> <tex-math>$\\mathcal {O}({1}/{T})$ </tex-math></inline-formula> with commonly accepted assumptions. Extensive experiments are conducted on public datasets to evaluate the effectiveness of our proposed method against typical malicious or low-quality clients: sybil-based backdoor attackers and clients with redundant data. Experimental results demonstrate the superiority of our approach in excluding malicious or low-quality clients and thereby enhancing the model performance in FL.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 12","pages":"20393-20405"},"PeriodicalIF":8.9000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10907906/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Federated learning (FL) is a rapidly evolving paradigm that facilitates distributed training of large-scale deep neural networks (DNNs). However, the distributed nature exposes the system to threats from potentially malicious or low-quality participants, which can significantly degrade the overall performance of FL. Existing contribution evaluation approaches in previous FL studies are vulnerable when there exist complicated types of malicious or low-quality clients. In this article, we propose to assess the clients’ contributions by treating their model parameters as data. By extracting information from the statistical properties of model parameters using principal component analysis-based data mining techniques, we quantitatively estimate the similarity and diversity between different clients. Furthermore, we analyze the convergence of our proposed method and establish a convergence rate of $\mathcal {O}({1}/{T})$ with commonly accepted assumptions. Extensive experiments are conducted on public datasets to evaluate the effectiveness of our proposed method against typical malicious or low-quality clients: sybil-based backdoor attackers and clients with redundant data. Experimental results demonstrate the superiority of our approach in excluding malicious or low-quality clients and thereby enhancing the model performance in FL.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.