Qing Zhao , Jielei Chu , Zhaoyu Li , Wei Huang , Zhipeng Luo , Tianrui Li
{"title":"FedLGMatch:通过联合局部和全局伪标记的联邦半监督学习","authors":"Qing Zhao , Jielei Chu , Zhaoyu Li , Wei Huang , Zhipeng Luo , Tianrui Li","doi":"10.1016/j.knosys.2025.113642","DOIUrl":null,"url":null,"abstract":"<div><div>The bulk of existing Federated Learning (FL) algorithms pay attention to supervised setting and assume that clients have fully labeled data. However, it may be impractical for all clients to obtain plenty of labels due to high annotation costs. Hence, the Federated Semi-Supervised Learning (FSSL) as a promising paradigm has better prospect in many realistic applications (e.g. medical scenario). Under Labels-at-Server scenario, most pseudo labeling based FSSL approaches use only the global model to generate pseudo-labels for unlabeled data, while the local models are ignored. When the local data distribution is much more different from the central server (e.g., Non-IID setting), the generated pseudo-labels may contain much noise, thus, resulting in more serious confirmation bias. To tackle the crucial issue, a novel Federated Semi-Supervised Learning via Joint Local and Global Pseudo Labeling (FedLGMatch) framework is proposed in this paper. The prominent advantage of the proposed FedLGMatch is that it allows local models trained in the last communication round to assist global model in generating pseudo-labels, which successfully emphasizes more clean pseudo-label learning at each client. Experimental results also show that FedLGMatch achieves significant performance improvements than other state-of-the-art models on the standard benchmark datasets.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"320 ","pages":"Article 113642"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FedLGMatch: Federated semi-supervised learning via joint local and global pseudo labeling\",\"authors\":\"Qing Zhao , Jielei Chu , Zhaoyu Li , Wei Huang , Zhipeng Luo , Tianrui Li\",\"doi\":\"10.1016/j.knosys.2025.113642\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The bulk of existing Federated Learning (FL) algorithms pay attention to supervised setting and assume that clients have fully labeled data. However, it may be impractical for all clients to obtain plenty of labels due to high annotation costs. Hence, the Federated Semi-Supervised Learning (FSSL) as a promising paradigm has better prospect in many realistic applications (e.g. medical scenario). Under Labels-at-Server scenario, most pseudo labeling based FSSL approaches use only the global model to generate pseudo-labels for unlabeled data, while the local models are ignored. When the local data distribution is much more different from the central server (e.g., Non-IID setting), the generated pseudo-labels may contain much noise, thus, resulting in more serious confirmation bias. To tackle the crucial issue, a novel Federated Semi-Supervised Learning via Joint Local and Global Pseudo Labeling (FedLGMatch) framework is proposed in this paper. The prominent advantage of the proposed FedLGMatch is that it allows local models trained in the last communication round to assist global model in generating pseudo-labels, which successfully emphasizes more clean pseudo-label learning at each client. Experimental results also show that FedLGMatch achieves significant performance improvements than other state-of-the-art models on the standard benchmark datasets.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"320 \",\"pages\":\"Article 113642\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705125006884\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125006884","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
FedLGMatch: Federated semi-supervised learning via joint local and global pseudo labeling
The bulk of existing Federated Learning (FL) algorithms pay attention to supervised setting and assume that clients have fully labeled data. However, it may be impractical for all clients to obtain plenty of labels due to high annotation costs. Hence, the Federated Semi-Supervised Learning (FSSL) as a promising paradigm has better prospect in many realistic applications (e.g. medical scenario). Under Labels-at-Server scenario, most pseudo labeling based FSSL approaches use only the global model to generate pseudo-labels for unlabeled data, while the local models are ignored. When the local data distribution is much more different from the central server (e.g., Non-IID setting), the generated pseudo-labels may contain much noise, thus, resulting in more serious confirmation bias. To tackle the crucial issue, a novel Federated Semi-Supervised Learning via Joint Local and Global Pseudo Labeling (FedLGMatch) framework is proposed in this paper. The prominent advantage of the proposed FedLGMatch is that it allows local models trained in the last communication round to assist global model in generating pseudo-labels, which successfully emphasizes more clean pseudo-label learning at each client. Experimental results also show that FedLGMatch achieves significant performance improvements than other state-of-the-art models on the standard benchmark datasets.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.