Ignatius Iwan, Sean Yonathan Tanjung, Bernardo Nugroho Yahya, Seok-Lyong Lee
{"title":"FedCLLM: Federated client selection assisted large language model utilizing domain description","authors":"Ignatius Iwan, Sean Yonathan Tanjung, Bernardo Nugroho Yahya, Seok-Lyong Lee","doi":"10.1016/j.iot.2025.101506","DOIUrl":null,"url":null,"abstract":"<div><div>Federated Learning has become an emerging topic since the rise of privacy regulation regarding personal data protection and sensitivity. It provides a decentralized training approach to train a global model between a server and multiple clients while ensuring client data confidentiality. However, in practical scenarios, there are malicious clients within a large pool of client candidates, and selecting trustworthy honest clients becomes a crucial problem. Some previous works tried to solve the problem by exchanging client data for comparison and using a labelled dataset for evaluating client models to select honest clients. However, they pose a limitation when the server possesses unlabelled data from other sources and has limited or unavailable resources to label the data. To address the issue, this work proposes FedCLLM, a novel approach using Large Language Models (LLM) proficiency in semantic tasks on text-based data to compare client and server domain descriptions summary in a text format and assess their similarity. Experiments on popular benchmark datasets show that FedCLLM effectively distinguishes honest clients from potentially malicious ones and outperforms other previous works in terms of performance.</div></div>","PeriodicalId":29968,"journal":{"name":"Internet of Things","volume":"30 ","pages":"Article 101506"},"PeriodicalIF":6.0000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet of Things","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2542660525000198","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Federated Learning has become an emerging topic since the rise of privacy regulation regarding personal data protection and sensitivity. It provides a decentralized training approach to train a global model between a server and multiple clients while ensuring client data confidentiality. However, in practical scenarios, there are malicious clients within a large pool of client candidates, and selecting trustworthy honest clients becomes a crucial problem. Some previous works tried to solve the problem by exchanging client data for comparison and using a labelled dataset for evaluating client models to select honest clients. However, they pose a limitation when the server possesses unlabelled data from other sources and has limited or unavailable resources to label the data. To address the issue, this work proposes FedCLLM, a novel approach using Large Language Models (LLM) proficiency in semantic tasks on text-based data to compare client and server domain descriptions summary in a text format and assess their similarity. Experiments on popular benchmark datasets show that FedCLLM effectively distinguishes honest clients from potentially malicious ones and outperforms other previous works in terms of performance.
期刊介绍:
Internet of Things; Engineering Cyber Physical Human Systems is a comprehensive journal encouraging cross collaboration between researchers, engineers and practitioners in the field of IoT & Cyber Physical Human Systems. The journal offers a unique platform to exchange scientific information on the entire breadth of technology, science, and societal applications of the IoT.
The journal will place a high priority on timely publication, and provide a home for high quality.
Furthermore, IOT is interested in publishing topical Special Issues on any aspect of IOT.