THE-TAFL: Transforming Healthcare Edge with Transformer-based Adaptive Federated Learning and Learning Rate Optimization

IF 6 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Internet of Things Pub Date : 2025-04-17 DOI:10.1016/j.iot.2025.101605

Farhan Ullah , Nazeeruddin Mohammad , Leonardo Mostarda , Diletta Cacciagrano , Shamsher Ullah , Yue Zhao

{"title":"THE-TAFL: Transforming Healthcare Edge with Transformer-based Adaptive Federated Learning and Learning Rate Optimization","authors":"Farhan Ullah , Nazeeruddin Mohammad , Leonardo Mostarda , Diletta Cacciagrano , Shamsher Ullah , Yue Zhao","doi":"10.1016/j.iot.2025.101605","DOIUrl":null,"url":null,"abstract":"<div><div>The healthcare industry is becoming more vulnerable to privacy violations and cybercrime due to the pervasive dissemination and sensitivity of medical data. Advanced data security systems are needed to protect privacy, data integrity, and dependability as confidentiality breaches increase across industries. Decentralized healthcare networks face challenges in feature extraction during local training, hindering effective federated averaging and learning rate optimization, which affects data processing and model training efficiency. This paper introduces a novel approach of Transforming Healthcare Edge with Transformer-based Adaptive Federated Learning (THE-TAFL) and Learning Rate Optimization. In this paper, we combine Transformer-based Adaptive Federated Learning (TAFL) with learning rate optimization to improve the privacy and security of healthcare information on edge devices. We used data augmentation approaches that generate robust and generalized input datasets for deep learning models. Next, we use the Vision Transformer (ViT) model for local training, generating Local Model Weights (LMUs) that enhance feature extraction and learning. We designed a training optimization method that improves model performance and stability by combining a loss function with weight decay for regularization, learning rate scheduling, and gradient clipping. This ensures effective training across decentralized clients in a Federated Learning (FL) framework. The FL server receives LMUs from many clients and aggregates them. The aggregation procedure utilizes adaptive federated averaging to aggregate the LMUs based on the performance of each client. This adaptive method ensures that high-performing clients contribute more to the Global Model Update (GMU). Following aggregation, clients receive the GMU to continue training with the updated parameters, ensuring collaborative and dynamic learning. The proposed method provides better performance on two standard datasets using various numbers of clients.</div></div>","PeriodicalId":29968,"journal":{"name":"Internet of Things","volume":"32 ","pages":"Article 101605"},"PeriodicalIF":6.0000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet of Things","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2542660525001180","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

The healthcare industry is becoming more vulnerable to privacy violations and cybercrime due to the pervasive dissemination and sensitivity of medical data. Advanced data security systems are needed to protect privacy, data integrity, and dependability as confidentiality breaches increase across industries. Decentralized healthcare networks face challenges in feature extraction during local training, hindering effective federated averaging and learning rate optimization, which affects data processing and model training efficiency. This paper introduces a novel approach of Transforming Healthcare Edge with Transformer-based Adaptive Federated Learning (THE-TAFL) and Learning Rate Optimization. In this paper, we combine Transformer-based Adaptive Federated Learning (TAFL) with learning rate optimization to improve the privacy and security of healthcare information on edge devices. We used data augmentation approaches that generate robust and generalized input datasets for deep learning models. Next, we use the Vision Transformer (ViT) model for local training, generating Local Model Weights (LMUs) that enhance feature extraction and learning. We designed a training optimization method that improves model performance and stability by combining a loss function with weight decay for regularization, learning rate scheduling, and gradient clipping. This ensures effective training across decentralized clients in a Federated Learning (FL) framework. The FL server receives LMUs from many clients and aggregates them. The aggregation procedure utilizes adaptive federated averaging to aggregate the LMUs based on the performance of each client. This adaptive method ensures that high-performing clients contribute more to the Global Model Update (GMU). Following aggregation, clients receive the GMU to continue training with the updated parameters, ensuring collaborative and dynamic learning. The proposed method provides better performance on two standard datasets using various numbers of clients.

查看原文本刊更多论文

THE-TAFL：利用基于变压器的自适应联合学习和学习率优化转变医疗保健优势

由于医疗数据的广泛传播和敏感性，医疗保健行业越来越容易受到隐私侵犯和网络犯罪的影响。随着各行业机密泄露事件的增加，需要先进的数据安全系统来保护隐私、数据完整性和可靠性。分散的医疗网络在局部训练过程中面临特征提取的挑战，阻碍了有效的联邦平均和学习率优化，从而影响了数据处理和模型训练效率。本文介绍了一种基于变压器的自适应联邦学习（THE-TAFL）和学习率优化的医疗边缘转换新方法。在本文中，我们将基于变压器的自适应联邦学习（TAFL）与学习率优化相结合，以提高边缘设备上医疗保健信息的隐私性和安全性。我们使用数据增强方法为深度学习模型生成鲁棒和广义输入数据集。接下来，我们使用Vision Transformer （ViT）模型进行局部训练，生成增强特征提取和学习的局部模型权重（lmu）。我们设计了一种训练优化方法，通过将损失函数与权值衰减相结合，用于正则化、学习率调度和梯度裁剪，提高了模型的性能和稳定性。这确保了在联邦学习（FL）框架中跨分散的客户端进行有效的培训。FL服务器接收来自许多客户机的lmu，并对它们进行聚合。聚合过程利用自适应联邦平均来根据每个客户机的性能聚合lmu。这种自适应方法确保高性能客户端对全局模型更新（GMU）做出更多贡献。在聚合之后，客户端接收GMU，继续使用更新的参数进行培训，确保协作和动态学习。该方法在使用不同数量客户端的两个标准数据集上提供了更好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Internet of Things Multiple-

CiteScore

3.60

自引率

5.10%

发文量

115

审稿时长

37 days

期刊介绍： Internet of Things; Engineering Cyber Physical Human Systems is a comprehensive journal encouraging cross collaboration between researchers, engineers and practitioners in the field of IoT & Cyber Physical Human Systems. The journal offers a unique platform to exchange scientific information on the entire breadth of technology, science, and societal applications of the IoT. The journal will place a high priority on timely publication, and provide a home for high quality. Furthermore, IOT is interested in publishing topical Special Issues on any aspect of IOT.