Himel Saha , Md Nur Ahmed , Palash Roy , Md. Abdur Razzaque , Nafis Fuad Tanvir , Mohammad Mehedi Hassan , Md Zia Uddin
{"title":"面向数字孪生的工业物联网的设备和数据异构感知分离学习","authors":"Himel Saha , Md Nur Ahmed , Palash Roy , Md. Abdur Razzaque , Nafis Fuad Tanvir , Mohammad Mehedi Hassan , Md Zia Uddin","doi":"10.1016/j.comnet.2025.111478","DOIUrl":null,"url":null,"abstract":"<div><div>Distributed learning methods in the Industrial Internet of Things (IIoT) face challenges due to their dynamically changing learning environment. This is mainly caused by two facts. Firstly, the complex nature of statistically heterogeneous data, often non-independent and identically distributed (non-IID), from geographically scattered clients hampers the accuracy of the training model. Secondly, the heterogeneity in computational and communication resources among IIoT devices frequently introduces instability in the training process, resulting in a delay referred to as the straggler. Existing literature addresses only the unilateral side of the heterogeneity issue but lacks comprehensive efforts to tackle the joint problem of device and data heterogeneities for the resource-constrained IIoT. To address these multifaceted challenges, in this article, we have introduced device and data Heterogeneity Aware SplitFed Learning, namely Het-SFL, a distributed learning framework designed for deployment within dynamic IIoT networks. The developed Het-SFL framework optimizes resource utilization and reduces the computational burden by dynamically assessing the optimal split point of the training model for the resource-limited devices, considering factors such as training time, device energy consumption, and model accuracy. A clustering mechanism is employed to mitigate the straggler effect and to reduce the solution space to obtain the optimal split point within a significantly shortened deadline. Furthermore, the Het-SFL framework leverages emerging Digital Twin (DT) technology to facilitate real-time analysis of heterogeneous data, thereby improving the performance of the training model in non-IID contexts. The numeric performance analysis reveals that the Het-SFL framework improves training performance in terms of accuracy, training time, and device energy consumption by up to 30%, 40%, and 55%, respectively, compared to other state-of-the-art works.</div></div>","PeriodicalId":50637,"journal":{"name":"Computer Networks","volume":"270 ","pages":"Article 111478"},"PeriodicalIF":4.6000,"publicationDate":"2025-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Device and data Heterogeneity Aware SplitFed Learning for Digital Twin empowered Industrial Internet of Things\",\"authors\":\"Himel Saha , Md Nur Ahmed , Palash Roy , Md. Abdur Razzaque , Nafis Fuad Tanvir , Mohammad Mehedi Hassan , Md Zia Uddin\",\"doi\":\"10.1016/j.comnet.2025.111478\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Distributed learning methods in the Industrial Internet of Things (IIoT) face challenges due to their dynamically changing learning environment. This is mainly caused by two facts. Firstly, the complex nature of statistically heterogeneous data, often non-independent and identically distributed (non-IID), from geographically scattered clients hampers the accuracy of the training model. Secondly, the heterogeneity in computational and communication resources among IIoT devices frequently introduces instability in the training process, resulting in a delay referred to as the straggler. Existing literature addresses only the unilateral side of the heterogeneity issue but lacks comprehensive efforts to tackle the joint problem of device and data heterogeneities for the resource-constrained IIoT. To address these multifaceted challenges, in this article, we have introduced device and data Heterogeneity Aware SplitFed Learning, namely Het-SFL, a distributed learning framework designed for deployment within dynamic IIoT networks. The developed Het-SFL framework optimizes resource utilization and reduces the computational burden by dynamically assessing the optimal split point of the training model for the resource-limited devices, considering factors such as training time, device energy consumption, and model accuracy. A clustering mechanism is employed to mitigate the straggler effect and to reduce the solution space to obtain the optimal split point within a significantly shortened deadline. Furthermore, the Het-SFL framework leverages emerging Digital Twin (DT) technology to facilitate real-time analysis of heterogeneous data, thereby improving the performance of the training model in non-IID contexts. The numeric performance analysis reveals that the Het-SFL framework improves training performance in terms of accuracy, training time, and device energy consumption by up to 30%, 40%, and 55%, respectively, compared to other state-of-the-art works.</div></div>\",\"PeriodicalId\":50637,\"journal\":{\"name\":\"Computer Networks\",\"volume\":\"270 \",\"pages\":\"Article 111478\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-06-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Networks\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1389128625004451\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1389128625004451","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
Device and data Heterogeneity Aware SplitFed Learning for Digital Twin empowered Industrial Internet of Things
Distributed learning methods in the Industrial Internet of Things (IIoT) face challenges due to their dynamically changing learning environment. This is mainly caused by two facts. Firstly, the complex nature of statistically heterogeneous data, often non-independent and identically distributed (non-IID), from geographically scattered clients hampers the accuracy of the training model. Secondly, the heterogeneity in computational and communication resources among IIoT devices frequently introduces instability in the training process, resulting in a delay referred to as the straggler. Existing literature addresses only the unilateral side of the heterogeneity issue but lacks comprehensive efforts to tackle the joint problem of device and data heterogeneities for the resource-constrained IIoT. To address these multifaceted challenges, in this article, we have introduced device and data Heterogeneity Aware SplitFed Learning, namely Het-SFL, a distributed learning framework designed for deployment within dynamic IIoT networks. The developed Het-SFL framework optimizes resource utilization and reduces the computational burden by dynamically assessing the optimal split point of the training model for the resource-limited devices, considering factors such as training time, device energy consumption, and model accuracy. A clustering mechanism is employed to mitigate the straggler effect and to reduce the solution space to obtain the optimal split point within a significantly shortened deadline. Furthermore, the Het-SFL framework leverages emerging Digital Twin (DT) technology to facilitate real-time analysis of heterogeneous data, thereby improving the performance of the training model in non-IID contexts. The numeric performance analysis reveals that the Het-SFL framework improves training performance in terms of accuracy, training time, and device energy consumption by up to 30%, 40%, and 55%, respectively, compared to other state-of-the-art works.
期刊介绍:
Computer Networks is an international, archival journal providing a publication vehicle for complete coverage of all topics of interest to those involved in the computer communications networking area. The audience includes researchers, managers and operators of networks as well as designers and implementors. The Editorial Board will consider any material for publication that is of interest to those groups.