Ling Xing , Jingjing Cui , Jianping Gao , Kaikai Deng , Honghai Wu , Huahong Ma
{"title":"Octopus:车联网中背包模型驱动的联合学习客户端选择","authors":"Ling Xing , Jingjing Cui , Jianping Gao , Kaikai Deng , Honghai Wu , Huahong Ma","doi":"10.1016/j.pmcj.2025.102063","DOIUrl":null,"url":null,"abstract":"<div><div>Federated learning (FL), as a distributed way for processing real-time vehicle data, is widely used to improve driving experience and enhance service quality in Internet of Vehicles (IoV). However, considering the data and devices heterogeneity of vehicle nodes, randomly selecting vehicles that are involved in model training would suffer from data skewness, high resource consumption, and low convergence speed. To this end, we propose <span>Octopus</span>, which consists of two components: i) an <em>importance sampling-based local loss computation</em> method is designed to request resource information for each client and apply the importance sampling technique to assess each client’s contribution to the global model’s convergence, followed by utilizing a knapsack model that treats the local loss of each client as the item value, while treating the total system training time as the knapsack capacity to accelerate the client convergence; ii) a <em>knapsack model-based federated learning client selection</em> method is designed to select the client with optimal local loss and maximum model uploading speed to participate in training. In each training round, these clients download and update the model within a predefined time, followed by enabling the selected clients to continue uploading the updated model parameters for assisting the server to efficiently complete the model aggregation. Experimental results show that <span>Octopus</span> improved the model accuracy by 2.64% <span><math><mo>∼</mo></math></span>32.61% with heterogeneous data, and by 1.97% <span><math><mo>∼</mo></math></span>11.74% with device heterogeneity, compared to eight state-of-the-art baselines.</div></div>","PeriodicalId":49005,"journal":{"name":"Pervasive and Mobile Computing","volume":"111 ","pages":"Article 102063"},"PeriodicalIF":3.0000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Octopus: Knapsack model-driven federated learning client selection in internet of vehicles\",\"authors\":\"Ling Xing , Jingjing Cui , Jianping Gao , Kaikai Deng , Honghai Wu , Huahong Ma\",\"doi\":\"10.1016/j.pmcj.2025.102063\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Federated learning (FL), as a distributed way for processing real-time vehicle data, is widely used to improve driving experience and enhance service quality in Internet of Vehicles (IoV). However, considering the data and devices heterogeneity of vehicle nodes, randomly selecting vehicles that are involved in model training would suffer from data skewness, high resource consumption, and low convergence speed. To this end, we propose <span>Octopus</span>, which consists of two components: i) an <em>importance sampling-based local loss computation</em> method is designed to request resource information for each client and apply the importance sampling technique to assess each client’s contribution to the global model’s convergence, followed by utilizing a knapsack model that treats the local loss of each client as the item value, while treating the total system training time as the knapsack capacity to accelerate the client convergence; ii) a <em>knapsack model-based federated learning client selection</em> method is designed to select the client with optimal local loss and maximum model uploading speed to participate in training. In each training round, these clients download and update the model within a predefined time, followed by enabling the selected clients to continue uploading the updated model parameters for assisting the server to efficiently complete the model aggregation. Experimental results show that <span>Octopus</span> improved the model accuracy by 2.64% <span><math><mo>∼</mo></math></span>32.61% with heterogeneous data, and by 1.97% <span><math><mo>∼</mo></math></span>11.74% with device heterogeneity, compared to eight state-of-the-art baselines.</div></div>\",\"PeriodicalId\":49005,\"journal\":{\"name\":\"Pervasive and Mobile Computing\",\"volume\":\"111 \",\"pages\":\"Article 102063\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pervasive and Mobile Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574119225000525\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pervasive and Mobile Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574119225000525","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Octopus: Knapsack model-driven federated learning client selection in internet of vehicles
Federated learning (FL), as a distributed way for processing real-time vehicle data, is widely used to improve driving experience and enhance service quality in Internet of Vehicles (IoV). However, considering the data and devices heterogeneity of vehicle nodes, randomly selecting vehicles that are involved in model training would suffer from data skewness, high resource consumption, and low convergence speed. To this end, we propose Octopus, which consists of two components: i) an importance sampling-based local loss computation method is designed to request resource information for each client and apply the importance sampling technique to assess each client’s contribution to the global model’s convergence, followed by utilizing a knapsack model that treats the local loss of each client as the item value, while treating the total system training time as the knapsack capacity to accelerate the client convergence; ii) a knapsack model-based federated learning client selection method is designed to select the client with optimal local loss and maximum model uploading speed to participate in training. In each training round, these clients download and update the model within a predefined time, followed by enabling the selected clients to continue uploading the updated model parameters for assisting the server to efficiently complete the model aggregation. Experimental results show that Octopus improved the model accuracy by 2.64% 32.61% with heterogeneous data, and by 1.97% 11.74% with device heterogeneity, compared to eight state-of-the-art baselines.
期刊介绍:
As envisioned by Mark Weiser as early as 1991, pervasive computing systems and services have truly become integral parts of our daily lives. Tremendous developments in a multitude of technologies ranging from personalized and embedded smart devices (e.g., smartphones, sensors, wearables, IoTs, etc.) to ubiquitous connectivity, via a variety of wireless mobile communications and cognitive networking infrastructures, to advanced computing techniques (including edge, fog and cloud) and user-friendly middleware services and platforms have significantly contributed to the unprecedented advances in pervasive and mobile computing. Cutting-edge applications and paradigms have evolved, such as cyber-physical systems and smart environments (e.g., smart city, smart energy, smart transportation, smart healthcare, etc.) that also involve human in the loop through social interactions and participatory and/or mobile crowd sensing, for example. The goal of pervasive computing systems is to improve human experience and quality of life, without explicit awareness of the underlying communications and computing technologies.
The Pervasive and Mobile Computing Journal (PMC) is a high-impact, peer-reviewed technical journal that publishes high-quality scientific articles spanning theory and practice, and covering all aspects of pervasive and mobile computing and systems.