{"title":"Extending battery lifespan in IoT extreme sensor networks through collaborative reinforcement learning-powered task offloading","authors":"Mateo Cumia, Gabriel Mujica, Jorge Portilla","doi":"10.1016/j.iot.2025.101534","DOIUrl":null,"url":null,"abstract":"<div><div>The use of wireless sensor networks (WSN) is increasingly widespread in the Internet of Things domain. Additionally, since the onset of the edge computing paradigm that brings the cloud closer to devices, these networks have seen improvements in battery lifetime and processing time, particularly in extreme edge architectures where network resources are more limited. Meanwhile, AI and machine learning techniques have been expanding across various domains to optimize different decision-making processes, including the task assignment problem in computation offloading. This article employs reinforcement learning (RL) techniques to address the task offloading problem, aiming to extend the lifespan of a WSN. To achieve this, a distributed multi-agent Q-learning algorithm is proposed, where sensor nodes (SNs) make collaborative decisions towards a common goal, avoiding selfish decision-making. The proposed algorithm is compared with two other state-of-the-art solutions, that is, a well-known Q-learning algorithm that allows centralized estimation of the Q-table before distributing it to the network’s sensor nodes (SNs), and a similar implementation of this algorithm but using Deep Q-learning, which theoretically should achieve the best results. The outcomes show that the multi-agent RL algorithm improves performance when it takes other nodes in the network into account in its decisions, being the most suitable solution to be embedded in resource-constrained devices. Although it still achieves worse results than the Deep Q-learning algorithm, the latter involves much greater difficulties for implementation in real devices.</div></div>","PeriodicalId":29968,"journal":{"name":"Internet of Things","volume":"31 ","pages":"Article 101534"},"PeriodicalIF":6.0000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet of Things","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2542660525000472","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The use of wireless sensor networks (WSN) is increasingly widespread in the Internet of Things domain. Additionally, since the onset of the edge computing paradigm that brings the cloud closer to devices, these networks have seen improvements in battery lifetime and processing time, particularly in extreme edge architectures where network resources are more limited. Meanwhile, AI and machine learning techniques have been expanding across various domains to optimize different decision-making processes, including the task assignment problem in computation offloading. This article employs reinforcement learning (RL) techniques to address the task offloading problem, aiming to extend the lifespan of a WSN. To achieve this, a distributed multi-agent Q-learning algorithm is proposed, where sensor nodes (SNs) make collaborative decisions towards a common goal, avoiding selfish decision-making. The proposed algorithm is compared with two other state-of-the-art solutions, that is, a well-known Q-learning algorithm that allows centralized estimation of the Q-table before distributing it to the network’s sensor nodes (SNs), and a similar implementation of this algorithm but using Deep Q-learning, which theoretically should achieve the best results. The outcomes show that the multi-agent RL algorithm improves performance when it takes other nodes in the network into account in its decisions, being the most suitable solution to be embedded in resource-constrained devices. Although it still achieves worse results than the Deep Q-learning algorithm, the latter involves much greater difficulties for implementation in real devices.
期刊介绍:
Internet of Things; Engineering Cyber Physical Human Systems is a comprehensive journal encouraging cross collaboration between researchers, engineers and practitioners in the field of IoT & Cyber Physical Human Systems. The journal offers a unique platform to exchange scientific information on the entire breadth of technology, science, and societal applications of the IoT.
The journal will place a high priority on timely publication, and provide a home for high quality.
Furthermore, IOT is interested in publishing topical Special Issues on any aspect of IOT.