VarLenMARL: A Framework of Variable-Length Time-Step Multi-Agent Reinforcement Learning for Cooperative Charging in Sensor Networks

2021 18th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON) Pub Date : 2021-07-06 DOI:10.1109/SECON52354.2021.9491594

Yuxin Chen, He Wu, Yongheng Liang, G. Lai

{"title":"VarLenMARL: A Framework of Variable-Length Time-Step Multi-Agent Reinforcement Learning for Cooperative Charging in Sensor Networks","authors":"Yuxin Chen, He Wu, Yongheng Liang, G. Lai","doi":"10.1109/SECON52354.2021.9491594","DOIUrl":null,"url":null,"abstract":"This paper studies cooperative charging, in which multiple mobile chargers cooperatively provide wireless charging services in a Wireless Rechargeable Sensor Network (WRSN). The ultimate goal of this cooperative charging is the long-term optimization that maximizes both the lifetime of all sensor nodes and the charging utility of each Mobile Charger (MC). We have attempted to apply Multi-Agent Reinforcement Learning (MARL) algorithms to this problem. Unfortunately, similar to existing methods, MARL algorithms also fail early in cooperative charging. We found that an MARL algorithm trained in each time-step of fixed length is neither accurate nor efficient in cooperative charging. We propose a new MARL framework, called VarLenMARL. For the accuracy of reward estimation, VarLenMARL allows each MC completes an action within a time-step of variable length before estimating rewards. Furthermore, we design a special mechanism in VarLenMARL for the long-term optimality of cooperative charging within a WRSN. Our results show that algorithms implemented on VarLenMARL achieved both higher charging utility of MCs and longer lifetime of sensor nodes.","PeriodicalId":120945,"journal":{"name":"2021 18th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 18th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SECON52354.2021.9491594","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

This paper studies cooperative charging, in which multiple mobile chargers cooperatively provide wireless charging services in a Wireless Rechargeable Sensor Network (WRSN). The ultimate goal of this cooperative charging is the long-term optimization that maximizes both the lifetime of all sensor nodes and the charging utility of each Mobile Charger (MC). We have attempted to apply Multi-Agent Reinforcement Learning (MARL) algorithms to this problem. Unfortunately, similar to existing methods, MARL algorithms also fail early in cooperative charging. We found that an MARL algorithm trained in each time-step of fixed length is neither accurate nor efficient in cooperative charging. We propose a new MARL framework, called VarLenMARL. For the accuracy of reward estimation, VarLenMARL allows each MC completes an action within a time-step of variable length before estimating rewards. Furthermore, we design a special mechanism in VarLenMARL for the long-term optimality of cooperative charging within a WRSN. Our results show that algorithms implemented on VarLenMARL achieved both higher charging utility of MCs and longer lifetime of sensor nodes.

查看原文本刊更多论文

基于变长时间步长多智能体强化学习的传感器网络协同充电框架

本文研究了在无线充电传感器网络(WRSN)中多个移动充电器协同提供无线充电服务的协同充电技术。这种合作充电的最终目标是使所有传感器节点的使用寿命和每个移动充电器(MC)的充电效用最大化的长期优化。我们尝试应用多智能体强化学习(MARL)算法来解决这个问题。不幸的是，与现有的方法类似，MARL算法在合作收费中也很早就失败了。我们发现在固定长度的每个时间步训练的MARL算法在协同收费中既不准确也不高效。我们提出了一个新的MARL框架，称为VarLenMARL。为了奖励估计的准确性，VarLenMARL允许每个MC在估计奖励之前在可变长度的时间步长内完成一个动作。此外，我们在VarLenMARL中设计了一个特殊的机制，以实现WRSN内合作充电的长期最优性。结果表明，在VarLenMARL上实现的算法既提高了mc的充电利用率，又延长了传感器节点的寿命。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 18th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON)

自引率

0.00%

发文量