{"title":"能量采集传感器网络的分布式强化学习算法","authors":"H. Al-Tous, I. Barhumi","doi":"10.1109/BlackSeaCom.2019.8812862","DOIUrl":null,"url":null,"abstract":"In this paper, a distributed reinforcement-learning (RL) algorithm is proposed for power control and data scheduling in energy-harvesting (EH) multi-hop wireless-sensor-networks (WSNs). The WSN consists of M EH sensor nodes aiming to transmit their data to a sink node with minimum delay. Each sensor node has a battery of limited capacity to save the harvested energy and a buffer of limited size to store both the sensed and relayed data from neighboring nodes. A state-action-reward-state-action (SARSA) based distributed algorithm is proposed. The proposed distributed-SARSA (D-SARSA) algorithm adaptively changes the transmitted data and power control at each sensor node according to the state information such that the data of all sensor nodes are received at the sink node with minimum delay. Simulation results demonstrate the merits of the proposed algorithm.","PeriodicalId":359145,"journal":{"name":"2019 IEEE International Black Sea Conference on Communications and Networking (BlackSeaCom)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Distributed Reinforcement Learning Algorithm for Energy Harvesting Sensor Networks\",\"authors\":\"H. Al-Tous, I. Barhumi\",\"doi\":\"10.1109/BlackSeaCom.2019.8812862\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a distributed reinforcement-learning (RL) algorithm is proposed for power control and data scheduling in energy-harvesting (EH) multi-hop wireless-sensor-networks (WSNs). The WSN consists of M EH sensor nodes aiming to transmit their data to a sink node with minimum delay. Each sensor node has a battery of limited capacity to save the harvested energy and a buffer of limited size to store both the sensed and relayed data from neighboring nodes. A state-action-reward-state-action (SARSA) based distributed algorithm is proposed. The proposed distributed-SARSA (D-SARSA) algorithm adaptively changes the transmitted data and power control at each sensor node according to the state information such that the data of all sensor nodes are received at the sink node with minimum delay. Simulation results demonstrate the merits of the proposed algorithm.\",\"PeriodicalId\":359145,\"journal\":{\"name\":\"2019 IEEE International Black Sea Conference on Communications and Networking (BlackSeaCom)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Black Sea Conference on Communications and Networking (BlackSeaCom)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BlackSeaCom.2019.8812862\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Black Sea Conference on Communications and Networking (BlackSeaCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BlackSeaCom.2019.8812862","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Distributed Reinforcement Learning Algorithm for Energy Harvesting Sensor Networks
In this paper, a distributed reinforcement-learning (RL) algorithm is proposed for power control and data scheduling in energy-harvesting (EH) multi-hop wireless-sensor-networks (WSNs). The WSN consists of M EH sensor nodes aiming to transmit their data to a sink node with minimum delay. Each sensor node has a battery of limited capacity to save the harvested energy and a buffer of limited size to store both the sensed and relayed data from neighboring nodes. A state-action-reward-state-action (SARSA) based distributed algorithm is proposed. The proposed distributed-SARSA (D-SARSA) algorithm adaptively changes the transmitted data and power control at each sensor node according to the state information such that the data of all sensor nodes are received at the sink node with minimum delay. Simulation results demonstrate the merits of the proposed algorithm.