{"title":"正在进行的工作:基于强化学习的两个瞬时供电传感器之间的协作通信","authors":"Yawen Wu, Zhenge Jia, Fei Fang, J. Hu","doi":"10.1145/3349567.3351723","DOIUrl":null,"url":null,"abstract":"The transmission between two energy harvesting (EH) powered sensors is successful only when both sensors have enough energy at the same time. Given the scarce, unpredictable, and unevenly distributed energy between two sensors, it is challenging to ensure efficient data transmission. We propose a sensor node architecture with multiple wake-up radios, each with a different ratio of energy consumption on the transmitter and receiver. Two sensors cooperatively select wake-up radios to maximize data throughput. The communication procedure is modeled as a cooperative Markov game with partial observability and multi-agent reinforcement learning (MARL) is employed to maximize the throughput. The proposed methods achieve near-optimal data throughput.","PeriodicalId":194982,"journal":{"name":"2019 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Work-in-Progress: Cooperative Communication Between Two Transiently Powered Sensors by Reinforcement Learning\",\"authors\":\"Yawen Wu, Zhenge Jia, Fei Fang, J. Hu\",\"doi\":\"10.1145/3349567.3351723\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The transmission between two energy harvesting (EH) powered sensors is successful only when both sensors have enough energy at the same time. Given the scarce, unpredictable, and unevenly distributed energy between two sensors, it is challenging to ensure efficient data transmission. We propose a sensor node architecture with multiple wake-up radios, each with a different ratio of energy consumption on the transmitter and receiver. Two sensors cooperatively select wake-up radios to maximize data throughput. The communication procedure is modeled as a cooperative Markov game with partial observability and multi-agent reinforcement learning (MARL) is employed to maximize the throughput. The proposed methods achieve near-optimal data throughput.\",\"PeriodicalId\":194982,\"journal\":{\"name\":\"2019 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3349567.3351723\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3349567.3351723","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Work-in-Progress: Cooperative Communication Between Two Transiently Powered Sensors by Reinforcement Learning
The transmission between two energy harvesting (EH) powered sensors is successful only when both sensors have enough energy at the same time. Given the scarce, unpredictable, and unevenly distributed energy between two sensors, it is challenging to ensure efficient data transmission. We propose a sensor node architecture with multiple wake-up radios, each with a different ratio of energy consumption on the transmitter and receiver. Two sensors cooperatively select wake-up radios to maximize data throughput. The communication procedure is modeled as a cooperative Markov game with partial observability and multi-agent reinforcement learning (MARL) is employed to maximize the throughput. The proposed methods achieve near-optimal data throughput.