{"title":"Direct Reinforcement Learning for Autonomous Power Configuration and Control in Wireless Networks","authors":"A. Udenze, K. Mcdonald-Maier","doi":"10.1109/AHS.2009.50","DOIUrl":null,"url":null,"abstract":"In this paper, non deterministic Direct Reinforcement Learning (RL) for controlling the transmission times and power of a Wireless Sensor Network (WSN) node is presented. RL allows for truly autonomous optimal behaviour of agents by requiring no models or supervision to learn. Optimal actions are learnt by repeated interactions with the environment. Performance results are presented for Monte Carlo, TD0 and TDλ. The resultant optimal learned policies are shown to out perform static power control in a stochastic environment.","PeriodicalId":318989,"journal":{"name":"2009 NASA/ESA Conference on Adaptive Hardware and Systems","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 NASA/ESA Conference on Adaptive Hardware and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AHS.2009.50","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
In this paper, non deterministic Direct Reinforcement Learning (RL) for controlling the transmission times and power of a Wireless Sensor Network (WSN) node is presented. RL allows for truly autonomous optimal behaviour of agents by requiring no models or supervision to learn. Optimal actions are learnt by repeated interactions with the environment. Performance results are presented for Monte Carlo, TD0 and TDλ. The resultant optimal learned policies are shown to out perform static power control in a stochastic environment.