{"title":"无线网络中自主功率配置与控制的间接强化学习","authors":"A. Udenze, K. Mcdonald-Maier","doi":"10.1109/AHS.2009.51","DOIUrl":null,"url":null,"abstract":"In this paper, non deterministic Indirect Reinforcement Learning (RL) techniques for controlling the transmission times and power of Wireless Network nodes are presented. Indirect RL facilitates planning and learning which ultimately leads to convergence on optimal actions with reduced episodes or time steps compared to direct RL. Three Dyna architecture based algorithms for non deterministic environments are presented. The results show improvements over direct RL and conventional static power control techniques.","PeriodicalId":318989,"journal":{"name":"2009 NASA/ESA Conference on Adaptive Hardware and Systems","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Indirect Reinforcement Learning for Autonomous Power Configuration and Control in Wireless Networks\",\"authors\":\"A. Udenze, K. Mcdonald-Maier\",\"doi\":\"10.1109/AHS.2009.51\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, non deterministic Indirect Reinforcement Learning (RL) techniques for controlling the transmission times and power of Wireless Network nodes are presented. Indirect RL facilitates planning and learning which ultimately leads to convergence on optimal actions with reduced episodes or time steps compared to direct RL. Three Dyna architecture based algorithms for non deterministic environments are presented. The results show improvements over direct RL and conventional static power control techniques.\",\"PeriodicalId\":318989,\"journal\":{\"name\":\"2009 NASA/ESA Conference on Adaptive Hardware and Systems\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 NASA/ESA Conference on Adaptive Hardware and Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AHS.2009.51\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 NASA/ESA Conference on Adaptive Hardware and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AHS.2009.51","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Indirect Reinforcement Learning for Autonomous Power Configuration and Control in Wireless Networks
In this paper, non-deterministic indirect Reinforcement Learning (RL) techniques for controlling the transmission times and power of wireless network nodes are presented. Indirect RL facilitates planning as well as learning, which ultimately leads to convergence on optimal actions in fewer episodes or time steps than direct RL. Three Dyna-architecture-based algorithms for non-deterministic environments are presented. The results show improvements over direct RL and conventional static power control techniques.
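For readers unfamiliar with the Dyna architecture, the sketch below illustrates tabular Dyna-Q, the canonical indirect-RL scheme on which such algorithms build: each real transition updates the value estimates directly and also updates a learned model, which is then replayed for several planning steps. This is not the paper's method; the environment interface (`env.reset()`, `env.step()`), the deterministic one-step model, and all parameter values are illustrative assumptions, whereas the paper's three algorithms target non-deterministic environments and transmission-time/power actions.

```python
# Minimal tabular Dyna-Q sketch (assumptions: env exposes reset() -> state and
# step(action) -> (next_state, reward, done); states and actions are discrete).
import random
from collections import defaultdict

def dyna_q(env, n_actions, episodes=200,
           alpha=0.1, gamma=0.95, epsilon=0.1, planning_steps=10):
    Q = defaultdict(float)    # Q[(state, action)] -> value estimate
    model = {}                # model[(state, action)] -> (reward, next_state)

    def greedy(s):
        return max(range(n_actions), key=lambda a: Q[(s, a)])

    for _ in range(episodes):
        s = env.reset()
        done = False
        while not done:
            # epsilon-greedy action selection
            a = random.randrange(n_actions) if random.random() < epsilon else greedy(s)
            s2, r, done = env.step(a)

            # direct RL: Q-learning update from the real transition
            Q[(s, a)] += alpha * (
                r + gamma * max(Q[(s2, b)] for b in range(n_actions)) - Q[(s, a)]
            )

            # model learning (deterministic model kept for brevity; a
            # non-deterministic variant would store a distribution instead)
            model[(s, a)] = (r, s2)

            # planning: replay simulated transitions drawn from the model
            for _ in range(planning_steps):
                (ps, pa), (pr, ps2) = random.choice(list(model.items()))
                Q[(ps, pa)] += alpha * (
                    pr + gamma * max(Q[(ps2, b)] for b in range(n_actions)) - Q[(ps, pa)]
                )

            s = s2
    return Q
```

The planning loop is what distinguishes indirect from direct RL: the same update rule is applied to model-generated transitions, so each unit of real experience is reused many times, which is the source of the reduced episode count the abstract refers to.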