Indirect Reinforcement Learning for Autonomous Power Configuration and Control in Wireless Networks

2009 NASA/ESA Conference on Adaptive Hardware and Systems Pub Date : 1900-01-01 DOI:10.1109/AHS.2009.51

A. Udenze, K. Mcdonald-Maier

引用次数: 2

Abstract

In this paper, non deterministic Indirect Reinforcement Learning (RL) techniques for controlling the transmission times and power of Wireless Network nodes are presented. Indirect RL facilitates planning and learning which ultimately leads to convergence on optimal actions with reduced episodes or time steps compared to direct RL. Three Dyna architecture based algorithms for non deterministic environments are presented. The results show improvements over direct RL and conventional static power control techniques.

查看原文本刊更多论文

无线网络中自主功率配置与控制的间接强化学习

本文提出了一种控制无线网络节点传输时间和功率的非确定性间接强化学习(RL)技术。与直接强化学习相比，间接强化学习促进了计划和学习，最终导致最优行为的收敛，减少了情节或时间步长。针对非确定性环境，提出了三种基于Dyna体系结构的算法。结果表明，与直接RL和传统的静态功率控制技术相比，该方法有所改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 NASA/ESA Conference on Adaptive Hardware and Systems

自引率

0.00%

发文量