Improving RL power for on-line evolution of gaits in modular robots

2016 IEEE Symposium Series on Computational Intelligence (SSCI) Pub Date : 2016-12-06 DOI:10.1109/SSCI.2016.7850166

Milan Jelisavcic, Matteo De Carlo, E. Haasdijk, A. Eiben

引用次数: 8

Abstract

This paper addresses the problem of on-line gait learning in modular robots whose shape is not known in advance. The best algorithm for this problem known to us is a reinforcement learning method, called RL PoWER. In this study we revisit the original RL PoWER algorithm and observe that in essence it is a specific evolutionary algorithm. Based on this insight we propose two modifications of the main search operators and compare the quality of the evolved gaits when either or both of these modified operators are employed. The results show that using 2-parent crossover as well as mutation with self-adaptive step-sizes can significantly improve the performance of the original algorithm.

查看原文本刊更多论文

改进模块化机器人步态在线进化的强化学习功率

研究了形状未知的模块化机器人在线步态学习问题。我们已知的解决这个问题的最佳算法是一种强化学习方法，称为RL PoWER。在本研究中，我们重新审视了原始的RL PoWER算法，并观察到本质上它是一个特定的进化算法。基于这一见解，我们提出了两种主要搜索算子的修改，并比较了当使用这两种修改算子中的一种或两种时进化步态的质量。结果表明，采用双亲交叉和自适应步长突变可以显著提高原算法的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 IEEE Symposium Series on Computational Intelligence (SSCI)

自引率

0.00%

发文量