Reinforcement learning in swarm-robotics for multi-agent foraging-task domain

2013 IEEE Symposium on Swarm Intelligence (SIS) Pub Date : 2013-04-16 DOI:10.1109/SIS.2013.6615154

Y. M, P. S G, Kanagaraj G

引用次数: 9

Abstract

The main focus of this paper is to study and develop an efficient learning policy to address the exploration vs. exploitation dilemma in a multi-objective foraging task in swarm robotics domain. An efficient learning policy called FIFO-list is proposed to tackle the above mentioned problem. The proposed FIFO-list is a model-based learning policy which can attain near-optimal solutions. In FIFO-list, the swarm robots maintains a dynamic list of recently visited states. States that are included in the list are banned from exploration by the swarm robots regardless of the Q(s, a) values associated with those states. The FIFO list is updated based on First-In-First-Out (FIFO) rule, meaning the states that enters the list first will exit the list first. The recently visited states will remain in the list for a dynamic number of time-steps which is determined by the desirability of the visited states.

查看原文本刊更多论文

多智能体觅食任务领域的群体机器人强化学习

本文的重点是研究和开发一种有效的学习策略，以解决群体机器人领域中多目标觅食任务中的探索与利用困境。针对上述问题，提出了一种高效的FIFO-list学习策略。所提出的fifo列表是一种基于模型的学习策略，可以获得近似最优解。在FIFO-list中，群体机器人保持最近访问状态的动态列表。无论与这些状态相关的Q(s, a)值如何，被列入列表的状态都被禁止进行群机器人的探索。FIFO列表是根据先进先出(FIFO)规则更新的，这意味着首先进入列表的状态将首先退出列表。最近访问过的国家将在一段动态时间内保留在名单中，这取决于访问过的国家的意愿。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 IEEE Symposium on Swarm Intelligence (SIS)

自引率

0.00%

发文量