An incremental fuzzy controller for large dec-POMDPs

2017 Artificial Intelligence and Signal Processing Conference (AISP) Pub Date : 2017-10-01 DOI:10.1109/AISP.2017.8324075

S. Hamzeloo, M. Z. Jahromi

引用次数: 3

Abstract

This paper proposes an incremental fuzzy controller to find a sub-optimal policy for large multi-agent systems modeled as DEC-POMDPs. This algorithm employs a compact fuzzy model to overcome the high computational complexity. In our method, each agent uses an individual fuzzy decision maker to interact with the environment. An incremental method is utilized to tune the rule-base of each agent. Reinforcement learning is used to tune the behavior of the agents to achieved maximum global reward. Moreover, we propose an elegant way to create initial rule-base according to the solution of the underlying MDP to increase the performance of the algorithm. We evaluate our proposed approach on several standard benchmark problems and compare it to the state-of-the-art methods. Experimental results show that the proposed incremental fuzzy method can achieve better results compared to the previous methods. Using compact fuzzy rule-base not only decreases the amount of memory used but also significantly speeds up the learning phase and improves interpretability.

查看原文本刊更多论文

大型deco - pomdp的增量模糊控制器

针对大型多智能体系统(deco - pomdps)的次优策略问题，提出了一种增量模糊控制器。该算法采用了一种紧凑的模糊模型，克服了较高的计算复杂度。在我们的方法中，每个智能体使用一个单独的模糊决策者与环境进行交互。采用增量方法对每个代理的规则库进行调优。强化学习用于调整智能体的行为以获得最大的全局奖励。此外，我们还提出了一种根据底层MDP的解创建初始规则库的优雅方法，以提高算法的性能。我们在几个标准基准问题上评估了我们提出的方法，并将其与最先进的方法进行了比较。实验结果表明，与以往的方法相比，所提出的增量模糊方法可以取得更好的效果。使用紧凑模糊规则库不仅减少了内存的使用，而且显著加快了学习阶段，提高了可解释性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 Artificial Intelligence and Signal Processing Conference (AISP)

自引率

0.00%

发文量