An incremental fuzzy controller for large dec-POMDPs

S. Hamzeloo, M. Z. Jahromi
{"title":"An incremental fuzzy controller for large dec-POMDPs","authors":"S. Hamzeloo, M. Z. Jahromi","doi":"10.1109/AISP.2017.8324075","DOIUrl":null,"url":null,"abstract":"This paper proposes an incremental fuzzy controller to find a sub-optimal policy for large multi-agent systems modeled as DEC-POMDPs. This algorithm employs a compact fuzzy model to overcome the high computational complexity. In our method, each agent uses an individual fuzzy decision maker to interact with the environment. An incremental method is utilized to tune the rule-base of each agent. Reinforcement learning is used to tune the behavior of the agents to achieved maximum global reward. Moreover, we propose an elegant way to create initial rule-base according to the solution of the underlying MDP to increase the performance of the algorithm. We evaluate our proposed approach on several standard benchmark problems and compare it to the state-of-the-art methods. Experimental results show that the proposed incremental fuzzy method can achieve better results compared to the previous methods. Using compact fuzzy rule-base not only decreases the amount of memory used but also significantly speeds up the learning phase and improves interpretability.","PeriodicalId":386952,"journal":{"name":"2017 Artificial Intelligence and Signal Processing Conference (AISP)","volume":"351 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Artificial Intelligence and Signal Processing Conference (AISP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AISP.2017.8324075","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

This paper proposes an incremental fuzzy controller to find a sub-optimal policy for large multi-agent systems modeled as DEC-POMDPs. This algorithm employs a compact fuzzy model to overcome the high computational complexity. In our method, each agent uses an individual fuzzy decision maker to interact with the environment. An incremental method is utilized to tune the rule-base of each agent. Reinforcement learning is used to tune the behavior of the agents to achieved maximum global reward. Moreover, we propose an elegant way to create initial rule-base according to the solution of the underlying MDP to increase the performance of the algorithm. We evaluate our proposed approach on several standard benchmark problems and compare it to the state-of-the-art methods. Experimental results show that the proposed incremental fuzzy method can achieve better results compared to the previous methods. Using compact fuzzy rule-base not only decreases the amount of memory used but also significantly speeds up the learning phase and improves interpretability.
大型deco - pomdp的增量模糊控制器
针对大型多智能体系统(deco - pomdps)的次优策略问题,提出了一种增量模糊控制器。该算法采用了一种紧凑的模糊模型,克服了较高的计算复杂度。在我们的方法中,每个智能体使用一个单独的模糊决策者与环境进行交互。采用增量方法对每个代理的规则库进行调优。强化学习用于调整智能体的行为以获得最大的全局奖励。此外,我们还提出了一种根据底层MDP的解创建初始规则库的优雅方法,以提高算法的性能。我们在几个标准基准问题上评估了我们提出的方法,并将其与最先进的方法进行了比较。实验结果表明,与以往的方法相比,所提出的增量模糊方法可以取得更好的效果。使用紧凑模糊规则库不仅减少了内存的使用,而且显著加快了学习阶段,提高了可解释性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信