Cooperative co-learning: a model-based approach for solving multi-agent reinforcement problems

14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings. Pub Date : 2002-11-04 DOI:10.1109/TAI.2002.1180839

B. Scherrer, F. Charpillet

引用次数: 13

Abstract

Solving multiagent reinforcement learning problems is a key issue. Indeed, the complexity of deriving multiagent plans, especially when one uses an explicit model of the problem, is dramatically increasing with the number of agents. This papers introduces a general iterative heuristic: at each step one chooses a sub-group of agents and update their policies to optimize the task given the rest of agents have fixed plans. We analyse this process in a general purpose and show how it can be applied to Markov decision processes, partially observable Markov decision processes and decentralized partially observable Markov decision processes.

查看原文本刊更多论文

合作共同学习:一种基于模型的多智能体强化问题解决方法

解决多智能体强化学习问题是一个关键问题。事实上，推导多智能体计划的复杂性，特别是当一个人使用一个明确的问题模型时，随着智能体的数量急剧增加。本文介绍了一种通用迭代启发式算法:在给定其他智能体有固定计划的情况下，每一步选择一个子智能体组并更新其策略以优化任务。我们分析了这一过程的一般目的，并展示了如何将其应用于马尔可夫决策过程、部分可观察马尔可夫决策过程和分散的部分可观察马尔可夫决策过程。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings.

自引率

0.00%

发文量