Fuzzy actor-critic learning automaton algorithm for the pursuit-evasion differential game

2017 International Automatic Control Conference (CACS) Pub Date : 2017-11-01 DOI:10.1109/CACS.2017.8284278

Ahmad A. Al-Talabi

引用次数: 0

Abstract

This paper presents an efficient learning algorithm to autonomously tune the parameters of a fuzzy logic controller (FLC) of a mobile robot playing a pursuit-evasion (PE) differential game. The proposed algorithm is a modified version of the fuzzy-actor critic learning (FACL) algorithm, in which both the critic and the actor employ a fuzzy inference systems (FIS). It uses the continuous actor-critic learning Automaton (CACLA) algorithm to tune the parameters of the FIS. It is called fuzzy actor-critic learning Automaton (FACLA) algorithm. FACLA is applied to two versions of the PE games. The first version considers that the pursuer interacts with the evader and will learn its default control strategy and the evader has a fixed strategy. The second version assumes both the pursuer and the evader are learning their default strategies. FACLA is compared through simulation with the FACL, and the PSO-based FLC+QFIS algorithms. Simulation results demonstrate that the performance of FACLA quantified by the learning time outperforms that of the FACL and PSO-based FLC+QFIS algorithms.

查看原文本刊更多论文

追求-逃避微分对策的模糊演员-评论家学习自动机算法

本文提出了一种有效的学习算法，用于移动机器人追逃微分博弈的模糊控制器参数的自动整定。提出的算法是模糊行为者批评学习(FACL)算法的改进版本，其中评论家和行为者都使用模糊推理系统(FIS)。它使用连续演员-评论家学习自动机(CACLA)算法来调整FIS的参数。它被称为模糊演员-评论家学习自动机(FACLA)算法。FACLA适用于两个版本的体育游戏。第一个版本认为追赶者与逃避者相互作用，并将学习其默认的控制策略，而逃避者有一个固定的策略。第二个版本假设追求者和逃避者都在学习他们的默认策略。通过仿真将FACLA算法与FACL算法以及基于pso的FLC+QFIS算法进行了比较。仿真结果表明，以学习时间量化的FACLA算法的性能优于基于FACL和基于pso的FLC+QFIS算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 International Automatic Control Conference (CACS)

自引率

0.00%

发文量