An Investigation on Multi-UAVs Cooperative Control Algorithm for Target Chasing

Haopeng Li, Guoqing Qi, Zhifeng Jin, Yinya Li, A. Sheng
{"title":"An Investigation on Multi-UAVs Cooperative Control Algorithm for Target Chasing","authors":"Haopeng Li, Guoqing Qi, Zhifeng Jin, Yinya Li, A. Sheng","doi":"10.1109/ICCSI55536.2022.9970622","DOIUrl":null,"url":null,"abstract":"Aiming at the problem of collective pursuit of Multiple Unmanned Aerial Vehicles (multi-UAVs) against non-cooperative UAVs, a directional chase strategy based on the Multi-Agent Deep Deterministic Policy Gradient(MADDPG) algorithm is designed by using deep reinforcement learning theory. By designing the algorithm model, state variables, actoin variables and reward function, the UAVs are trained to learn the directional chase strategy on a distributed Actor, centralized Critic structure. It is verified by simulation that the proposed chase strategy has higher learning efficiency while ensuring accuracy than the strategy with distributed training using the Deep Deterministic Policy Gradient (DDPG) algorithm. It also has a higher capture efficiency than the conventional pursuit-only strategy, provides a new research idea for multi- UAV confrontation.","PeriodicalId":421514,"journal":{"name":"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Cyber-Physical Social Intelligence (ICCSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSI55536.2022.9970622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Aiming at the problem of collective pursuit of Multiple Unmanned Aerial Vehicles (multi-UAVs) against non-cooperative UAVs, a directional chase strategy based on the Multi-Agent Deep Deterministic Policy Gradient(MADDPG) algorithm is designed by using deep reinforcement learning theory. By designing the algorithm model, state variables, actoin variables and reward function, the UAVs are trained to learn the directional chase strategy on a distributed Actor, centralized Critic structure. It is verified by simulation that the proposed chase strategy has higher learning efficiency while ensuring accuracy than the strategy with distributed training using the Deep Deterministic Policy Gradient (DDPG) algorithm. It also has a higher capture efficiency than the conventional pursuit-only strategy, provides a new research idea for multi- UAV confrontation.
多无人机目标跟踪协同控制算法研究
针对多无人机对非合作无人机的集体追击问题,利用深度强化学习理论,设计了一种基于多智能体深度确定性策略梯度(madpg)算法的定向追击策略。通过设计算法模型、状态变量、动作变量和奖励函数,训练无人机在分布式Actor、集中式Critic结构上学习定向追击策略。仿真结果表明,与采用深度确定性策略梯度(Deep Deterministic Policy Gradient, DDPG)算法进行分布式训练的追赶策略相比,所提出的追赶策略在保证准确率的同时具有更高的学习效率。与传统的单追击策略相比,该策略具有更高的捕获效率,为多无人机对抗提供了新的研究思路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信