Multi-agent Target Defense Game with Learned Defender to Attacker Assignment

Amith Manoharan, Prajwal Thakur, Ashutosh Kumar Singh
{"title":"Multi-agent Target Defense Game with Learned Defender to Attacker Assignment","authors":"Amith Manoharan, Prajwal Thakur, Ashutosh Kumar Singh","doi":"10.1109/ICUAS57906.2023.10156083","DOIUrl":null,"url":null,"abstract":"This paper considers a variant of pursuit-evasion games where multiple attacker unmanned aerial vehicles (UAVs) are trying to converge on a target. The goal is to use a set of defender UAVs to save the target by ensuring they converge on the attackers before the latter converge to the target. The core challenge lies in appropriately assigning a particular defender to an attacker. The simple heuristic assignment based on Euclidean distance between the attacker and defender performs poorly. This paper presents a data-driven solution assuming that the attacker uses a known optimal control policy. We show how massive offline simulations can be leveraged to predict the optimal cost/value function incurred by the defender to converge on an attacker for a given target trajectory. We use this optimal cost/value function as a true measure of separation between an attacker and a defender. We use it as the guiding heuristic in the Hungarian algorithm for computing defender-attacker assignments. We perform extensive simulations to validate our approach wherein we couple the learned assignment with a non-linear model predictive controller to perform realistic simulations. We show that our assignment approach outperforms that based on the Euclidean heuristic in terms of the number of successful attempts by the defenders.","PeriodicalId":379073,"journal":{"name":"2023 International Conference on Unmanned Aircraft Systems (ICUAS)","volume":"53 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Unmanned Aircraft Systems (ICUAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICUAS57906.2023.10156083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper considers a variant of pursuit-evasion games where multiple attacker unmanned aerial vehicles (UAVs) are trying to converge on a target. The goal is to use a set of defender UAVs to save the target by ensuring they converge on the attackers before the latter converge to the target. The core challenge lies in appropriately assigning a particular defender to an attacker. The simple heuristic assignment based on Euclidean distance between the attacker and defender performs poorly. This paper presents a data-driven solution assuming that the attacker uses a known optimal control policy. We show how massive offline simulations can be leveraged to predict the optimal cost/value function incurred by the defender to converge on an attacker for a given target trajectory. We use this optimal cost/value function as a true measure of separation between an attacker and a defender. We use it as the guiding heuristic in the Hungarian algorithm for computing defender-attacker assignments. We perform extensive simulations to validate our approach wherein we couple the learned assignment with a non-linear model predictive controller to perform realistic simulations. We show that our assignment approach outperforms that based on the Euclidean heuristic in terms of the number of successful attempts by the defenders.
具有学习防御者对攻击者分配的多智能体目标防御博弈
本文研究了一种多攻击无人机试图向目标收敛的追逃博弈。目标是使用一组防御无人机,通过确保它们在攻击者收敛到目标之前收敛到目标上来拯救目标。核心挑战在于适当地为攻击者分配特定的防御者。基于攻击者和防御者之间欧氏距离的简单启发式分配方法性能较差。本文提出了一种数据驱动的解决方案,假设攻击者使用已知的最优控制策略。我们展示了如何利用大规模离线模拟来预测防御者在给定目标轨迹下收敛于攻击者所产生的最优成本/价值函数。我们使用这个最优成本/价值函数作为区分攻击者和防御者的真正度量。我们将其作为匈牙利算法中用于计算防御者-攻击者分配的指导启发式。我们进行了大量的仿真来验证我们的方法,其中我们将学习分配与非线性模型预测控制器相结合以进行真实的仿真。我们表明,就防守者成功尝试的次数而言,我们的分配方法优于基于欧几里得启发式的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信