Multi-agent Target Defense Game with Learned Defender to Attacker Assignment

2023 International Conference on Unmanned Aircraft Systems (ICUAS) Pub Date : 2023-06-06 DOI:10.1109/ICUAS57906.2023.10156083

Amith Manoharan, Prajwal Thakur, Ashutosh Kumar Singh

{"title":"Multi-agent Target Defense Game with Learned Defender to Attacker Assignment","authors":"Amith Manoharan, Prajwal Thakur, Ashutosh Kumar Singh","doi":"10.1109/ICUAS57906.2023.10156083","DOIUrl":null,"url":null,"abstract":"This paper considers a variant of pursuit-evasion games where multiple attacker unmanned aerial vehicles (UAVs) are trying to converge on a target. The goal is to use a set of defender UAVs to save the target by ensuring they converge on the attackers before the latter converge to the target. The core challenge lies in appropriately assigning a particular defender to an attacker. The simple heuristic assignment based on Euclidean distance between the attacker and defender performs poorly. This paper presents a data-driven solution assuming that the attacker uses a known optimal control policy. We show how massive offline simulations can be leveraged to predict the optimal cost/value function incurred by the defender to converge on an attacker for a given target trajectory. We use this optimal cost/value function as a true measure of separation between an attacker and a defender. We use it as the guiding heuristic in the Hungarian algorithm for computing defender-attacker assignments. We perform extensive simulations to validate our approach wherein we couple the learned assignment with a non-linear model predictive controller to perform realistic simulations. We show that our assignment approach outperforms that based on the Euclidean heuristic in terms of the number of successful attempts by the defenders.","PeriodicalId":379073,"journal":{"name":"2023 International Conference on Unmanned Aircraft Systems (ICUAS)","volume":"53 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Unmanned Aircraft Systems (ICUAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICUAS57906.2023.10156083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This paper considers a variant of pursuit-evasion games where multiple attacker unmanned aerial vehicles (UAVs) are trying to converge on a target. The goal is to use a set of defender UAVs to save the target by ensuring they converge on the attackers before the latter converge to the target. The core challenge lies in appropriately assigning a particular defender to an attacker. The simple heuristic assignment based on Euclidean distance between the attacker and defender performs poorly. This paper presents a data-driven solution assuming that the attacker uses a known optimal control policy. We show how massive offline simulations can be leveraged to predict the optimal cost/value function incurred by the defender to converge on an attacker for a given target trajectory. We use this optimal cost/value function as a true measure of separation between an attacker and a defender. We use it as the guiding heuristic in the Hungarian algorithm for computing defender-attacker assignments. We perform extensive simulations to validate our approach wherein we couple the learned assignment with a non-linear model predictive controller to perform realistic simulations. We show that our assignment approach outperforms that based on the Euclidean heuristic in terms of the number of successful attempts by the defenders.

查看原文本刊更多论文

具有学习防御者对攻击者分配的多智能体目标防御博弈

本文研究了一种多攻击无人机试图向目标收敛的追逃博弈。目标是使用一组防御无人机，通过确保它们在攻击者收敛到目标之前收敛到目标上来拯救目标。核心挑战在于适当地为攻击者分配特定的防御者。基于攻击者和防御者之间欧氏距离的简单启发式分配方法性能较差。本文提出了一种数据驱动的解决方案，假设攻击者使用已知的最优控制策略。我们展示了如何利用大规模离线模拟来预测防御者在给定目标轨迹下收敛于攻击者所产生的最优成本/价值函数。我们使用这个最优成本/价值函数作为区分攻击者和防御者的真正度量。我们将其作为匈牙利算法中用于计算防御者-攻击者分配的指导启发式。我们进行了大量的仿真来验证我们的方法，其中我们将学习分配与非线性模型预测控制器相结合以进行真实的仿真。我们表明，就防守者成功尝试的次数而言，我们的分配方法优于基于欧几里得启发式的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 International Conference on Unmanned Aircraft Systems (ICUAS)

自引率

0.00%

发文量