Cooperative Path Planning for Multiplayer Reach-Avoid Games under Imperfect Observation Information

IF 6.8 Q1 AUTOMATION & CONTROL SYSTEMS
Hongwei Fang, Yue Chen, Peng Yi
{"title":"Cooperative Path Planning for Multiplayer Reach-Avoid Games under Imperfect Observation Information","authors":"Hongwei Fang,&nbsp;Yue Chen,&nbsp;Peng Yi","doi":"10.1002/aisy.202300794","DOIUrl":null,"url":null,"abstract":"<p>This article investigates a reach-avoid game and proposes a cooperative path planning algorithm for a target–pursuers (TP) coalition to capture an evader. In the game, the target aims to bait and escape from the evader, and the pursuer aims to capture the evader. Due to imperfect observations, the TP coalition has uncertain information of the evader's state, while the evader is assumed to have perfect observation. The game model is constructed by formulating the optimization problems for each player in a receding horizon fashion. Then, to counter the evader effectively, the TP coalition constructs a virtual evader using the belief information from a Kalman filter. And a chance constraint optimization problem is constructed to predict the virtual evader's trajectory under uncertainties. The TP coalition can capture the actual evader by generating a robust counter-strategy against the virtual evader with a chance constraint feasible set. Next, to compute the Nash equilibrium of the TP coalition's subjective game, an iterative algorithm is designed that combines the iterative best response and the distributed alternating direction method of multiplier algorithms. Finally, the effectiveness of the algorithm is validated through simulations and experiments.</p>","PeriodicalId":93858,"journal":{"name":"Advanced intelligent systems (Weinheim an der Bergstrasse, Germany)","volume":"6 9","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aisy.202300794","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced intelligent systems (Weinheim an der Bergstrasse, Germany)","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/aisy.202300794","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

This article investigates a reach-avoid game and proposes a cooperative path planning algorithm for a target–pursuers (TP) coalition to capture an evader. In the game, the target aims to bait and escape from the evader, and the pursuer aims to capture the evader. Due to imperfect observations, the TP coalition has uncertain information of the evader's state, while the evader is assumed to have perfect observation. The game model is constructed by formulating the optimization problems for each player in a receding horizon fashion. Then, to counter the evader effectively, the TP coalition constructs a virtual evader using the belief information from a Kalman filter. And a chance constraint optimization problem is constructed to predict the virtual evader's trajectory under uncertainties. The TP coalition can capture the actual evader by generating a robust counter-strategy against the virtual evader with a chance constraint feasible set. Next, to compute the Nash equilibrium of the TP coalition's subjective game, an iterative algorithm is designed that combines the iterative best response and the distributed alternating direction method of multiplier algorithms. Finally, the effectiveness of the algorithm is validated through simulations and experiments.

Abstract Image

不完全观测信息下的多人到达-避免游戏的合作路径规划
本文研究了一种 "到达-回避 "博弈,并提出了一种目标-追击者(TP)联盟捕捉回避者的合作路径规划算法。在博弈中,目标的目的是诱捕并逃离逃逸者,追捕者的目的是捕获逃逸者。由于观测不完全,TP 联盟对逃逸者的状态信息不确定,而逃逸者被假定为观测完全。博弈模型是通过以后退视界方式为每个博弈方提出优化问题而构建的。然后,为了有效对抗逃避者,TP 联盟利用卡尔曼滤波器的信念信息构建了一个虚拟逃避者。并构建一个机会约束优化问题,以预测虚拟逃避者在不确定情况下的轨迹。TP 联盟可以通过机会约束可行集生成一个针对虚拟逃避者的稳健反策略,从而捕获实际逃避者。接下来,为了计算 TP 联盟主观博弈的纳什均衡,设计了一种迭代算法,该算法结合了乘法算法的迭代最佳响应和分布式交替方向法。最后,通过模拟和实验验证了算法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.30
自引率
0.00%
发文量
0
审稿时长
4 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信