Xinfeng Xu , Chun Liu , Liang Xu , Qiang Wang , Yizhen Meng
{"title":"Integral reinforcement learning-based optimal pursuit-evasion control for multi-QUAVs: A zero-sum game approach","authors":"Xinfeng Xu , Chun Liu , Liang Xu , Qiang Wang , Yizhen Meng","doi":"10.1016/j.jfranklin.2025.107685","DOIUrl":null,"url":null,"abstract":"<div><div>This paper investigates the optimal pursuit-evasion control (PEC) under zero-sum game for multiple quadrotor unmanned aerial vehicles (multi-QUAVs) with unknown dynamics, aiming to capture an evader QUAV (EQUAV). First, the pursuit-evasion error dynamics are constructed based on multi-pursuer QUAVs (multi-PQUAVs) and an EQUAV. Second, within the framework of a zero-sum game, adversarial strategies are designed for both the multi-PQUAVs and the EQUAV. By minimizing the cost function, the multi-PQUAVs aim to minimize the pursuit-evasion error, while the EQUAV seeks to maximize pursuit-evasion error. Third, an actor-critic neural network (NN) based on integral reinforcement learning (IRL) is developed to optimize the adversarial strategies of both the multi-PQUAVs and the EQUAV while updating their strategies toward the optimal approximate solution. The stability analysis demonstrates that the pursuit-evasion error, critic NN weight error, PQUAVs’ actor NN weight error, and EQUAV’s actor NN weight error are uniformly ultimately bounded. Finally, simulations validate the effectiveness and adaptability of the IRL-based optimal PEC algorithm under zero-sum game.</div></div>","PeriodicalId":17283,"journal":{"name":"Journal of The Franklin Institute-engineering and Applied Mathematics","volume":"362 8","pages":"Article 107685"},"PeriodicalIF":3.7000,"publicationDate":"2025-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The Franklin Institute-engineering and Applied Mathematics","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016003225001784","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper investigates the optimal pursuit-evasion control (PEC) under zero-sum game for multiple quadrotor unmanned aerial vehicles (multi-QUAVs) with unknown dynamics, aiming to capture an evader QUAV (EQUAV). First, the pursuit-evasion error dynamics are constructed based on multi-pursuer QUAVs (multi-PQUAVs) and an EQUAV. Second, within the framework of a zero-sum game, adversarial strategies are designed for both the multi-PQUAVs and the EQUAV. By minimizing the cost function, the multi-PQUAVs aim to minimize the pursuit-evasion error, while the EQUAV seeks to maximize pursuit-evasion error. Third, an actor-critic neural network (NN) based on integral reinforcement learning (IRL) is developed to optimize the adversarial strategies of both the multi-PQUAVs and the EQUAV while updating their strategies toward the optimal approximate solution. The stability analysis demonstrates that the pursuit-evasion error, critic NN weight error, PQUAVs’ actor NN weight error, and EQUAV’s actor NN weight error are uniformly ultimately bounded. Finally, simulations validate the effectiveness and adaptability of the IRL-based optimal PEC algorithm under zero-sum game.
期刊介绍:
The Journal of The Franklin Institute has an established reputation for publishing high-quality papers in the field of engineering and applied mathematics. Its current focus is on control systems, complex networks and dynamic systems, signal processing and communications and their applications. All submitted papers are peer-reviewed. The Journal will publish original research papers and research review papers of substance. Papers and special focus issues are judged upon possible lasting value, which has been and continues to be the strength of the Journal of The Franklin Institute.