{"title":"Independent Soft Actor-Critic Deep Reinforcement Learning for UAV Cooperative Air Combat Maneuvering Decision-Making","authors":"Haolin Li, Delin Luo, Haibin Duan","doi":"10.1002/rob.22538","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This paper delves into the research of collaborative combat strategies for multiple unmanned combat aerial vehicles (UAVs), utilizing the independent soft Actor-Critic (is-AC) algorithm. We aim to achieve collaborative jamming confrontation, accurate battlefield situational awareness, and UAV decision-making capabilities to control their behavior. However, the SAC algorithm is plagued by instability and poor scalability in Multi-agent reinforcement learning scenarios. To address this, we draw inspiration from the Independent Q-Learning (IQL) algorithm and improve SAC. Our experimental analysis of the is-AC algorithm in UAV confrontation models demonstrates its stability and scalability in multi-machine scenarios.</p>\n </div>","PeriodicalId":192,"journal":{"name":"Journal of Field Robotics","volume":"42 6","pages":"2656-2670"},"PeriodicalIF":5.2000,"publicationDate":"2025-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Field Robotics","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rob.22538","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper delves into the research of collaborative combat strategies for multiple unmanned combat aerial vehicles (UAVs), utilizing the independent soft Actor-Critic (is-AC) algorithm. We aim to achieve collaborative jamming confrontation, accurate battlefield situational awareness, and UAV decision-making capabilities to control their behavior. However, the SAC algorithm is plagued by instability and poor scalability in Multi-agent reinforcement learning scenarios. To address this, we draw inspiration from the Independent Q-Learning (IQL) algorithm and improve SAC. Our experimental analysis of the is-AC algorithm in UAV confrontation models demonstrates its stability and scalability in multi-machine scenarios.
期刊介绍:
The Journal of Field Robotics seeks to promote scholarly publications dealing with the fundamentals of robotics in unstructured and dynamic environments.
The Journal focuses on experimental robotics and encourages publication of work that has both theoretical and practical significance.