{"title":"Global critic and local actor for campaign-tactic combat management in the joint operation simulation software","authors":"Yabin Wang, Peng Cui, Youjiang Li","doi":"10.1117/12.2671217","DOIUrl":null,"url":null,"abstract":"Users of the simulation software not only need to model the capability of each unit, but also need to create a decision maker for the units in the simulation software, to control ships, aircrafts and ground units to cooperates to achieve one goal. In this paper a new approach is constructed to create the decision maker. We use reinforcement learning based on global critic and local actor. The invention constructs an air isomorphic formation command method based on multiagent PPO algorithm. The evaluation network uses global information, so that the algorithm has the ability to evaluate global information and guide the agent to select actions that are beneficial to the global environment state. The input of the action network is local information, so that the agent can focus on local information.","PeriodicalId":227528,"journal":{"name":"International Conference on Artificial Intelligence and Computer Engineering (ICAICE 2022)","volume":"36 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Artificial Intelligence and Computer Engineering (ICAICE 2022)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2671217","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Users of the simulation software not only need to model the capability of each unit, but also need to create a decision maker for the units in the simulation software, to control ships, aircrafts and ground units to cooperates to achieve one goal. In this paper a new approach is constructed to create the decision maker. We use reinforcement learning based on global critic and local actor. The invention constructs an air isomorphic formation command method based on multiagent PPO algorithm. The evaluation network uses global information, so that the algorithm has the ability to evaluate global information and guide the agent to select actions that are beneficial to the global environment state. The input of the action network is local information, so that the agent can focus on local information.