{"title":"USV Target Interception Control With Reinforcement Learning and Motion Prediction Method","authors":"Y. Liu, Yuanda Wang, Lu Dong","doi":"10.1109/YAC57282.2022.10023694","DOIUrl":null,"url":null,"abstract":"In this paper, an unmanned surface vehicle (USV) target interception problem is studied with reinforcement learning (RL)-based method. In the proposed new structure, the proximal policy optimization (PPO) and proportional derivative (PD) are combined. First, the PD controller is used to predict the interception position. Then, the PPO algorithm is trained to control the USV, so that it can move quickly to the predicted position. By comparing with the traditional PPO algorithm, the simulation results verify that the proposed algorithm spends less time solving the problem of the USV interception of a moving target.","PeriodicalId":272227,"journal":{"name":"2022 37th Youth Academic Annual Conference of Chinese Association of Automation (YAC)","volume":"35 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 37th Youth Academic Annual Conference of Chinese Association of Automation (YAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/YAC57282.2022.10023694","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, an unmanned surface vehicle (USV) target interception problem is studied with reinforcement learning (RL)-based method. In the proposed new structure, the proximal policy optimization (PPO) and proportional derivative (PD) are combined. First, the PD controller is used to predict the interception position. Then, the PPO algorithm is trained to control the USV, so that it can move quickly to the predicted position. By comparing with the traditional PPO algorithm, the simulation results verify that the proposed algorithm spends less time solving the problem of the USV interception of a moving target.