Rongxin Cui, Chenguang Yang, Y. Li, Sanjay K. Sharma
{"title":"Neural network based reinforcement learning control of autonomous underwater vehicles with control input saturation","authors":"Rongxin Cui, Chenguang Yang, Y. Li, Sanjay K. Sharma","doi":"10.1109/CONTROL.2014.6915114","DOIUrl":null,"url":null,"abstract":"In this paper, the trajectory tracking control of the autonomous underwater vehicle (AUV) has been investigated in discrete time, for ease of digital computer calculation. A reinforcement learning scheme is employed using two neural networks, whereas the first one is to compensate for uncertainties for the controller, and the second one is to estimate the evaluation function, such that optimal tracking performance could be achieve for the AUV. Simulation results show that the errors convergence to a adjustable neighborhood around zero, and optimization has been achieved in the sense of reinforcement learning.","PeriodicalId":269044,"journal":{"name":"2014 UKACC International Conference on Control (CONTROL)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 UKACC International Conference on Control (CONTROL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONTROL.2014.6915114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18
Abstract
In this paper, the trajectory tracking control of the autonomous underwater vehicle (AUV) has been investigated in discrete time, for ease of digital computer calculation. A reinforcement learning scheme is employed using two neural networks, whereas the first one is to compensate for uncertainties for the controller, and the second one is to estimate the evaluation function, such that optimal tracking performance could be achieve for the AUV. Simulation results show that the errors convergence to a adjustable neighborhood around zero, and optimization has been achieved in the sense of reinforcement learning.