A. García, David Barragan-Alcantar, Ivana Collado-Gonzalez, Leonardo Garrido
{"title":"基于自适应动态规划和深度强化学习的无人水面车辆控制","authors":"A. García, David Barragan-Alcantar, Ivana Collado-Gonzalez, Leonardo Garrido","doi":"10.1145/3417188.3417194","DOIUrl":null,"url":null,"abstract":"This paper presents a low-level controller for an unmanned surface vehicle based on Adaptive Dynamic Programming (ADP) and deep reinforcement learning (DRL). The model-based algorithm Back-propagation Through Time and a simulation of the mathematical model of the vessel are implemented to train a deep neural network to drive the surge speed and yaw dynamics. The controller presents successful simulation results validating the feasibility of the proposed strategy and contributes to the diversity of validated applications of ADP and DRL control strategies.","PeriodicalId":373913,"journal":{"name":"Proceedings of the 2020 4th International Conference on Deep Learning Technologies","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Control of an Unmanned Surface Vehicle Based on Adaptive Dynamic Programming and Deep Reinforcement Learning\",\"authors\":\"A. García, David Barragan-Alcantar, Ivana Collado-Gonzalez, Leonardo Garrido\",\"doi\":\"10.1145/3417188.3417194\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a low-level controller for an unmanned surface vehicle based on Adaptive Dynamic Programming (ADP) and deep reinforcement learning (DRL). The model-based algorithm Back-propagation Through Time and a simulation of the mathematical model of the vessel are implemented to train a deep neural network to drive the surge speed and yaw dynamics. The controller presents successful simulation results validating the feasibility of the proposed strategy and contributes to the diversity of validated applications of ADP and DRL control strategies.\",\"PeriodicalId\":373913,\"journal\":{\"name\":\"Proceedings of the 2020 4th International Conference on Deep Learning Technologies\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 4th International Conference on Deep Learning Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3417188.3417194\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 4th International Conference on Deep Learning Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3417188.3417194","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Control of an Unmanned Surface Vehicle Based on Adaptive Dynamic Programming and Deep Reinforcement Learning
This paper presents a low-level controller for an unmanned surface vehicle based on Adaptive Dynamic Programming (ADP) and deep reinforcement learning (DRL). The model-based algorithm Back-propagation Through Time and a simulation of the mathematical model of the vessel are implemented to train a deep neural network to drive the surge speed and yaw dynamics. The controller presents successful simulation results validating the feasibility of the proposed strategy and contributes to the diversity of validated applications of ADP and DRL control strategies.