Jin-wen Hu, Man Wang, Congzhe Zhang, Chunhui Zhao, Q. Pan, Xuemei Cheng, Feng Yang, Xiaoxu Wang
{"title":"基于值函数逼近算法的无人驾驶车辆路径规划","authors":"Jin-wen Hu, Man Wang, Congzhe Zhang, Chunhui Zhao, Q. Pan, Xuemei Cheng, Feng Yang, Xiaoxu Wang","doi":"10.1109/ICCA.2019.8899906","DOIUrl":null,"url":null,"abstract":"This paper deals with the path planning problem for unmanned vehicles based on reinforcement learning. Considering the unmanned vehicles’ dynamic model, the neural network is used to approximate the value function. Besides, in order to make it more suitable for practical applications and speed up the learning process, the recursive least squares algorithm is used to eliminate the inverse operation. Then some experiments are implemented to verify the effectiveness of the proposed improved value function approximation algorithm. It is proved to have improved the generalization performance of reinforcement learning in continuous space.","PeriodicalId":130891,"journal":{"name":"2019 IEEE 15th International Conference on Control and Automation (ICCA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Path Planning for Unmanned Vehicles Based on Value Function Approximation Algorithm\",\"authors\":\"Jin-wen Hu, Man Wang, Congzhe Zhang, Chunhui Zhao, Q. Pan, Xuemei Cheng, Feng Yang, Xiaoxu Wang\",\"doi\":\"10.1109/ICCA.2019.8899906\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper deals with the path planning problem for unmanned vehicles based on reinforcement learning. Considering the unmanned vehicles’ dynamic model, the neural network is used to approximate the value function. Besides, in order to make it more suitable for practical applications and speed up the learning process, the recursive least squares algorithm is used to eliminate the inverse operation. Then some experiments are implemented to verify the effectiveness of the proposed improved value function approximation algorithm. It is proved to have improved the generalization performance of reinforcement learning in continuous space.\",\"PeriodicalId\":130891,\"journal\":{\"name\":\"2019 IEEE 15th International Conference on Control and Automation (ICCA)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 15th International Conference on Control and Automation (ICCA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCA.2019.8899906\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 15th International Conference on Control and Automation (ICCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCA.2019.8899906","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Path Planning for Unmanned Vehicles Based on Value Function Approximation Algorithm
This paper deals with the path planning problem for unmanned vehicles based on reinforcement learning. Considering the unmanned vehicles’ dynamic model, the neural network is used to approximate the value function. Besides, in order to make it more suitable for practical applications and speed up the learning process, the recursive least squares algorithm is used to eliminate the inverse operation. Then some experiments are implemented to verify the effectiveness of the proposed improved value function approximation algorithm. It is proved to have improved the generalization performance of reinforcement learning in continuous space.