{"title":"Robot Path Planning Method Based on Deep Reinforcement Learning","authors":"Yongmei Zhang, Jiarui Zhao, Jie Sun","doi":"10.1109/CCET50901.2020.9213166","DOIUrl":null,"url":null,"abstract":"Aiming at the excessive dependence of traditional path planning methods on map information and the lack of self-learning and self-adaptive capabilities, a path planning method for robot based on deep reinforcement learning is proposed. The paper takes lidar data of Gazebo simulation environments built on the ROS platform as input. Learn direct action control from environment information through end-to-end learning, adopt neural network to fit value-based non-model time difference Q learning algorithm, reasonably design environment models, and the number of state spaces, the optimal decision strategy is learned by maximizing robot and dynamic environment interaction of cumulative reward. Simulation results show the method can meet the requirements of intelligent perception and decision-making by only relying on some map information.","PeriodicalId":236862,"journal":{"name":"2020 IEEE 3rd International Conference on Computer and Communication Engineering Technology (CCET)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 3rd International Conference on Computer and Communication Engineering Technology (CCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCET50901.2020.9213166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Aiming at the excessive dependence of traditional path planning methods on map information and the lack of self-learning and self-adaptive capabilities, a path planning method for robot based on deep reinforcement learning is proposed. The paper takes lidar data of Gazebo simulation environments built on the ROS platform as input. Learn direct action control from environment information through end-to-end learning, adopt neural network to fit value-based non-model time difference Q learning algorithm, reasonably design environment models, and the number of state spaces, the optimal decision strategy is learned by maximizing robot and dynamic environment interaction of cumulative reward. Simulation results show the method can meet the requirements of intelligent perception and decision-making by only relying on some map information.