Mobile Robot Path Planning Based on Improved Reinforcement Learning Optimization
Yanshu Jing, Yukun Chen, Ming-hai Jiao, Jie Huang, Bowen Niu, Wenbo Zheng
Proceedings of the 2019 International Conference on Robotics Systems and Vehicle Technology - RSVT '19, 2019-10-16
DOI: 10.1145/3366715.3366717 (https://doi.org/10.1145/3366715.3366717)
In traditional mobile robot path planning, the parameters of the adaptive function are usually set as constants. Q-learning, a type of reinforcement learning, has recently gained popularity in autonomous mobile robot path planning. To solve the mobile robot path planning problem effectively in obstacle-avoidance environments, a path planning model and search algorithm based on improved reinforcement learning are proposed. The incentive model of the reinforcement learning mechanism is combined with a search selection strategy so that the reward function parameters are set dynamically. The iterative swarm-intelligence search process of global and local position selection is used to combine particle behavior with the reinforcement learning algorithm, dynamically adjusting the empirical parameters of the reward function through Q-learning training experiments that determine the constant parameters for the simulation experiment. Once the distance between the robot and an obstacle falls below a certain threshold, a random number between 0 and 1 is used to randomly adjust the moving direction, which avoids path-matching deadlock. The case study shows that the proposed algorithm is more efficient and effective, improving the search intensity and accuracy of mobile robot path planning. The simulation experiments also show that the proposed model and algorithm address a weakness of traditional path planning, namely that parameter selection cannot adapt to the actual scene in real time.
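To make the obstacle-avoidance rule more concrete, the following is a minimal sketch of plain tabular Q-learning on a small grid, with a random direction change when the robot is close to an obstacle. It is not the authors' algorithm: the grid, the reward values, the Manhattan distance threshold, the override probability `p_random`, and all function names are assumptions chosen for illustration, and the particle-based adjustment of the reward parameters is left out.

```python
import random

# Hypothetical 5x5 grid world: 0 = free cell, 1 = obstacle.
GRID = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 0, 1, 0],
    [0, 1, 0, 0, 0],
    [0, 0, 0, 1, 0],
]
START, GOAL = (0, 0), (4, 4)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def is_free(cell):
    """True if the cell is inside the grid and not an obstacle."""
    r, c = cell
    return 0 <= r < len(GRID) and 0 <= c < len(GRID[0]) and GRID[r][c] == 0


def near_obstacle(cell, threshold=1):
    """True if an obstacle lies within `threshold` cells (Manhattan distance)."""
    r, c = cell
    return any(
        v == 1 and abs(i - r) + abs(j - c) <= threshold
        for i, row in enumerate(GRID)
        for j, v in enumerate(row)
    )


def step(cell, action):
    """Apply a move; blocked moves keep the robot in place with a penalty."""
    nxt = (cell[0] + action[0], cell[1] + action[1])
    if not is_free(nxt):
        return cell, -5.0, False
    if nxt == GOAL:
        return nxt, 10.0, True
    return nxt, -1.0, False


def train(episodes=2000, alpha=0.5, gamma=0.9, epsilon=0.1, p_random=0.2, seed=0):
    """Tabular Q-learning; returns a dict mapping (cell, action_index) to a value."""
    rng = random.Random(seed)
    q = {}
    for _ in range(episodes):
        cell = START
        for _ in range(100):
            # Epsilon-greedy choice over the four moves.
            if rng.random() < epsilon:
                a = rng.randrange(4)
            else:
                a = max(range(4), key=lambda k: q.get((cell, k), 0.0))
            # Assumed reading of the deadlock rule: near an obstacle, a random
            # number in [0, 1) decides whether the direction is re-drawn.
            if near_obstacle(cell) and rng.random() < p_random:
                a = rng.randrange(4)
            nxt, reward, done = step(cell, ACTIONS[a])
            best_next = max(q.get((nxt, k), 0.0) for k in range(4))
            old = q.get((cell, a), 0.0)
            q[(cell, a)] = old + alpha * (reward + gamma * best_next - old)
            cell = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=50):
    """Follow the highest-valued action from START until GOAL or max_steps."""
    cell, path = START, [START]
    for _ in range(max_steps):
        a = max(range(4), key=lambda k: q.get((cell, k), 0.0))
        cell, _, done = step(cell, ACTIONS[a])
        path.append(cell)
        if done:
            break
    return path
```

Calling `greedy_path(train())` returns the list of cells visited by the learned policy. Because Q-learning is off-policy, the random direction changes affect only how the grid is explored during training, not the update target.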