Robotic Arm Motion Planning Based on Curriculum Reinforcement Learning

2021 6th International Conference on Control and Robotics Engineering (ICCRE) Pub Date : 2021-04-16 DOI:10.1109/ICCRE51898.2021.9435700

Dongxu Zhou, Ruiqing Jia, Haifeng Yao

{"title":"Robotic Arm Motion Planning Based on Curriculum Reinforcement Learning","authors":"Dongxu Zhou, Ruiqing Jia, Haifeng Yao","doi":"10.1109/ICCRE51898.2021.9435700","DOIUrl":null,"url":null,"abstract":"With the rapid changes in application scenarios, the robotic arm’s motion planning function is playing an increasingly important role. The traditional demonstration motion planning method of the robotic arm cannot be carried out quickly. The use of reinforcement learning algorithms to solve motion planning problems is a new research trend that has emerged in recent years. However, reinforcement learning algorithms are difficult to converge quickly in some complex tasks. This leads to inefficient and difficult training problems in actual training. This paper proposes a robotic arm motion planning method based on curriculum reinforcement learning. This method adopts the concept of obstacle effective sphere to simplify obstacles in the environment. According to the reinforcement learning agent’s real-time motion planning ability, the size of the effective sphere radius of the obstacle is adaptively adjusted so that the agent can train in an environment that matches its ability. The agent can first be trained in a simple environment and then gradually transition to a complete obstacle environment. The experiment in a virtual environment shows that this method can successfully perform motion planning. Comparing this method with the training effect of using only the PPO algorithm shows that this algorithm can effectively improve the efficiency of reinforcement learning training and reduce algorithm convergence difficulty.","PeriodicalId":382619,"journal":{"name":"2021 6th International Conference on Control and Robotics Engineering (ICCRE)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th International Conference on Control and Robotics Engineering (ICCRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCRE51898.2021.9435700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

With the rapid changes in application scenarios, the robotic arm’s motion planning function is playing an increasingly important role. The traditional demonstration motion planning method of the robotic arm cannot be carried out quickly. The use of reinforcement learning algorithms to solve motion planning problems is a new research trend that has emerged in recent years. However, reinforcement learning algorithms are difficult to converge quickly in some complex tasks. This leads to inefficient and difficult training problems in actual training. This paper proposes a robotic arm motion planning method based on curriculum reinforcement learning. This method adopts the concept of obstacle effective sphere to simplify obstacles in the environment. According to the reinforcement learning agent’s real-time motion planning ability, the size of the effective sphere radius of the obstacle is adaptively adjusted so that the agent can train in an environment that matches its ability. The agent can first be trained in a simple environment and then gradually transition to a complete obstacle environment. The experiment in a virtual environment shows that this method can successfully perform motion planning. Comparing this method with the training effect of using only the PPO algorithm shows that this algorithm can effectively improve the efficiency of reinforcement learning training and reduce algorithm convergence difficulty.

查看原文本刊更多论文

基于课程强化学习的机械臂运动规划

随着应用场景的快速变化，机械臂的运动规划功能发挥着越来越重要的作用。传统的机械臂演示运动规划方法无法快速实现。利用强化学习算法解决运动规划问题是近年来出现的一个新的研究趋势。然而，在一些复杂的任务中，强化学习算法难以快速收敛。这就导致了在实际训练中出现训练效率低、训练难度大的问题。提出了一种基于课程强化学习的机械臂运动规划方法。该方法采用障碍物有效球的概念，简化了环境中的障碍物。根据强化学习智能体的实时运动规划能力，自适应调整障碍物有效球半径的大小，使智能体在与其能力相匹配的环境中进行训练。智能体可以先在简单的环境中进行训练，然后逐渐过渡到完整的障碍环境。在虚拟环境中的实验表明，该方法可以成功地进行运动规划。将该方法与仅使用PPO算法的训练效果进行比较，表明该算法能有效提高强化学习训练效率，降低算法收敛难度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 6th International Conference on Control and Robotics Engineering (ICCRE)

自引率

0.00%

发文量