Robotic Arm Motion Planning Based on Curriculum Reinforcement Learning

Dongxu Zhou, Ruiqing Jia, Haifeng Yao
{"title":"Robotic Arm Motion Planning Based on Curriculum Reinforcement Learning","authors":"Dongxu Zhou, Ruiqing Jia, Haifeng Yao","doi":"10.1109/ICCRE51898.2021.9435700","DOIUrl":null,"url":null,"abstract":"With the rapid changes in application scenarios, the robotic arm’s motion planning function is playing an increasingly important role. The traditional demonstration motion planning method of the robotic arm cannot be carried out quickly. The use of reinforcement learning algorithms to solve motion planning problems is a new research trend that has emerged in recent years. However, reinforcement learning algorithms are difficult to converge quickly in some complex tasks. This leads to inefficient and difficult training problems in actual training. This paper proposes a robotic arm motion planning method based on curriculum reinforcement learning. This method adopts the concept of obstacle effective sphere to simplify obstacles in the environment. According to the reinforcement learning agent’s real-time motion planning ability, the size of the effective sphere radius of the obstacle is adaptively adjusted so that the agent can train in an environment that matches its ability. The agent can first be trained in a simple environment and then gradually transition to a complete obstacle environment. The experiment in a virtual environment shows that this method can successfully perform motion planning. Comparing this method with the training effect of using only the PPO algorithm shows that this algorithm can effectively improve the efficiency of reinforcement learning training and reduce algorithm convergence difficulty.","PeriodicalId":382619,"journal":{"name":"2021 6th International Conference on Control and Robotics Engineering (ICCRE)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th International Conference on Control and Robotics Engineering (ICCRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCRE51898.2021.9435700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

With the rapid changes in application scenarios, the robotic arm’s motion planning function is playing an increasingly important role. The traditional demonstration motion planning method of the robotic arm cannot be carried out quickly. The use of reinforcement learning algorithms to solve motion planning problems is a new research trend that has emerged in recent years. However, reinforcement learning algorithms are difficult to converge quickly in some complex tasks. This leads to inefficient and difficult training problems in actual training. This paper proposes a robotic arm motion planning method based on curriculum reinforcement learning. This method adopts the concept of obstacle effective sphere to simplify obstacles in the environment. According to the reinforcement learning agent’s real-time motion planning ability, the size of the effective sphere radius of the obstacle is adaptively adjusted so that the agent can train in an environment that matches its ability. The agent can first be trained in a simple environment and then gradually transition to a complete obstacle environment. The experiment in a virtual environment shows that this method can successfully perform motion planning. Comparing this method with the training effect of using only the PPO algorithm shows that this algorithm can effectively improve the efficiency of reinforcement learning training and reduce algorithm convergence difficulty.
基于课程强化学习的机械臂运动规划
随着应用场景的快速变化,机械臂的运动规划功能发挥着越来越重要的作用。传统的机械臂演示运动规划方法无法快速实现。利用强化学习算法解决运动规划问题是近年来出现的一个新的研究趋势。然而,在一些复杂的任务中,强化学习算法难以快速收敛。这就导致了在实际训练中出现训练效率低、训练难度大的问题。提出了一种基于课程强化学习的机械臂运动规划方法。该方法采用障碍物有效球的概念,简化了环境中的障碍物。根据强化学习智能体的实时运动规划能力,自适应调整障碍物有效球半径的大小,使智能体在与其能力相匹配的环境中进行训练。智能体可以先在简单的环境中进行训练,然后逐渐过渡到完整的障碍环境。在虚拟环境中的实验表明,该方法可以成功地进行运动规划。将该方法与仅使用PPO算法的训练效果进行比较,表明该算法能有效提高强化学习训练效率,降低算法收敛难度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信