Constrained DMPs for Feasible Skill Learning on Humanoid Robots

2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids) Pub Date : 2018-11-01 DOI:10.1109/HUMANOIDS.2018.8624934

Anqing Duan, R. Camoriano, Diego Ferigo, Daniele Calandriello, L. Rosasco, D. Pucci

{"title":"Constrained DMPs for Feasible Skill Learning on Humanoid Robots","authors":"Anqing Duan, R. Camoriano, Diego Ferigo, Daniele Calandriello, L. Rosasco, D. Pucci","doi":"10.1109/HUMANOIDS.2018.8624934","DOIUrl":null,"url":null,"abstract":"In the context of humanoid skill learning, movement primitives have gained much attention because of their compact representation and convenient combination with a myriad of optimization approaches. Among them, a well-known scheme is to use Dynamic Movement Primitives (DMPs) with reinforcement learning (RL) algorithms. While various remarkable results have been reported, skill learning with physical constraints has not been sufficiently investigated. For example, when RL is employed to optimize the robot joint trajectories, the exploration noise could drive the resulting trajectory out of the joint limits. In this paper, we focus on robot skill learning characterized by joint limit avoidance, by introducing the novel Constrained Dynamic Movement Primitives (CDMPs). By controlling a set of transformed states (called exogenous states) instead of the original DMPs states, CDMPs are capable of maintaining the joint trajectories within the safety limits. We validate CDMPs on the humanoid robot iCub, showing the applicability of our approach.","PeriodicalId":433345,"journal":{"name":"2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HUMANOIDS.2018.8624934","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

Abstract

In the context of humanoid skill learning, movement primitives have gained much attention because of their compact representation and convenient combination with a myriad of optimization approaches. Among them, a well-known scheme is to use Dynamic Movement Primitives (DMPs) with reinforcement learning (RL) algorithms. While various remarkable results have been reported, skill learning with physical constraints has not been sufficiently investigated. For example, when RL is employed to optimize the robot joint trajectories, the exploration noise could drive the resulting trajectory out of the joint limits. In this paper, we focus on robot skill learning characterized by joint limit avoidance, by introducing the novel Constrained Dynamic Movement Primitives (CDMPs). By controlling a set of transformed states (called exogenous states) instead of the original DMPs states, CDMPs are capable of maintaining the joint trajectories within the safety limits. We validate CDMPs on the humanoid robot iCub, showing the applicability of our approach.

查看原文本刊更多论文

仿人机器人可行技能学习的约束dmp

在类人技能学习的背景下，运动原语以其紧凑的表示形式和方便的结合多种优化方法而备受关注。其中，一个著名的方案是使用动态运动原语(dmp)与强化学习(RL)算法。虽然已经报道了各种显著的结果，但物理限制下的技能学习尚未得到充分的研究。例如，当使用RL优化机器人关节轨迹时，探测噪声可能会使得到的轨迹超出关节限制。本文通过引入新的约束动态运动原语(CDMPs)，研究以关节极限规避为特征的机器人技能学习。通过控制一组转换状态(称为外生状态)而不是原始dmp状态，cdmp能够将联合轨迹维持在安全范围内。我们在仿人机器人iCub上验证了CDMPs，显示了我们方法的适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids)

自引率

0.00%

发文量