Learning push recovery for a bipedal humanoid robot with Dynamical Movement Primitives

2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids) Pub Date : 2015-12-28 DOI:10.1109/HUMANOIDS.2015.7363478

D. Luo, Xiaoqiang Han, Y. Ding, Yang Ma, Zhan Liu, Xihong Wu

{"title":"Learning push recovery for a bipedal humanoid robot with Dynamical Movement Primitives","authors":"D. Luo, Xiaoqiang Han, Y. Ding, Yang Ma, Zhan Liu, Xihong Wu","doi":"10.1109/HUMANOIDS.2015.7363478","DOIUrl":null,"url":null,"abstract":"Maintaining the balance is always a challenge issue for bipedal humanoid robots in dealing with various locomotive tasks to serve human society, especially when the real environment the robot worked within exhibits to be very complex. Unlike plenty of previous successful approaches on humanoid balancing or push recovery, in this research, the Dynamical Movement Primitives (DMP) is employed to model several typical bio-inspired strategies. As humanoid balancing or push recovery could be regarded as a problem of how a robot to get back to its ongoing behavior when a break happens on account of external force or uneven terrain etc., the DMP model becomes an alternative ideal choice due to its promising nature of attractor. Meanwhile, the DMP composed of a set of differential equations provides a compact parameterized representation in modelling a motion strategy, and thus leads to a strategy model that is suitable to be fulfilled with machine learning techniques. In this research, the learning process for those bio-inspired strategies modeled with DMP are completed by adopting the stochastic policy gradient reinforcement learning and imitation learning separately. Furthermore, with Gaussian Process(GP) regression, push recovery strategies are generalized taking the advantages of the invariance properties of the DMP model. As a consequence, an online adaptive push recovery control strategy is finally achieved. Experimental results on both simulated robot and a real bipedal humanoid robot PKU-HR5 demonstrate the presented approach is effective and promising.","PeriodicalId":417686,"journal":{"name":"2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids)","volume":"133 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HUMANOIDS.2015.7363478","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

Abstract

Maintaining the balance is always a challenge issue for bipedal humanoid robots in dealing with various locomotive tasks to serve human society, especially when the real environment the robot worked within exhibits to be very complex. Unlike plenty of previous successful approaches on humanoid balancing or push recovery, in this research, the Dynamical Movement Primitives (DMP) is employed to model several typical bio-inspired strategies. As humanoid balancing or push recovery could be regarded as a problem of how a robot to get back to its ongoing behavior when a break happens on account of external force or uneven terrain etc., the DMP model becomes an alternative ideal choice due to its promising nature of attractor. Meanwhile, the DMP composed of a set of differential equations provides a compact parameterized representation in modelling a motion strategy, and thus leads to a strategy model that is suitable to be fulfilled with machine learning techniques. In this research, the learning process for those bio-inspired strategies modeled with DMP are completed by adopting the stochastic policy gradient reinforcement learning and imitation learning separately. Furthermore, with Gaussian Process(GP) regression, push recovery strategies are generalized taking the advantages of the invariance properties of the DMP model. As a consequence, an online adaptive push recovery control strategy is finally achieved. Experimental results on both simulated robot and a real bipedal humanoid robot PKU-HR5 demonstrate the presented approach is effective and promising.

查看原文本刊更多论文

基于动态运动基元的两足仿人机器人学习推送恢复

在处理各种机车任务，为人类社会服务的过程中，保持平衡一直是双足类人机器人面临的难题，特别是当机器人工作的真实环境非常复杂时。与以往许多成功的仿人平衡或推动恢复方法不同，在本研究中，动态运动原语(DMP)被用于几种典型的仿生策略的建模。人形平衡或推恢复可以被视为机器人在外力或不平坦地形等原因导致断裂时如何恢复其正在进行的行为的问题，DMP模型由于其有前途的吸引子性质而成为另一种理想选择。同时，由一组微分方程组成的DMP为运动策略建模提供了一种紧凑的参数化表示，从而得到适合用机器学习技术实现的策略模型。在本研究中，采用随机策略梯度强化学习和模仿学习分别完成了基于DMP模型的仿生策略的学习过程。此外，利用DMP模型的不变性，利用高斯过程(GP)回归对推恢复策略进行了推广。最终实现了一种在线自适应推送恢复控制策略。在仿真机器人和实际两足仿人机器人PKU-HR5上的实验结果表明，该方法是有效的，具有广阔的应用前景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids)

自引率

0.00%

发文量