神经步态:通过控制障碍函数和零动力学策略学习双足运动

Conference on Learning for Dynamics & Control Pub Date : 2022-04-18 DOI:10.48550/arXiv.2204.08120

I. D. Rodriguez, Noel Csomay-Shanklin, Yisong Yue, A. Ames

{"title":"神经步态:通过控制障碍函数和零动力学策略学习双足运动","authors":"I. D. Rodriguez, Noel Csomay-Shanklin, Yisong Yue, A. Ames","doi":"10.48550/arXiv.2204.08120","DOIUrl":null,"url":null,"abstract":"This work presents Neural Gaits, a method for learning dynamic walking gaits through the enforce-ment of set invariance that can be reﬁned episodically using experimental data from the robot. We frame walking as a set invariance problem enforceable via control barrier functions (CBFs) deﬁned on the reduced-order dynamics quantifying the underactuated component of the robot: the zero dynamics. Our approach contains two learning modules: one for learning a policy that satisﬁes the CBF condition, and another for learning a residual dynamics model to reﬁne imperfections of the nominal model. Importantly, learning only over the zero dynamics signiﬁcantly reduces the dimensionality of the learning problem while using CBFs allows us to still make guarantees for the full-order system. The method is demonstrated experimentally on an underactuated bipedal robot, where we are able to show agile and dynamic locomotion, even with partially unknown dynamics.","PeriodicalId":268449,"journal":{"name":"Conference on Learning for Dynamics & Control","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Neural Gaits: Learning Bipedal Locomotion via Control Barrier Functions and Zero Dynamics Policies\",\"authors\":\"I. D. Rodriguez, Noel Csomay-Shanklin, Yisong Yue, A. Ames\",\"doi\":\"10.48550/arXiv.2204.08120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work presents Neural Gaits, a method for learning dynamic walking gaits through the enforce-ment of set invariance that can be reﬁned episodically using experimental data from the robot. We frame walking as a set invariance problem enforceable via control barrier functions (CBFs) deﬁned on the reduced-order dynamics quantifying the underactuated component of the robot: the zero dynamics. Our approach contains two learning modules: one for learning a policy that satisﬁes the CBF condition, and another for learning a residual dynamics model to reﬁne imperfections of the nominal model. Importantly, learning only over the zero dynamics signiﬁcantly reduces the dimensionality of the learning problem while using CBFs allows us to still make guarantees for the full-order system. The method is demonstrated experimentally on an underactuated bipedal robot, where we are able to show agile and dynamic locomotion, even with partially unknown dynamics.\",\"PeriodicalId\":268449,\"journal\":{\"name\":\"Conference on Learning for Dynamics & Control\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference on Learning for Dynamics & Control\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2204.08120\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Learning for Dynamics & Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2204.08120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

这项工作提出了Neural Gaits，这是一种通过强化集合不变性来学习动态步行步态的方法，可以使用机器人的实验数据进行偶然的改进。我们将行走作为一个集合不变性问题，通过控制障碍函数(cbf)来实现，cbf定义在量化机器人欠驱动部分的降阶动力学上:零动力学。我们的方法包含两个学习模块:一个用于学习满足CBF条件的策略，另一个用于学习残余动力学模型以改进名义模型的缺陷。重要的是，仅在零动态上学习显著降低了学习问题的维数，而使用cbf允许我们仍然对全阶系统做出保证。该方法在欠驱动双足机器人上进行了实验验证，即使在部分未知的动力学情况下，我们也能够显示出敏捷和动态的运动。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Neural Gaits: Learning Bipedal Locomotion via Control Barrier Functions and Zero Dynamics Policies

This work presents Neural Gaits, a method for learning dynamic walking gaits through the enforce-ment of set invariance that can be reﬁned episodically using experimental data from the robot. We frame walking as a set invariance problem enforceable via control barrier functions (CBFs) deﬁned on the reduced-order dynamics quantifying the underactuated component of the robot: the zero dynamics. Our approach contains two learning modules: one for learning a policy that satisﬁes the CBF condition, and another for learning a residual dynamics model to reﬁne imperfections of the nominal model. Importantly, learning only over the zero dynamics signiﬁcantly reduces the dimensionality of the learning problem while using CBFs allows us to still make guarantees for the full-order system. The method is demonstrated experimentally on an underactuated bipedal robot, where we are able to show agile and dynamic locomotion, even with partially unknown dynamics.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Conference on Learning for Dynamics & Control

自引率

0.00%

发文量