{"title":"TDE2-MBRL: Energy-exchange Dynamics Learning with Task Decomposition for Spring-loaded Bipedal Robot Locomotion","authors":"Cheng-Yu Kuo, Hirofumi Shin, Takumi Kamioka, Takamitsu Matsubara","doi":"10.1109/Humanoids53995.2022.10000180","DOIUrl":null,"url":null,"abstract":"Spring-loaded Inverted Pendulum (SLIP) inspired bipedal robots (SLIP-biped) have high agility owing to their fault tolerance under impacts. Controlling a SLIP-biped requires capturing its dynamics; however, its high complexity makes analytic method implementation challenging. Thus, a Model-based Reinforcement Learning (MBRL) that learns a dynamics model and utilizes it for control design appears to be a reasonable alternative. Nevertheless, modeling high complexity dynamics with conventional MBRL approaches requires enormous samples or a high computation load. Therefore, exploring a simplified and compact dynamics model for SLIP-biped would be a key to increasing the feasibility of MBRL implementation and real-time control. We propose a Task-Decomposed Energy-exchange dynamics learning with MBRL (TDE2-MBRL) to capture simplified SLIP-biped dynamics and utilize them for control. Specifically, under the law of energy conservation, we model the energy exchange to reduce dynamics' dimensionality. Next, we decompose the SLIP-biped dynamics into locomotion task phases to cope with dynamics dissimilarity. The effectiveness is demonstrated by hopping skill acquisition with a precise simulated SLIP-biped replica of a real SLIP-biped. The experiment results show that TDE2-MBRL improves learning efficiency and control frequency while having comparable model accuracy to the standard MBRL.","PeriodicalId":180816,"journal":{"name":"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/Humanoids53995.2022.10000180","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Spring-loaded Inverted Pendulum (SLIP) inspired bipedal robots (SLIP-biped) have high agility owing to their fault tolerance under impacts. Controlling a SLIP-biped requires capturing its dynamics; however, its high complexity makes analytic method implementation challenging. Thus, a Model-based Reinforcement Learning (MBRL) that learns a dynamics model and utilizes it for control design appears to be a reasonable alternative. Nevertheless, modeling high complexity dynamics with conventional MBRL approaches requires enormous samples or a high computation load. Therefore, exploring a simplified and compact dynamics model for SLIP-biped would be a key to increasing the feasibility of MBRL implementation and real-time control. We propose a Task-Decomposed Energy-exchange dynamics learning with MBRL (TDE2-MBRL) to capture simplified SLIP-biped dynamics and utilize them for control. Specifically, under the law of energy conservation, we model the energy exchange to reduce dynamics' dimensionality. Next, we decompose the SLIP-biped dynamics into locomotion task phases to cope with dynamics dissimilarity. The effectiveness is demonstrated by hopping skill acquisition with a precise simulated SLIP-biped replica of a real SLIP-biped. The experiment results show that TDE2-MBRL improves learning efficiency and control frequency while having comparable model accuracy to the standard MBRL.