{"title":"通过强化学习实现双足仿人机器人 \"小曼 \"的盲走平衡控制和干扰抑制","authors":"Chao Ji, Diyuan Liu, Wei Gao, Shiwu Zhang","doi":"10.1109/ROBIO58561.2023.10354629","DOIUrl":null,"url":null,"abstract":"Bipedal humanoid robot has the ability to both move and manipulate in complex environments, which is of great significance in the future. However, stable bipedal walking in the real world has always been a challenge in industry and even in academia. The traditional model-based methods are highly dependent on the environment, with high modeling complexity and lack of generalization. The solution based on the simplified model usually causes the problem that the control algorithms cannot adapt to complex terrain environment. This paper presents a newly designed bipedal humanoid robot, Xiao-Man. Aiming at achieving the robot’s terrain-adaptive walking behavior, a reinforcement learning based Actor-Critic network with asymmetric structure is proposed. Without using any external perception information, robust bipedal walking behavior of Xiao-Man is achieved. In the process, we also build the dataset based on the joint actuation truth data and train a joint actuator network to reduce the gap between the expected torque and the actual response torque. Experimental results show that the bipedal humanoid robot equipped with the trained control policy achieves the capability of stable walking and disturbance rejection only rely on proprioceptive information.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"24 2","pages":"1-7"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Blind Walking Balance Control and Disturbance Rejection of the Bipedal Humanoid Robot Xiao-Man via Reinforcement Learning\",\"authors\":\"Chao Ji, Diyuan Liu, Wei Gao, Shiwu Zhang\",\"doi\":\"10.1109/ROBIO58561.2023.10354629\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Bipedal humanoid robot has the ability to both move and manipulate in complex environments, which is of great significance in the future. However, stable bipedal walking in the real world has always been a challenge in industry and even in academia. The traditional model-based methods are highly dependent on the environment, with high modeling complexity and lack of generalization. The solution based on the simplified model usually causes the problem that the control algorithms cannot adapt to complex terrain environment. This paper presents a newly designed bipedal humanoid robot, Xiao-Man. Aiming at achieving the robot’s terrain-adaptive walking behavior, a reinforcement learning based Actor-Critic network with asymmetric structure is proposed. Without using any external perception information, robust bipedal walking behavior of Xiao-Man is achieved. In the process, we also build the dataset based on the joint actuation truth data and train a joint actuator network to reduce the gap between the expected torque and the actual response torque. Experimental results show that the bipedal humanoid robot equipped with the trained control policy achieves the capability of stable walking and disturbance rejection only rely on proprioceptive information.\",\"PeriodicalId\":505134,\"journal\":{\"name\":\"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)\",\"volume\":\"24 2\",\"pages\":\"1-7\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROBIO58561.2023.10354629\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBIO58561.2023.10354629","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Blind Walking Balance Control and Disturbance Rejection of the Bipedal Humanoid Robot Xiao-Man via Reinforcement Learning
Bipedal humanoid robot has the ability to both move and manipulate in complex environments, which is of great significance in the future. However, stable bipedal walking in the real world has always been a challenge in industry and even in academia. The traditional model-based methods are highly dependent on the environment, with high modeling complexity and lack of generalization. The solution based on the simplified model usually causes the problem that the control algorithms cannot adapt to complex terrain environment. This paper presents a newly designed bipedal humanoid robot, Xiao-Man. Aiming at achieving the robot’s terrain-adaptive walking behavior, a reinforcement learning based Actor-Critic network with asymmetric structure is proposed. Without using any external perception information, robust bipedal walking behavior of Xiao-Man is achieved. In the process, we also build the dataset based on the joint actuation truth data and train a joint actuator network to reduce the gap between the expected torque and the actual response torque. Experimental results show that the bipedal humanoid robot equipped with the trained control policy achieves the capability of stable walking and disturbance rejection only rely on proprioceptive information.