Xiaoguang Wu, Shaowei Liu, Tianci Zhang, Lei Yang, Yanhui Li, Tingjin Wang
{"title":"基于ddpg深度强化学习的双足机器人运动控制","authors":"Xiaoguang Wu, Shaowei Liu, Tianci Zhang, Lei Yang, Yanhui Li, Tingjin Wang","doi":"10.1109/WRC-SARA.2018.8584227","DOIUrl":null,"url":null,"abstract":"In the study of the passive biped robot, the avoidance of fall over is always an important direction of the research. In this paper, we propose Deep Deterministic Policy Gradient (DDPG) to control the biped robot walk steadily on the slope. For improve the speed of DDPG training, the DDPG used in the paper is improved by parallel actors and Prioritized Experience Replay (PER). In the simulation, we control different initial states that cause the biped robot to fall over. After the control, the biped robot can walk stably, which indicating that DDPG can effectively control the fall over of the biped robot.","PeriodicalId":185881,"journal":{"name":"2018 WRC Symposium on Advanced Robotics and Automation (WRC SARA)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"Motion Control for Biped Robot via DDPG-based Deep Reinforcement Learning\",\"authors\":\"Xiaoguang Wu, Shaowei Liu, Tianci Zhang, Lei Yang, Yanhui Li, Tingjin Wang\",\"doi\":\"10.1109/WRC-SARA.2018.8584227\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the study of the passive biped robot, the avoidance of fall over is always an important direction of the research. In this paper, we propose Deep Deterministic Policy Gradient (DDPG) to control the biped robot walk steadily on the slope. For improve the speed of DDPG training, the DDPG used in the paper is improved by parallel actors and Prioritized Experience Replay (PER). In the simulation, we control different initial states that cause the biped robot to fall over. After the control, the biped robot can walk stably, which indicating that DDPG can effectively control the fall over of the biped robot.\",\"PeriodicalId\":185881,\"journal\":{\"name\":\"2018 WRC Symposium on Advanced Robotics and Automation (WRC SARA)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 WRC Symposium on Advanced Robotics and Automation (WRC SARA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WRC-SARA.2018.8584227\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 WRC Symposium on Advanced Robotics and Automation (WRC SARA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WRC-SARA.2018.8584227","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Motion Control for Biped Robot via DDPG-based Deep Reinforcement Learning
In the study of the passive biped robot, the avoidance of fall over is always an important direction of the research. In this paper, we propose Deep Deterministic Policy Gradient (DDPG) to control the biped robot walk steadily on the slope. For improve the speed of DDPG training, the DDPG used in the paper is improved by parallel actors and Prioritized Experience Replay (PER). In the simulation, we control different initial states that cause the biped robot to fall over. After the control, the biped robot can walk stably, which indicating that DDPG can effectively control the fall over of the biped robot.