{"title":"Coordinated Motion Planning of Dual-arm Space Robot with Deep Reinforcement Learning","authors":"Mengying Tang, Xiaofei Yue, Zhan Zuo, Xiaoping Huang, Yanfang Liu, N. Qi","doi":"10.1109/ICUS48101.2019.8996069","DOIUrl":null,"url":null,"abstract":"In this paper, we focus on coordinated motion planning of dual-arm robot. The kinematics model of the robotic arm is established by Denavit-Hartenberg (D-H) coordinate method and the mathematical model of the cooperative motion planning problem is established. The rapidly-exploring random trees (RRT) algorithm and the deep deterministic policy gradient (DDPG) algorithm are used to carry out dual-arm coordinated motion planning, respectively. The simulation results show that these algorithms can effectively complete the robot arm motion planning task, but the RRT improved algorithm cannot balance the planning efficiency and result optimization. Compared with the RRT algorithm, the DDPG algorithm trains the model through continuous trial and error to optimize its planning strategy. The trained model can be used to obtain an optimized path and it can ensure the efficiency of the planning with the optimized strategy.","PeriodicalId":344181,"journal":{"name":"2019 IEEE International Conference on Unmanned Systems (ICUS)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference on Unmanned Systems (ICUS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICUS48101.2019.8996069","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In this paper, we focus on coordinated motion planning of dual-arm robot. The kinematics model of the robotic arm is established by Denavit-Hartenberg (D-H) coordinate method and the mathematical model of the cooperative motion planning problem is established. The rapidly-exploring random trees (RRT) algorithm and the deep deterministic policy gradient (DDPG) algorithm are used to carry out dual-arm coordinated motion planning, respectively. The simulation results show that these algorithms can effectively complete the robot arm motion planning task, but the RRT improved algorithm cannot balance the planning efficiency and result optimization. Compared with the RRT algorithm, the DDPG algorithm trains the model through continuous trial and error to optimize its planning strategy. The trained model can be used to obtain an optimized path and it can ensure the efficiency of the planning with the optimized strategy.