{"title":"基于深度强化学习的机械臂智能控制","authors":"Jiangtao Zhou, Hua Zheng, Dong-zhu Zhao, Yingxue Chen","doi":"10.1109/ICMAE52228.2021.9522377","DOIUrl":null,"url":null,"abstract":"Traditional manipulator control requires the establishment of complex models and mathematical equations, and requires manual adjustment of parameters, which cannot adapt to tasks and environmental changes. With the rapid development of artificial intelligence, deep reinforcement learning algorithms show strong online adaptability and self-learning capabilities for complex systems, and have gradually become one of the important research hotspots in the field of intelligent control and artificial intelligence in recent years. Therefore, this paper combines the deep reinforcement learning algorithm with the manipulator control system, and uses the deep deterministic policy gradient (DDPG) algorithm to achieve the end position control of the manipulator. On this basis, this paper presents a scheme to improve the reward function. Finally, the DDPG algorithm is used for interactive training with the manipulator model. The experimental results show that the deep reinforcement learning algorithm can realize the control of the manipulator, and the improved reward function scheme can greatly improve the stability of training and the task completion rate.","PeriodicalId":161846,"journal":{"name":"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Intelligent Control of Manipulator Based on Deep Reinforcement Learning\",\"authors\":\"Jiangtao Zhou, Hua Zheng, Dong-zhu Zhao, Yingxue Chen\",\"doi\":\"10.1109/ICMAE52228.2021.9522377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional manipulator control requires the establishment of complex models and mathematical equations, and requires manual adjustment of parameters, which cannot adapt to tasks and environmental changes. With the rapid development of artificial intelligence, deep reinforcement learning algorithms show strong online adaptability and self-learning capabilities for complex systems, and have gradually become one of the important research hotspots in the field of intelligent control and artificial intelligence in recent years. Therefore, this paper combines the deep reinforcement learning algorithm with the manipulator control system, and uses the deep deterministic policy gradient (DDPG) algorithm to achieve the end position control of the manipulator. On this basis, this paper presents a scheme to improve the reward function. Finally, the DDPG algorithm is used for interactive training with the manipulator model. The experimental results show that the deep reinforcement learning algorithm can realize the control of the manipulator, and the improved reward function scheme can greatly improve the stability of training and the task completion rate.\",\"PeriodicalId\":161846,\"journal\":{\"name\":\"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMAE52228.2021.9522377\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMAE52228.2021.9522377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Intelligent Control of Manipulator Based on Deep Reinforcement Learning
Traditional manipulator control requires the establishment of complex models and mathematical equations, and requires manual adjustment of parameters, which cannot adapt to tasks and environmental changes. With the rapid development of artificial intelligence, deep reinforcement learning algorithms show strong online adaptability and self-learning capabilities for complex systems, and have gradually become one of the important research hotspots in the field of intelligent control and artificial intelligence in recent years. Therefore, this paper combines the deep reinforcement learning algorithm with the manipulator control system, and uses the deep deterministic policy gradient (DDPG) algorithm to achieve the end position control of the manipulator. On this basis, this paper presents a scheme to improve the reward function. Finally, the DDPG algorithm is used for interactive training with the manipulator model. The experimental results show that the deep reinforcement learning algorithm can realize the control of the manipulator, and the improved reward function scheme can greatly improve the stability of training and the task completion rate.