基于深度强化学习的机械臂智能控制

2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE) Pub Date : 2021-07-16 DOI:10.1109/ICMAE52228.2021.9522377

Jiangtao Zhou, Hua Zheng, Dong-zhu Zhao, Yingxue Chen

{"title":"基于深度强化学习的机械臂智能控制","authors":"Jiangtao Zhou, Hua Zheng, Dong-zhu Zhao, Yingxue Chen","doi":"10.1109/ICMAE52228.2021.9522377","DOIUrl":null,"url":null,"abstract":"Traditional manipulator control requires the establishment of complex models and mathematical equations, and requires manual adjustment of parameters, which cannot adapt to tasks and environmental changes. With the rapid development of artificial intelligence, deep reinforcement learning algorithms show strong online adaptability and self-learning capabilities for complex systems, and have gradually become one of the important research hotspots in the field of intelligent control and artificial intelligence in recent years. Therefore, this paper combines the deep reinforcement learning algorithm with the manipulator control system, and uses the deep deterministic policy gradient (DDPG) algorithm to achieve the end position control of the manipulator. On this basis, this paper presents a scheme to improve the reward function. Finally, the DDPG algorithm is used for interactive training with the manipulator model. The experimental results show that the deep reinforcement learning algorithm can realize the control of the manipulator, and the improved reward function scheme can greatly improve the stability of training and the task completion rate.","PeriodicalId":161846,"journal":{"name":"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Intelligent Control of Manipulator Based on Deep Reinforcement Learning\",\"authors\":\"Jiangtao Zhou, Hua Zheng, Dong-zhu Zhao, Yingxue Chen\",\"doi\":\"10.1109/ICMAE52228.2021.9522377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional manipulator control requires the establishment of complex models and mathematical equations, and requires manual adjustment of parameters, which cannot adapt to tasks and environmental changes. With the rapid development of artificial intelligence, deep reinforcement learning algorithms show strong online adaptability and self-learning capabilities for complex systems, and have gradually become one of the important research hotspots in the field of intelligent control and artificial intelligence in recent years. Therefore, this paper combines the deep reinforcement learning algorithm with the manipulator control system, and uses the deep deterministic policy gradient (DDPG) algorithm to achieve the end position control of the manipulator. On this basis, this paper presents a scheme to improve the reward function. Finally, the DDPG algorithm is used for interactive training with the manipulator model. The experimental results show that the deep reinforcement learning algorithm can realize the control of the manipulator, and the improved reward function scheme can greatly improve the stability of training and the task completion rate.\",\"PeriodicalId\":161846,\"journal\":{\"name\":\"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMAE52228.2021.9522377\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMAE52228.2021.9522377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

传统的机械手控制需要建立复杂的模型和数学方程，需要人工调整参数，不能适应任务和环境的变化。随着人工智能的快速发展，深度强化学习算法对复杂系统表现出强大的在线适应能力和自学习能力，逐渐成为近年来智能控制和人工智能领域的重要研究热点之一。因此，本文将深度强化学习算法与机械手控制系统相结合，采用深度确定性策略梯度(DDPG)算法实现机械手的末端位置控制。在此基础上，提出了一种改进奖励函数的方案。最后，利用DDPG算法与机械臂模型进行交互训练。实验结果表明，深度强化学习算法可以实现对机械手的控制，改进的奖励函数方案可以大大提高训练的稳定性和任务完成率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Intelligent Control of Manipulator Based on Deep Reinforcement Learning

Traditional manipulator control requires the establishment of complex models and mathematical equations, and requires manual adjustment of parameters, which cannot adapt to tasks and environmental changes. With the rapid development of artificial intelligence, deep reinforcement learning algorithms show strong online adaptability and self-learning capabilities for complex systems, and have gradually become one of the important research hotspots in the field of intelligent control and artificial intelligence in recent years. Therefore, this paper combines the deep reinforcement learning algorithm with the manipulator control system, and uses the deep deterministic policy gradient (DDPG) algorithm to achieve the end position control of the manipulator. On this basis, this paper presents a scheme to improve the reward function. Finally, the DDPG algorithm is used for interactive training with the manipulator model. The experimental results show that the deep reinforcement learning algorithm can realize the control of the manipulator, and the improved reward function scheme can greatly improve the stability of training and the task completion rate.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)

自引率

0.00%

发文量