基于深度强化学习的机械臂智能控制

Jiangtao Zhou, Hua Zheng, Dong-zhu Zhao, Yingxue Chen
{"title":"基于深度强化学习的机械臂智能控制","authors":"Jiangtao Zhou, Hua Zheng, Dong-zhu Zhao, Yingxue Chen","doi":"10.1109/ICMAE52228.2021.9522377","DOIUrl":null,"url":null,"abstract":"Traditional manipulator control requires the establishment of complex models and mathematical equations, and requires manual adjustment of parameters, which cannot adapt to tasks and environmental changes. With the rapid development of artificial intelligence, deep reinforcement learning algorithms show strong online adaptability and self-learning capabilities for complex systems, and have gradually become one of the important research hotspots in the field of intelligent control and artificial intelligence in recent years. Therefore, this paper combines the deep reinforcement learning algorithm with the manipulator control system, and uses the deep deterministic policy gradient (DDPG) algorithm to achieve the end position control of the manipulator. On this basis, this paper presents a scheme to improve the reward function. Finally, the DDPG algorithm is used for interactive training with the manipulator model. The experimental results show that the deep reinforcement learning algorithm can realize the control of the manipulator, and the improved reward function scheme can greatly improve the stability of training and the task completion rate.","PeriodicalId":161846,"journal":{"name":"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Intelligent Control of Manipulator Based on Deep Reinforcement Learning\",\"authors\":\"Jiangtao Zhou, Hua Zheng, Dong-zhu Zhao, Yingxue Chen\",\"doi\":\"10.1109/ICMAE52228.2021.9522377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional manipulator control requires the establishment of complex models and mathematical equations, and requires manual adjustment of parameters, which cannot adapt to tasks and environmental changes. With the rapid development of artificial intelligence, deep reinforcement learning algorithms show strong online adaptability and self-learning capabilities for complex systems, and have gradually become one of the important research hotspots in the field of intelligent control and artificial intelligence in recent years. Therefore, this paper combines the deep reinforcement learning algorithm with the manipulator control system, and uses the deep deterministic policy gradient (DDPG) algorithm to achieve the end position control of the manipulator. On this basis, this paper presents a scheme to improve the reward function. Finally, the DDPG algorithm is used for interactive training with the manipulator model. The experimental results show that the deep reinforcement learning algorithm can realize the control of the manipulator, and the improved reward function scheme can greatly improve the stability of training and the task completion rate.\",\"PeriodicalId\":161846,\"journal\":{\"name\":\"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMAE52228.2021.9522377\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMAE52228.2021.9522377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

传统的机械手控制需要建立复杂的模型和数学方程,需要人工调整参数,不能适应任务和环境的变化。随着人工智能的快速发展,深度强化学习算法对复杂系统表现出强大的在线适应能力和自学习能力,逐渐成为近年来智能控制和人工智能领域的重要研究热点之一。因此,本文将深度强化学习算法与机械手控制系统相结合,采用深度确定性策略梯度(DDPG)算法实现机械手的末端位置控制。在此基础上,提出了一种改进奖励函数的方案。最后,利用DDPG算法与机械臂模型进行交互训练。实验结果表明,深度强化学习算法可以实现对机械手的控制,改进的奖励函数方案可以大大提高训练的稳定性和任务完成率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Intelligent Control of Manipulator Based on Deep Reinforcement Learning
Traditional manipulator control requires the establishment of complex models and mathematical equations, and requires manual adjustment of parameters, which cannot adapt to tasks and environmental changes. With the rapid development of artificial intelligence, deep reinforcement learning algorithms show strong online adaptability and self-learning capabilities for complex systems, and have gradually become one of the important research hotspots in the field of intelligent control and artificial intelligence in recent years. Therefore, this paper combines the deep reinforcement learning algorithm with the manipulator control system, and uses the deep deterministic policy gradient (DDPG) algorithm to achieve the end position control of the manipulator. On this basis, this paper presents a scheme to improve the reward function. Finally, the DDPG algorithm is used for interactive training with the manipulator model. The experimental results show that the deep reinforcement learning algorithm can realize the control of the manipulator, and the improved reward function scheme can greatly improve the stability of training and the task completion rate.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信