基于深度强化学习的机器人从人手中抓取物体的运动策略

Q1 Computer Science

Virtual Reality Intelligent Hardware Pub Date : 2023-10-01 DOI:10.1016/j.vrih.2022.12.001

Zeyuan Cai , Zhiquan Feng , Liran Zhou , Xiaohui Yang , Tao Xu

{"title":"基于深度强化学习的机器人从人手中抓取物体的运动策略","authors":"Zeyuan Cai , Zhiquan Feng , Liran Zhou , Xiaohui Yang , Tao Xu","doi":"10.1016/j.vrih.2022.12.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Robot grasping encompasses a wide range of research areas; however, most studies have been focused on the grasping of only stationary objects in a scene; only a few studies on how to grasp objects from a user's hand have been conducted. In this paper, a robot grasping algorithm based on deep reinforcement learning (RGRL) is proposed.</p></div><div><h3>Methods</h3><p>The RGRL takes the relative positions of the robot and the object in a user's hand as input and outputs the best action of the robot in the current state. Thus, the proposed algorithm realizes the functions of autonomous path planning and grasping objects safely from the hands of users. A new method for improving the safety of human–robot cooperation is explored. To solve the problems of a low utilization rate and slow convergence of reinforcement learning algorithms, the RGRL is first trained in a simulation scene, and then, the model parameters are applied to a real scene. To reduce the difference between the simulated and real scenes, domain randomization is applied to randomly change the positions and angles of objects in the simulated scenes at regular intervals, thereby improving the diversity of the training samples and robustness of the algorithm.</p></div><div><h3>Results</h3><p>The RGRL's effectiveness and accuracy are verified by evaluating it on both simulated and real scenes, and the results show that the RGRL can achieve an accuracy of more than 80% in both cases.</p></div><div><h3>Conclusions</h3><p>RGRL is a robot grasping algorithm that employs domain randomization and deep reinforcement learning for effective grasping in simulated and real scenes. However, it lacks flexibility in adapting to different grasping poses, prompting future research in achieving safe grasping for diverse user postures.</p></div>","PeriodicalId":33538,"journal":{"name":"Virtual Reality Intelligent Hardware","volume":"5 5","pages":"Pages 407-421"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep-reinforcement-learning-based robot motion strategies for grabbing objects from human hands\",\"authors\":\"Zeyuan Cai , Zhiquan Feng , Liran Zhou , Xiaohui Yang , Tao Xu\",\"doi\":\"10.1016/j.vrih.2022.12.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>Robot grasping encompasses a wide range of research areas; however, most studies have been focused on the grasping of only stationary objects in a scene; only a few studies on how to grasp objects from a user's hand have been conducted. In this paper, a robot grasping algorithm based on deep reinforcement learning (RGRL) is proposed.</p></div><div><h3>Methods</h3><p>The RGRL takes the relative positions of the robot and the object in a user's hand as input and outputs the best action of the robot in the current state. Thus, the proposed algorithm realizes the functions of autonomous path planning and grasping objects safely from the hands of users. A new method for improving the safety of human–robot cooperation is explored. To solve the problems of a low utilization rate and slow convergence of reinforcement learning algorithms, the RGRL is first trained in a simulation scene, and then, the model parameters are applied to a real scene. To reduce the difference between the simulated and real scenes, domain randomization is applied to randomly change the positions and angles of objects in the simulated scenes at regular intervals, thereby improving the diversity of the training samples and robustness of the algorithm.</p></div><div><h3>Results</h3><p>The RGRL's effectiveness and accuracy are verified by evaluating it on both simulated and real scenes, and the results show that the RGRL can achieve an accuracy of more than 80% in both cases.</p></div><div><h3>Conclusions</h3><p>RGRL is a robot grasping algorithm that employs domain randomization and deep reinforcement learning for effective grasping in simulated and real scenes. However, it lacks flexibility in adapting to different grasping poses, prompting future research in achieving safe grasping for diverse user postures.</p></div>\",\"PeriodicalId\":33538,\"journal\":{\"name\":\"Virtual Reality Intelligent Hardware\",\"volume\":\"5 5\",\"pages\":\"Pages 407-421\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Virtual Reality Intelligent Hardware\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2096579622001188\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virtual Reality Intelligent Hardware","FirstCategoryId":"1093","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2096579622001188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

摘要

背景机器人抓取包含了广泛的研究领域；然而，大多数研究都集中在场景中静止物体的抓取上；只有少数关于如何从用户手中抓握物体的研究被进行。本文提出了一种基于深度强化学习的机器人抓取算法。方法RGRL以机器人和物体在用户手中的相对位置为输入，输出机器人在当前状态下的最佳动作。因此，该算法实现了自主路径规划和从用户手中安全抓取物体的功能。探索了一种提高人机协作安全性的新方法。为了解决强化学习算法利用率低、收敛慢的问题，首先在模拟场景中训练RGRL，然后将模型参数应用于真实场景。为了减少模拟场景和真实场景之间的差异，应用域随机化以规则的间隔随机改变模拟场景中对象的位置和角度，从而提高训练样本的多样性和算法的鲁棒性。结果通过对RGRL在模拟和真实场景中的评估，验证了其有效性和准确性，结果表明，RGRL在两种情况下都能达到80%以上的准确率。结论sRGRL是一种采用领域随机化和深度强化学习的机器人抓取算法，可在模拟和真实场景中进行有效抓取。然而，它在适应不同的抓握姿势方面缺乏灵活性，这促使未来对实现不同用户姿势的安全抓握进行研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Deep-reinforcement-learning-based robot motion strategies for grabbing objects from human hands

Background

Robot grasping encompasses a wide range of research areas; however, most studies have been focused on the grasping of only stationary objects in a scene; only a few studies on how to grasp objects from a user's hand have been conducted. In this paper, a robot grasping algorithm based on deep reinforcement learning (RGRL) is proposed.

Methods

The RGRL takes the relative positions of the robot and the object in a user's hand as input and outputs the best action of the robot in the current state. Thus, the proposed algorithm realizes the functions of autonomous path planning and grasping objects safely from the hands of users. A new method for improving the safety of human–robot cooperation is explored. To solve the problems of a low utilization rate and slow convergence of reinforcement learning algorithms, the RGRL is first trained in a simulation scene, and then, the model parameters are applied to a real scene. To reduce the difference between the simulated and real scenes, domain randomization is applied to randomly change the positions and angles of objects in the simulated scenes at regular intervals, thereby improving the diversity of the training samples and robustness of the algorithm.

Results

The RGRL's effectiveness and accuracy are verified by evaluating it on both simulated and real scenes, and the results show that the RGRL can achieve an accuracy of more than 80% in both cases.

Conclusions

RGRL is a robot grasping algorithm that employs domain randomization and deep reinforcement learning for effective grasping in simulated and real scenes. However, it lacks flexibility in adapting to different grasping poses, prompting future research in achieving safe grasping for diverse user postures.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Virtual Reality Intelligent Hardware Computer Science-Computer Graphics and Computer-Aided Design

CiteScore

6.40

自引率

0.00%

发文量

审稿时长

12 weeks