Improving reinforcement learning based moving object grasping with trajectory prediction

IF 4.3 4区计算机科学 Q3 ROBOTICS

Intelligent Service Robotics Pub Date : 2023-11-24 DOI:10.1007/s11370-023-00491-5

Binzhao Xu, Taimur Hassan, Irfan Hussain

{"title":"Improving reinforcement learning based moving object grasping with trajectory prediction","authors":"Binzhao Xu, Taimur Hassan, Irfan Hussain","doi":"10.1007/s11370-023-00491-5","DOIUrl":null,"url":null,"abstract":"<p>Currently, most grasping systems are designed to grasp the static objects only, and grasping dynamic objects has received less attention in the literature. For the traditional manipulation scheme, achieving dynamic grasping requires either a highly precise dynamic model or sophisticated predefined grasping states and gestures, both of which are hard to obtain and tedious to design. In this paper, we develop a novel reinforcement learning (RL)-based dynamic grasping framework with a trajectory prediction module to address these issues. In particular, we divide dynamic grasping into two parts: RL-based grasping strategies learning and trajectory prediction. In the simulation setting, an RL agent is trained to grasp a static object. When this well-trained agent is transferred to the real world, the observation has been augmented with the predicted one from an LSTM-based trajectory prediction module. We validated the proposed method through an experimental setup involving a Baxter manipulator with two finger grippers and an object placed on a moving car. We also evaluated how well RL performs both with and without our intended trajectory prediction. Experiment results demonstrate that our method can grasp the object on different trajectories at various speeds.</p>","PeriodicalId":48813,"journal":{"name":"Intelligent Service Robotics","volume":"23 10","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2023-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Service Robotics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11370-023-00491-5","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ROBOTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Currently, most grasping systems are designed to grasp the static objects only, and grasping dynamic objects has received less attention in the literature. For the traditional manipulation scheme, achieving dynamic grasping requires either a highly precise dynamic model or sophisticated predefined grasping states and gestures, both of which are hard to obtain and tedious to design. In this paper, we develop a novel reinforcement learning (RL)-based dynamic grasping framework with a trajectory prediction module to address these issues. In particular, we divide dynamic grasping into two parts: RL-based grasping strategies learning and trajectory prediction. In the simulation setting, an RL agent is trained to grasp a static object. When this well-trained agent is transferred to the real world, the observation has been augmented with the predicted one from an LSTM-based trajectory prediction module. We validated the proposed method through an experimental setup involving a Baxter manipulator with two finger grippers and an object placed on a moving car. We also evaluated how well RL performs both with and without our intended trajectory prediction. Experiment results demonstrate that our method can grasp the object on different trajectories at various speeds.

Abstract Image

查看原文本刊更多论文

基于轨迹预测的运动物体抓取强化学习改进

目前，大多数抓取系统都是针对静态对象设计的，而对动态对象的抓取研究较少。对于传统的操作方案，实现动态抓取需要高精度的动态模型或复杂的预定义抓取状态和手势，这两者很难获得且设计繁琐。在本文中，我们开发了一种新的基于强化学习(RL)的动态抓取框架，并带有轨迹预测模块来解决这些问题。特别地，我们将动态抓取分为两个部分:基于强化学习的抓取策略学习和轨迹预测。在模拟设置中，RL代理被训练去抓取一个静态对象。当这个训练有素的智能体被转移到现实世界时，观察结果已经与基于lstm的轨迹预测模块的预测结果相增强。我们通过一个实验装置验证了所提出的方法，该实验装置涉及一个带有两个手指夹持器的Baxter机械手和一个放置在移动汽车上的物体。我们还评估了RL在有和没有预期轨迹预测的情况下的表现。实验结果表明，该方法能够以不同的速度在不同的轨迹上抓取物体。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Intelligent Service Robotics ROBOTICS-

CiteScore

5.70

自引率

4.00%

发文量

期刊介绍： The journal directs special attention to the emerging significance of integrating robotics with information technology and cognitive science (such as ubiquitous and adaptive computing,information integration in a distributed environment, and cognitive modelling for human-robot interaction), which spurs innovation toward a new multi-dimensional robotic service to humans. The journal intends to capture and archive this emerging yet significant advancement in the field of intelligent service robotics. The journal will publish original papers of innovative ideas and concepts, new discoveries and improvements, as well as novel applications and business models which are related to the field of intelligent service robotics described above and are proven to be of high quality. The areas that the Journal will cover include, but are not limited to: Intelligent robots serving humans in daily life or in a hazardous environment, such as home or personal service robots, entertainment robots, education robots, medical robots, healthcare and rehabilitation robots, and rescue robots (Service Robotics); Intelligent robotic functions in the form of embedded systems for applications to, for example, intelligent space, intelligent vehicles and transportation systems, intelligent manufacturing systems, and intelligent medical facilities (Embedded Robotics); The integration of robotics with network technologies, generating such services and solutions as distributed robots, distance robotic education-aides, and virtual laboratories or museums (Networked Robotics).