利用 DRL 进行动态环境中机器人机械手的轨迹规划

arXiv - CS - Systems and Control Pub Date : 2024-03-25 DOI:arxiv-2403.16652

Osama Ahmad, Zawar Hussain, Hammad Naeem

{"title":"利用 DRL 进行动态环境中机器人机械手的轨迹规划","authors":"Osama Ahmad, Zawar Hussain, Hammad Naeem","doi":"arxiv-2403.16652","DOIUrl":null,"url":null,"abstract":"This study is about the implementation of a reinforcement learning algorithm\nin the trajectory planning of manipulators. We have a 7-DOF robotic arm to pick\nand place the randomly placed block at a random target point in an unknown\nenvironment. The obstacle is randomly moving which creates a hurdle in picking\nthe object. The objective of the robot is to avoid the obstacle and pick the\nblock with constraints to a fixed timestamp. In this literature, we have\napplied a deep deterministic policy gradient (DDPG) algorithm and compared the\nmodel's efficiency with dense and sparse rewards.","PeriodicalId":501062,"journal":{"name":"arXiv - CS - Systems and Control","volume":"12 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Trajectory Planning of Robotic Manipulator in Dynamic Environment Exploiting DRL\",\"authors\":\"Osama Ahmad, Zawar Hussain, Hammad Naeem\",\"doi\":\"arxiv-2403.16652\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study is about the implementation of a reinforcement learning algorithm\\nin the trajectory planning of manipulators. We have a 7-DOF robotic arm to pick\\nand place the randomly placed block at a random target point in an unknown\\nenvironment. The obstacle is randomly moving which creates a hurdle in picking\\nthe object. The objective of the robot is to avoid the obstacle and pick the\\nblock with constraints to a fixed timestamp. In this literature, we have\\napplied a deep deterministic policy gradient (DDPG) algorithm and compared the\\nmodel's efficiency with dense and sparse rewards.\",\"PeriodicalId\":501062,\"journal\":{\"name\":\"arXiv - CS - Systems and Control\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Systems and Control\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2403.16652\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Systems and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2403.16652","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本研究是关于强化学习算法在机械手轨迹规划中的应用。我们有一个 7-DOF 机械臂，要在未知环境中将随机放置的木块拾取并放置到随机目标点。障碍物是随机移动的，这给拾取物体造成了障碍。机器人的目标是避开障碍物，并在固定时间戳的约束下拾取木块。在这篇文献中，我们应用了一种深度确定性策略梯度（DDPG）算法，并比较了该算法在密集奖励和稀疏奖励情况下的效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Trajectory Planning of Robotic Manipulator in Dynamic Environment Exploiting DRL

This study is about the implementation of a reinforcement learning algorithm in the trajectory planning of manipulators. We have a 7-DOF robotic arm to pick and place the randomly placed block at a random target point in an unknown environment. The obstacle is randomly moving which creates a hurdle in picking the object. The objective of the robot is to avoid the obstacle and pick the block with constraints to a fixed timestamp. In this literature, we have applied a deep deterministic policy gradient (DDPG) algorithm and compared the model's efficiency with dense and sparse rewards.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - CS - Systems and Control

自引率

0.00%

发文量