{"title":"利用 DRL 进行动态环境中机器人机械手的轨迹规划","authors":"Osama Ahmad, Zawar Hussain, Hammad Naeem","doi":"arxiv-2403.16652","DOIUrl":null,"url":null,"abstract":"This study is about the implementation of a reinforcement learning algorithm\nin the trajectory planning of manipulators. We have a 7-DOF robotic arm to pick\nand place the randomly placed block at a random target point in an unknown\nenvironment. The obstacle is randomly moving which creates a hurdle in picking\nthe object. The objective of the robot is to avoid the obstacle and pick the\nblock with constraints to a fixed timestamp. In this literature, we have\napplied a deep deterministic policy gradient (DDPG) algorithm and compared the\nmodel's efficiency with dense and sparse rewards.","PeriodicalId":501062,"journal":{"name":"arXiv - CS - Systems and Control","volume":"12 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Trajectory Planning of Robotic Manipulator in Dynamic Environment Exploiting DRL\",\"authors\":\"Osama Ahmad, Zawar Hussain, Hammad Naeem\",\"doi\":\"arxiv-2403.16652\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study is about the implementation of a reinforcement learning algorithm\\nin the trajectory planning of manipulators. We have a 7-DOF robotic arm to pick\\nand place the randomly placed block at a random target point in an unknown\\nenvironment. The obstacle is randomly moving which creates a hurdle in picking\\nthe object. The objective of the robot is to avoid the obstacle and pick the\\nblock with constraints to a fixed timestamp. In this literature, we have\\napplied a deep deterministic policy gradient (DDPG) algorithm and compared the\\nmodel's efficiency with dense and sparse rewards.\",\"PeriodicalId\":501062,\"journal\":{\"name\":\"arXiv - CS - Systems and Control\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Systems and Control\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2403.16652\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Systems and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2403.16652","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Trajectory Planning of Robotic Manipulator in Dynamic Environment Exploiting DRL
This study is about the implementation of a reinforcement learning algorithm
in the trajectory planning of manipulators. We have a 7-DOF robotic arm to pick
and place the randomly placed block at a random target point in an unknown
environment. The obstacle is randomly moving which creates a hurdle in picking
the object. The objective of the robot is to avoid the obstacle and pick the
block with constraints to a fixed timestamp. In this literature, we have
applied a deep deterministic policy gradient (DDPG) algorithm and compared the
model's efficiency with dense and sparse rewards.