Deep-Reinforcement-Learning-Based Path Planning for Industrial Robots Using Distance Sensors as Observation

Teham Bhuiyan, Linh Kästner, Yifan Hu, Benno Kutschank, Jens Lambrecht
{"title":"Deep-Reinforcement-Learning-Based Path Planning for Industrial Robots Using Distance Sensors as Observation","authors":"Teham Bhuiyan, Linh Kästner, Yifan Hu, Benno Kutschank, Jens Lambrecht","doi":"10.1109/ICCRE57112.2023.10155608","DOIUrl":null,"url":null,"abstract":"Traditionally, collision-free path planning for industrial robots is realized by sampling-based algorithms such as RRT (Rapidly-exploring Random Tree), PRM (Probabilistic Roadmap), etc. Sampling-based algorithms require long computation times, especially in complex environments. Furthermore, the environment in which they are employed needs to be known beforehand. When utilizing these approaches in new environments, a tedious engineering effort in setting hyperparameters needs to be conducted, which is time- and cost-intensive. On the other hand, DRL (Deep Reinforcement Learning) has shown remarkable results in dealing with complex environments, generalizing new problem instances, and solving motion planning problems efficiently. On that account, this paper proposes a Deep-Reinforcement-Learning-based motion planner for robotic manipulators. We propose an easily reproducible method to train an agent in randomized scenarios achieving generalization for unknown environments. We evaluated our model against state-of-the-art sampling- and DRL-based planners in several experiments containing static and dynamic obstacles. Results show the adaptability of our agent in new environments and the superiority in terms of path length and execution time compared to conventional methods. 
Our code is available on GitHub [1].","PeriodicalId":285164,"journal":{"name":"2023 8th International Conference on Control and Robotics Engineering (ICCRE)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 8th International Conference on Control and Robotics Engineering (ICCRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCRE57112.2023.10155608","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Traditionally, collision-free path planning for industrial robots is realized by sampling-based algorithms such as RRT (Rapidly-exploring Random Tree), PRM (Probabilistic Roadmap), etc. Sampling-based algorithms require long computation times, especially in complex environments. Furthermore, the environment in which they are employed needs to be known beforehand. When utilizing these approaches in new environments, a tedious engineering effort in setting hyperparameters needs to be conducted, which is time- and cost-intensive. On the other hand, DRL (Deep Reinforcement Learning) has shown remarkable results in dealing with complex environments, generalizing new problem instances, and solving motion planning problems efficiently. On that account, this paper proposes a Deep-Reinforcement-Learning-based motion planner for robotic manipulators. We propose an easily reproducible method to train an agent in randomized scenarios achieving generalization for unknown environments. We evaluated our model against state-of-the-art sampling- and DRL-based planners in several experiments containing static and dynamic obstacles. Results show the adaptability of our agent in new environments and the superiority in terms of path length and execution time compared to conventional methods. Our code is available on GitHub [1].
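To make the contrast concrete, below is a minimal 2-D sketch of the RRT idea the abstract refers to as the conventional baseline. It is an illustration only, not the paper's implementation: the obstacle test, step size, goal bias, and bounds are all assumed for the example.

```python
import math
import random

def rrt(start, goal, is_free, bounds, step=0.5, goal_tol=0.5, max_iters=5000, seed=0):
    """Minimal 2-D RRT: grow a tree from `start` by steering toward random
    samples; return a path once a node lands within `goal_tol` of `goal`."""
    rng = random.Random(seed)
    nodes = [start]
    parent = {0: None}
    (xmin, xmax), (ymin, ymax) = bounds
    for _ in range(max_iters):
        # Sample a random point (with a small goal bias to speed convergence).
        q = goal if rng.random() < 0.05 else (rng.uniform(xmin, xmax),
                                              rng.uniform(ymin, ymax))
        # Find the nearest tree node and steer one step toward the sample.
        i_near = min(range(len(nodes)), key=lambda i: math.dist(nodes[i], q))
        near = nodes[i_near]
        d = math.dist(near, q)
        if d == 0:
            continue
        new = (near[0] + step * (q[0] - near[0]) / d,
               near[1] + step * (q[1] - near[1]) / d)
        if not is_free(new):
            continue  # skip nodes that collide with an obstacle
        parent[len(nodes)] = i_near
        nodes.append(new)
        if math.dist(new, goal) < goal_tol:
            # Walk parents back to the root to recover the path.
            path, i = [], len(nodes) - 1
            while i is not None:
                path.append(nodes[i])
                i = parent[i]
            return path[::-1]
    return None  # no path found within the iteration budget

# Usage: plan from (0, 0) to (10, 10) around a circular obstacle at (5, 5).
free = lambda p: math.dist(p, (5.0, 5.0)) > 1.5
path = rrt((0.0, 0.0), (10.0, 10.0), free, bounds=((0, 10), (0, 10)))
```

The per-query sampling loop is exactly the cost the abstract criticizes: every new environment or query reruns the search from scratch, whereas a trained DRL policy amortizes planning into a forward pass over sensor observations.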