基于残差深度强化学习的动态人类环境端到端移动机器人导航

2022 18th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA) Pub Date : 2022-11-28 DOI:10.1109/MESA55290.2022.10004394

Abdullah Ahmed, Yasser F. O. Mohammad, V. Parque, Haitham El-Hussieny, S. Ahmed

{"title":"基于残差深度强化学习的动态人类环境端到端移动机器人导航","authors":"Abdullah Ahmed, Yasser F. O. Mohammad, V. Parque, Haitham El-Hussieny, S. Ahmed","doi":"10.1109/MESA55290.2022.10004394","DOIUrl":null,"url":null,"abstract":"Safe navigation through human crowds is key to enabling practical mobility ubiquitously. The Deep Reinforcement Learning (DRL) and the End-to-End (E2E) approaches to goal-oriented robot navigation have the potential to render policies able to tackle localization, path planning, obstacle avoidance, and adaptation to change in unison. In this paper, we report an architecture based on convolutional units and residual blocks being able to enhance adaptability to unseen and dynamic human environments. In particular, our scheme outperformed the state-of-the-art baselines SOADRL and NAVREP by about 13% and 18% on average success rate, respectively, throughout 27 unseen and dynamic navigation instances. Furthermore, our approach avoids the explicit encoding of positions and trajectories of moving humans compared to the standard models. Our results show the potential to render adaptive and generalizable policies for unknown and dynamic human environments.","PeriodicalId":410029,"journal":{"name":"2022 18th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"End-to-End Mobile Robot Navigation using a Residual Deep Reinforcement Learning in Dynamic Human Environments\",\"authors\":\"Abdullah Ahmed, Yasser F. O. Mohammad, V. Parque, Haitham El-Hussieny, S. Ahmed\",\"doi\":\"10.1109/MESA55290.2022.10004394\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Safe navigation through human crowds is key to enabling practical mobility ubiquitously. The Deep Reinforcement Learning (DRL) and the End-to-End (E2E) approaches to goal-oriented robot navigation have the potential to render policies able to tackle localization, path planning, obstacle avoidance, and adaptation to change in unison. In this paper, we report an architecture based on convolutional units and residual blocks being able to enhance adaptability to unseen and dynamic human environments. In particular, our scheme outperformed the state-of-the-art baselines SOADRL and NAVREP by about 13% and 18% on average success rate, respectively, throughout 27 unseen and dynamic navigation instances. Furthermore, our approach avoids the explicit encoding of positions and trajectories of moving humans compared to the standard models. Our results show the potential to render adaptive and generalizable policies for unknown and dynamic human environments.\",\"PeriodicalId\":410029,\"journal\":{\"name\":\"2022 18th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA)\",\"volume\":\"114 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 18th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MESA55290.2022.10004394\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 18th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MESA55290.2022.10004394","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

通过人群的安全导航是实现无处不在的实际机动性的关键。面向目标的机器人导航的深度强化学习(DRL)和端到端(E2E)方法有可能提供能够解决定位、路径规划、避障和适应一致变化的策略。在本文中，我们报告了一种基于卷积单元和残差块的架构，能够增强对不可见和动态人类环境的适应性。特别是，在27个不可见的和动态的导航实例中，我们的方案的平均成功率分别比最先进的基线SOADRL和NAVREP高出13%和18%。此外，与标准模型相比，我们的方法避免了对移动人类的位置和轨迹的显式编码。我们的研究结果显示了为未知和动态的人类环境提供适应性和可推广策略的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

End-to-End Mobile Robot Navigation using a Residual Deep Reinforcement Learning in Dynamic Human Environments

Safe navigation through human crowds is key to enabling practical mobility ubiquitously. The Deep Reinforcement Learning (DRL) and the End-to-End (E2E) approaches to goal-oriented robot navigation have the potential to render policies able to tackle localization, path planning, obstacle avoidance, and adaptation to change in unison. In this paper, we report an architecture based on convolutional units and residual blocks being able to enhance adaptability to unseen and dynamic human environments. In particular, our scheme outperformed the state-of-the-art baselines SOADRL and NAVREP by about 13% and 18% on average success rate, respectively, throughout 27 unseen and dynamic navigation instances. Furthermore, our approach avoids the explicit encoding of positions and trajectories of moving humans compared to the standard models. Our results show the potential to render adaptive and generalizable policies for unknown and dynamic human environments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 18th IEEE/ASME International Conference on Mechatronic and Embedded Systems and Applications (MESA)

自引率

0.00%

发文量