End-to-End Efficient Indoor Navigation with Optical Flow

Boran Wang, Minghao Gao
{"title":"End-to-End Efficient Indoor Navigation with Optical Flow","authors":"Boran Wang, Minghao Gao","doi":"10.1109/ICSAI57119.2022.10005455","DOIUrl":null,"url":null,"abstract":"There has been a recent interest in employing reinforcement learning for training end-to-end goal-driven robot navigation policies. However, implementing reinforcement learning in end-to-end navigation may result in inefficient policies that exhibit redundant turning actions when attempting to avoid obstacles. This work proposes a two branches network to learn efficient policies with less turning action when robots cross the obstacles. We first employ supervised learning to train a robot action classification network with optical flow. We then combine this classifier with an RGBD optical encoder to develop an action-decision network. Ultimately, we evaluate our approach in a visually realistic simulation environment. The results show that our method can reduce unnecessary steering actions and improve efficiency while ensuring navigation capabilities. We further show that our approach can reduce energy consumption during navigation and extend the robot's work time. Experiment results in the iGibson® simulator over hand-made paths reveal that our method can reduce 13.1% of the action number in the training set and 12.9% in the testing set compared with the baseline approaches. It also can reduce 8.3% energy consumption in the training set and 9.6% in the testing set and only has a 4.2% and 8.1% difference compared with the human path.","PeriodicalId":339547,"journal":{"name":"2022 8th International Conference on Systems and Informatics (ICSAI)","volume":"213 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Conference on Systems and Informatics (ICSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSAI57119.2022.10005455","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

There has been a recent interest in employing reinforcement learning for training end-to-end goal-driven robot navigation policies. However, implementing reinforcement learning in end-to-end navigation may result in inefficient policies that exhibit redundant turning actions when attempting to avoid obstacles. This work proposes a two branches network to learn efficient policies with less turning action when robots cross the obstacles. We first employ supervised learning to train a robot action classification network with optical flow. We then combine this classifier with an RGBD optical encoder to develop an action-decision network. Ultimately, we evaluate our approach in a visually realistic simulation environment. The results show that our method can reduce unnecessary steering actions and improve efficiency while ensuring navigation capabilities. We further show that our approach can reduce energy consumption during navigation and extend the robot's work time. Experiment results in the iGibson® simulator over hand-made paths reveal that our method can reduce 13.1% of the action number in the training set and 12.9% in the testing set compared with the baseline approaches. It also can reduce 8.3% energy consumption in the training set and 9.6% in the testing set and only has a 4.2% and 8.1% difference compared with the human path.
基于光流的端到端高效室内导航
最近,人们对使用强化学习来训练端到端目标驱动的机器人导航策略很感兴趣。然而,在端到端导航中实施强化学习可能会导致低效的策略,在试图避开障碍物时表现出冗余的转向动作。本文提出了一种双分支网络,在机器人穿越障碍物时,以较少的转弯动作来学习有效的策略。我们首先利用监督学习训练了一个带有光流的机器人动作分类网络。然后,我们将该分类器与RGBD光学编码器结合起来开发一个动作决策网络。最后,我们在视觉逼真的模拟环境中评估我们的方法。结果表明,该方法在保证导航能力的同时,减少了不必要的转向动作,提高了效率。我们进一步证明,我们的方法可以减少导航过程中的能量消耗,延长机器人的工作时间。在iGibson®模拟器上手工路径的实验结果表明,与基线方法相比,我们的方法可以减少训练集中13.1%的动作数,减少测试集中12.9%的动作数。在训练集和测试集上分别可以减少8.3%和9.6%的能量消耗,与人类路径相比仅相差4.2%和8.1%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信