基于深度学习的视频流重新定位

Tingting Hu, Hanxu Sun
{"title":"基于深度学习的视频流重新定位","authors":"Tingting Hu, Hanxu Sun","doi":"10.1109/ICSAI.2018.8599392","DOIUrl":null,"url":null,"abstract":"This paper presents a six degree of freedom regression system using convolution neural network(CNN) and long and short term memory network(LSTM) with video stream as network inputs. The system trains the network to regress the 6-DOF robot pose in a transfer learning and end-to-end manner with little training data. Relocalization only using CNN ignore the temporal correlation between image-sequences. In fact, the robot can easily collect continuous image-sequences. Therefore, in this paper, the robot can regress to the 6-DOF pose according to continuous images of different step sizes. Compared with relocalization with a single image, the experimental results show that the network model has the best effect of relocalization when the step size is set to 10 in the indoor scene, and the error of relocalization is minimal.","PeriodicalId":375852,"journal":{"name":"2018 5th International Conference on Systems and Informatics (ICSAI)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Video Stream Relocalization With Deep Learning\",\"authors\":\"Tingting Hu, Hanxu Sun\",\"doi\":\"10.1109/ICSAI.2018.8599392\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a six degree of freedom regression system using convolution neural network(CNN) and long and short term memory network(LSTM) with video stream as network inputs. The system trains the network to regress the 6-DOF robot pose in a transfer learning and end-to-end manner with little training data. Relocalization only using CNN ignore the temporal correlation between image-sequences. In fact, the robot can easily collect continuous image-sequences. Therefore, in this paper, the robot can regress to the 6-DOF pose according to continuous images of different step sizes. Compared with relocalization with a single image, the experimental results show that the network model has the best effect of relocalization when the step size is set to 10 in the indoor scene, and the error of relocalization is minimal.\",\"PeriodicalId\":375852,\"journal\":{\"name\":\"2018 5th International Conference on Systems and Informatics (ICSAI)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 5th International Conference on Systems and Informatics (ICSAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSAI.2018.8599392\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th International Conference on Systems and Informatics (ICSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSAI.2018.8599392","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

提出了一种以视频流为网络输入,采用卷积神经网络(CNN)和长短期记忆网络(LSTM)的六自由度回归系统。该系统在训练数据较少的情况下,以迁移学习和端到端方式训练网络回归六自由度机器人姿态。仅使用CNN进行重定位,忽略了图像序列之间的时间相关性。事实上,机器人可以很容易地收集连续图像序列。因此,在本文中,机器人可以根据不同步长的连续图像回归到六自由度位姿。实验结果表明,在室内场景中,当步长设置为10时,网络模型的重定位效果最好,且重定位误差最小。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Video Stream Relocalization With Deep Learning
This paper presents a six degree of freedom regression system using convolution neural network(CNN) and long and short term memory network(LSTM) with video stream as network inputs. The system trains the network to regress the 6-DOF robot pose in a transfer learning and end-to-end manner with little training data. Relocalization only using CNN ignore the temporal correlation between image-sequences. In fact, the robot can easily collect continuous image-sequences. Therefore, in this paper, the robot can regress to the 6-DOF pose according to continuous images of different step sizes. Compared with relocalization with a single image, the experimental results show that the network model has the best effect of relocalization when the step size is set to 10 in the indoor scene, and the error of relocalization is minimal.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信