{"title":"基于深度学习的视频流重新定位","authors":"Tingting Hu, Hanxu Sun","doi":"10.1109/ICSAI.2018.8599392","DOIUrl":null,"url":null,"abstract":"This paper presents a six degree of freedom regression system using convolution neural network(CNN) and long and short term memory network(LSTM) with video stream as network inputs. The system trains the network to regress the 6-DOF robot pose in a transfer learning and end-to-end manner with little training data. Relocalization only using CNN ignore the temporal correlation between image-sequences. In fact, the robot can easily collect continuous image-sequences. Therefore, in this paper, the robot can regress to the 6-DOF pose according to continuous images of different step sizes. Compared with relocalization with a single image, the experimental results show that the network model has the best effect of relocalization when the step size is set to 10 in the indoor scene, and the error of relocalization is minimal.","PeriodicalId":375852,"journal":{"name":"2018 5th International Conference on Systems and Informatics (ICSAI)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Video Stream Relocalization With Deep Learning\",\"authors\":\"Tingting Hu, Hanxu Sun\",\"doi\":\"10.1109/ICSAI.2018.8599392\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a six degree of freedom regression system using convolution neural network(CNN) and long and short term memory network(LSTM) with video stream as network inputs. The system trains the network to regress the 6-DOF robot pose in a transfer learning and end-to-end manner with little training data. Relocalization only using CNN ignore the temporal correlation between image-sequences. In fact, the robot can easily collect continuous image-sequences. Therefore, in this paper, the robot can regress to the 6-DOF pose according to continuous images of different step sizes. Compared with relocalization with a single image, the experimental results show that the network model has the best effect of relocalization when the step size is set to 10 in the indoor scene, and the error of relocalization is minimal.\",\"PeriodicalId\":375852,\"journal\":{\"name\":\"2018 5th International Conference on Systems and Informatics (ICSAI)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 5th International Conference on Systems and Informatics (ICSAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSAI.2018.8599392\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th International Conference on Systems and Informatics (ICSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSAI.2018.8599392","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This paper presents a six degree of freedom regression system using convolution neural network(CNN) and long and short term memory network(LSTM) with video stream as network inputs. The system trains the network to regress the 6-DOF robot pose in a transfer learning and end-to-end manner with little training data. Relocalization only using CNN ignore the temporal correlation between image-sequences. In fact, the robot can easily collect continuous image-sequences. Therefore, in this paper, the robot can regress to the 6-DOF pose according to continuous images of different step sizes. Compared with relocalization with a single image, the experimental results show that the network model has the best effect of relocalization when the step size is set to 10 in the indoor scene, and the error of relocalization is minimal.