{"title":"基于卷积神经网络同时训练多数据集的摄像机再定位","authors":"Yixin Wang, Erwu Liu, Rui Wang","doi":"10.1145/3432291.3432296","DOIUrl":null,"url":null,"abstract":"With the advances of Convolutional Neural Networks (CNN) in computer vision, rapid progresses have been taken in camera re-localization. In this paper we propose a new network, a multi-dataset simultaneously training network (MdNet) to relocate camera pose from an RGB image. Moreover, we propose to construct new loss functions to learn camera pose, image segmentation and images depth maps from the multi-datasets. Compared with PoseNet Geometric Loss in street dataset, position and orientation accuracy are increased by 52% and 35% respectively. Experiment shows that our method outperforms other prior similar works in large-scale scenarios and is more robust under different season or time conditions.","PeriodicalId":126684,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Signal Processing and Machine Learning","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Camera Re-localization by Training Multi-dataset Simultaneously via Convolutional Neural Network\",\"authors\":\"Yixin Wang, Erwu Liu, Rui Wang\",\"doi\":\"10.1145/3432291.3432296\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advances of Convolutional Neural Networks (CNN) in computer vision, rapid progresses have been taken in camera re-localization. In this paper we propose a new network, a multi-dataset simultaneously training network (MdNet) to relocate camera pose from an RGB image. Moreover, we propose to construct new loss functions to learn camera pose, image segmentation and images depth maps from the multi-datasets. Compared with PoseNet Geometric Loss in street dataset, position and orientation accuracy are increased by 52% and 35% respectively. Experiment shows that our method outperforms other prior similar works in large-scale scenarios and is more robust under different season or time conditions.\",\"PeriodicalId\":126684,\"journal\":{\"name\":\"Proceedings of the 2020 3rd International Conference on Signal Processing and Machine Learning\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 3rd International Conference on Signal Processing and Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3432291.3432296\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 3rd International Conference on Signal Processing and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3432291.3432296","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Camera Re-localization by Training Multi-dataset Simultaneously via Convolutional Neural Network
With the advances of Convolutional Neural Networks (CNN) in computer vision, rapid progresses have been taken in camera re-localization. In this paper we propose a new network, a multi-dataset simultaneously training network (MdNet) to relocate camera pose from an RGB image. Moreover, we propose to construct new loss functions to learn camera pose, image segmentation and images depth maps from the multi-datasets. Compared with PoseNet Geometric Loss in street dataset, position and orientation accuracy are increased by 52% and 35% respectively. Experiment shows that our method outperforms other prior similar works in large-scale scenarios and is more robust under different season or time conditions.