{"title":"基于目标检测网络的实时语义SLAM研究","authors":"Juan Fang, Zhenhu Fang","doi":"10.1145/3449301.3449306","DOIUrl":null,"url":null,"abstract":"Simultaneous localization and mapping (SLAM) is an important technology in the field of robotics. Semantic SLAM can provide a more accurate localization and satisfy the needs of complex applications, which has become a research hot spot. In this paper, we propose a real-time semantic SLAM system that uses semantic information in loop closure detection and mapping. In the system, we use object detection network to get semantic information including bounding box and category. In loop closure detection, we only use semantic information to construct feature structure, and implement feature comparison. Compared with the Bag of Visual Words (BoVW), the proposed approach does not need to generate vocabulary, holds very less amount of memory. Besides, we propose a fast semantic segmentation method combining bounding box and RGB-D image to create semantic OctoMap. Finally, we evaluate our semantic SLAM. Experimental results show that compared with BoVW, the proposed loop closure detection algorithm is about 20% higher in accuracy under the same recall rate. The semantic segmentation results reveal that the mean pixel accuracy of our method on NYUv2 dataset is 0.67.And our system takes 150ms to infer an image with the size of 640480 on MX150 GPU,i7-8550U CPU, which can be used in real-time visual SLAM.","PeriodicalId":429684,"journal":{"name":"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research on Real-Time Semantic SLAM Based on Object Detection Network\",\"authors\":\"Juan Fang, Zhenhu Fang\",\"doi\":\"10.1145/3449301.3449306\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Simultaneous localization and mapping (SLAM) is an important technology in the field of robotics. Semantic SLAM can provide a more accurate localization and satisfy the needs of complex applications, which has become a research hot spot. In this paper, we propose a real-time semantic SLAM system that uses semantic information in loop closure detection and mapping. In the system, we use object detection network to get semantic information including bounding box and category. In loop closure detection, we only use semantic information to construct feature structure, and implement feature comparison. Compared with the Bag of Visual Words (BoVW), the proposed approach does not need to generate vocabulary, holds very less amount of memory. Besides, we propose a fast semantic segmentation method combining bounding box and RGB-D image to create semantic OctoMap. Finally, we evaluate our semantic SLAM. Experimental results show that compared with BoVW, the proposed loop closure detection algorithm is about 20% higher in accuracy under the same recall rate. The semantic segmentation results reveal that the mean pixel accuracy of our method on NYUv2 dataset is 0.67.And our system takes 150ms to infer an image with the size of 640480 on MX150 GPU,i7-8550U CPU, which can be used in real-time visual SLAM.\",\"PeriodicalId\":429684,\"journal\":{\"name\":\"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence\",\"volume\":\"68 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3449301.3449306\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3449301.3449306","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on Real-Time Semantic SLAM Based on Object Detection Network
Simultaneous localization and mapping (SLAM) is an important technology in the field of robotics. Semantic SLAM can provide a more accurate localization and satisfy the needs of complex applications, which has become a research hot spot. In this paper, we propose a real-time semantic SLAM system that uses semantic information in loop closure detection and mapping. In the system, we use object detection network to get semantic information including bounding box and category. In loop closure detection, we only use semantic information to construct feature structure, and implement feature comparison. Compared with the Bag of Visual Words (BoVW), the proposed approach does not need to generate vocabulary, holds very less amount of memory. Besides, we propose a fast semantic segmentation method combining bounding box and RGB-D image to create semantic OctoMap. Finally, we evaluate our semantic SLAM. Experimental results show that compared with BoVW, the proposed loop closure detection algorithm is about 20% higher in accuracy under the same recall rate. The semantic segmentation results reveal that the mean pixel accuracy of our method on NYUv2 dataset is 0.67.And our system takes 150ms to infer an image with the size of 640480 on MX150 GPU,i7-8550U CPU, which can be used in real-time visual SLAM.