{"title":"基于轻量级深度多模态远程学习的机器人真实世界物体识别","authors":"Xu Zhang, Bin Xue, Feng Jing","doi":"10.1109/acirs49895.2020.9162600","DOIUrl":null,"url":null,"abstract":"Real-world object recognition is an important and difficult robot vision problem. In this paper, a real-world multi-angle and multi-attitude deformable object recognition method for robot system, named RCOR, is proposed based on lightweight deep multimodal distance learning (DMDL). (1) Deep multimodal convolutional neural network (DMCNN) is proposed to improve the transformation abilities of CNNs and enhance feature maps’ resolutions. (2) Deep distance metric learning (DDML) is presented to relieve the problem of lacking adequate labeled data and efficiently reduce redundancy. (3) To apply RCOR into embedded vision applications in real-world environment, a light weight DCNN, Mobile-XB, is proposed. Extensive experiments demonstrate that the proposed approach significantly outperforms state-of-the-arts. And it performs well on computationally limited platforms.","PeriodicalId":293428,"journal":{"name":"2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)","volume":"171 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Real-World Object Recognition for Robot Based on Lightweight Deep Multimodal Distance Learning\",\"authors\":\"Xu Zhang, Bin Xue, Feng Jing\",\"doi\":\"10.1109/acirs49895.2020.9162600\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Real-world object recognition is an important and difficult robot vision problem. In this paper, a real-world multi-angle and multi-attitude deformable object recognition method for robot system, named RCOR, is proposed based on lightweight deep multimodal distance learning (DMDL). (1) Deep multimodal convolutional neural network (DMCNN) is proposed to improve the transformation abilities of CNNs and enhance feature maps’ resolutions. (2) Deep distance metric learning (DDML) is presented to relieve the problem of lacking adequate labeled data and efficiently reduce redundancy. (3) To apply RCOR into embedded vision applications in real-world environment, a light weight DCNN, Mobile-XB, is proposed. Extensive experiments demonstrate that the proposed approach significantly outperforms state-of-the-arts. And it performs well on computationally limited platforms.\",\"PeriodicalId\":293428,\"journal\":{\"name\":\"2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)\",\"volume\":\"171 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/acirs49895.2020.9162600\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/acirs49895.2020.9162600","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Real-World Object Recognition for Robot Based on Lightweight Deep Multimodal Distance Learning
Real-world object recognition is an important and difficult robot vision problem. In this paper, a real-world multi-angle and multi-attitude deformable object recognition method for robot system, named RCOR, is proposed based on lightweight deep multimodal distance learning (DMDL). (1) Deep multimodal convolutional neural network (DMCNN) is proposed to improve the transformation abilities of CNNs and enhance feature maps’ resolutions. (2) Deep distance metric learning (DDML) is presented to relieve the problem of lacking adequate labeled data and efficiently reduce redundancy. (3) To apply RCOR into embedded vision applications in real-world environment, a light weight DCNN, Mobile-XB, is proposed. Extensive experiments demonstrate that the proposed approach significantly outperforms state-of-the-arts. And it performs well on computationally limited platforms.