{"title":"Real-World Object Recognition for Robot Based on Lightweight Deep Multimodal Distance Learning","authors":"Xu Zhang, Bin Xue, Feng Jing","doi":"10.1109/acirs49895.2020.9162600","DOIUrl":null,"url":null,"abstract":"Real-world object recognition is an important and difficult robot vision problem. In this paper, a real-world multi-angle and multi-attitude deformable object recognition method for robot system, named RCOR, is proposed based on lightweight deep multimodal distance learning (DMDL). (1) Deep multimodal convolutional neural network (DMCNN) is proposed to improve the transformation abilities of CNNs and enhance feature maps’ resolutions. (2) Deep distance metric learning (DDML) is presented to relieve the problem of lacking adequate labeled data and efficiently reduce redundancy. (3) To apply RCOR into embedded vision applications in real-world environment, a light weight DCNN, Mobile-XB, is proposed. Extensive experiments demonstrate that the proposed approach significantly outperforms state-of-the-arts. And it performs well on computationally limited platforms.","PeriodicalId":293428,"journal":{"name":"2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)","volume":"171 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/acirs49895.2020.9162600","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Real-world object recognition is an important and difficult robot vision problem. In this paper, a real-world multi-angle and multi-attitude deformable object recognition method for robot system, named RCOR, is proposed based on lightweight deep multimodal distance learning (DMDL). (1) Deep multimodal convolutional neural network (DMCNN) is proposed to improve the transformation abilities of CNNs and enhance feature maps’ resolutions. (2) Deep distance metric learning (DDML) is presented to relieve the problem of lacking adequate labeled data and efficiently reduce redundancy. (3) To apply RCOR into embedded vision applications in real-world environment, a light weight DCNN, Mobile-XB, is proposed. Extensive experiments demonstrate that the proposed approach significantly outperforms state-of-the-arts. And it performs well on computationally limited platforms.