基于轻量级深度多模态远程学习的机器人真实世界物体识别

2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS) Pub Date : 2020-07-01 DOI:10.1109/acirs49895.2020.9162600

Xu Zhang, Bin Xue, Feng Jing

{"title":"基于轻量级深度多模态远程学习的机器人真实世界物体识别","authors":"Xu Zhang, Bin Xue, Feng Jing","doi":"10.1109/acirs49895.2020.9162600","DOIUrl":null,"url":null,"abstract":"Real-world object recognition is an important and difficult robot vision problem. In this paper, a real-world multi-angle and multi-attitude deformable object recognition method for robot system, named RCOR, is proposed based on lightweight deep multimodal distance learning (DMDL). (1) Deep multimodal convolutional neural network (DMCNN) is proposed to improve the transformation abilities of CNNs and enhance feature maps’ resolutions. (2) Deep distance metric learning (DDML) is presented to relieve the problem of lacking adequate labeled data and efficiently reduce redundancy. (3) To apply RCOR into embedded vision applications in real-world environment, a light weight DCNN, Mobile-XB, is proposed. Extensive experiments demonstrate that the proposed approach significantly outperforms state-of-the-arts. And it performs well on computationally limited platforms.","PeriodicalId":293428,"journal":{"name":"2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)","volume":"171 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Real-World Object Recognition for Robot Based on Lightweight Deep Multimodal Distance Learning\",\"authors\":\"Xu Zhang, Bin Xue, Feng Jing\",\"doi\":\"10.1109/acirs49895.2020.9162600\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Real-world object recognition is an important and difficult robot vision problem. In this paper, a real-world multi-angle and multi-attitude deformable object recognition method for robot system, named RCOR, is proposed based on lightweight deep multimodal distance learning (DMDL). (1) Deep multimodal convolutional neural network (DMCNN) is proposed to improve the transformation abilities of CNNs and enhance feature maps’ resolutions. (2) Deep distance metric learning (DDML) is presented to relieve the problem of lacking adequate labeled data and efficiently reduce redundancy. (3) To apply RCOR into embedded vision applications in real-world environment, a light weight DCNN, Mobile-XB, is proposed. Extensive experiments demonstrate that the proposed approach significantly outperforms state-of-the-arts. And it performs well on computationally limited platforms.\",\"PeriodicalId\":293428,\"journal\":{\"name\":\"2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)\",\"volume\":\"171 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/acirs49895.2020.9162600\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/acirs49895.2020.9162600","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

真实世界物体识别是机器人视觉领域的一个重要而又困难的问题。本文提出了一种基于轻量级深度多模态远程学习(DMDL)的机器人系统多角度多姿态可变形物体识别方法RCOR。(1)提出深度多模态卷积神经网络(Deep multimodal convolutional neural network, DMCNN)，提高cnn的变换能力，增强特征图的分辨率。(2)提出深度距离度量学习(Deep distance metric learning, DDML)，解决了标记数据不足的问题，有效地减少了冗余。(3)为了将RCOR应用于现实环境中的嵌入式视觉应用，提出了一种轻量级的DCNN Mobile-XB。大量的实验表明，所提出的方法明显优于最先进的方法。而且它在计算能力有限的平台上表现良好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Real-World Object Recognition for Robot Based on Lightweight Deep Multimodal Distance Learning

Real-world object recognition is an important and difficult robot vision problem. In this paper, a real-world multi-angle and multi-attitude deformable object recognition method for robot system, named RCOR, is proposed based on lightweight deep multimodal distance learning (DMDL). (1) Deep multimodal convolutional neural network (DMCNN) is proposed to improve the transformation abilities of CNNs and enhance feature maps’ resolutions. (2) Deep distance metric learning (DDML) is presented to relieve the problem of lacking adequate labeled data and efficiently reduce redundancy. (3) To apply RCOR into embedded vision applications in real-world environment, a light weight DCNN, Mobile-XB, is proposed. Extensive experiments demonstrate that the proposed approach significantly outperforms state-of-the-arts. And it performs well on computationally limited platforms.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS)

自引率

0.00%

发文量