{"title":"通过使用3D和RGB相机数据为社交机器人添加对象操作功能","authors":"G. Mezzina, D. Venuto","doi":"10.1109/SENSORS47087.2021.9639608","DOIUrl":null,"url":null,"abstract":"This paper outlines the design and implementation of novel object manipulation for a social robot, here Pepper by SoftBank Robotics. It is primarily designed for verbal interaction and has therefore not been equipped with object manipulation capabilities. The proposed routine exploits the built-in RGB and 3D cameras. First, semantic segmentation based on the Mini-YOLOv3 neural network is run on the RGB image. Next, 3D sensor data are used to position the hand over the object, implementing a novel routine to grab the object and to scan it for recognition purposes. To preserve patient and location sensitive data, the here-proposed architecture operates automatically and offline, running on the robot’s operating system. Experimental results on 370 grabbing processes showed how the manipulation routine achieves a grabbing success rate of up to 96%. They also proved that the success rate remains unaltered if the target object is positioned in a rectangular area of ± 6 cm × ± 3 cm centered in the nominal position provided by an initial positioning grid. The grabbing success rate remains above 80% even if the object to be grabbed is stored with an angle that ranges between 10° and 45° within the above-reported area.","PeriodicalId":6775,"journal":{"name":"2021 IEEE Sensors","volume":"707 ","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adding Object Manipulation Capabilities to Social Robots by using 3D and RGB Cameras Data\",\"authors\":\"G. Mezzina, D. Venuto\",\"doi\":\"10.1109/SENSORS47087.2021.9639608\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper outlines the design and implementation of novel object manipulation for a social robot, here Pepper by SoftBank Robotics. It is primarily designed for verbal interaction and has therefore not been equipped with object manipulation capabilities. The proposed routine exploits the built-in RGB and 3D cameras. First, semantic segmentation based on the Mini-YOLOv3 neural network is run on the RGB image. Next, 3D sensor data are used to position the hand over the object, implementing a novel routine to grab the object and to scan it for recognition purposes. To preserve patient and location sensitive data, the here-proposed architecture operates automatically and offline, running on the robot’s operating system. Experimental results on 370 grabbing processes showed how the manipulation routine achieves a grabbing success rate of up to 96%. They also proved that the success rate remains unaltered if the target object is positioned in a rectangular area of ± 6 cm × ± 3 cm centered in the nominal position provided by an initial positioning grid. 
The grabbing success rate remains above 80% even if the object to be grabbed is stored with an angle that ranges between 10° and 45° within the above-reported area.\",\"PeriodicalId\":6775,\"journal\":{\"name\":\"2021 IEEE Sensors\",\"volume\":\"707 \",\"pages\":\"1-4\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Sensors\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SENSORS47087.2021.9639608\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Sensors","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SENSORS47087.2021.9639608","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
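To illustrate the RGB-plus-depth targeting step described in the abstract, the sketch below shows one plausible way to turn a detection on the RGB frame into a 3D point over which a robot hand could be positioned: an object detector (a stand-in for Mini-YOLOv3) returns a bounding box, the aligned depth map gives the distance at the box center, and a pinhole back-projection converts that pixel into a camera-frame point. The `detect` callable and the intrinsics `fx, fy, cx, cy` are assumptions for illustration, not the paper's implementation.

```python
# Minimal sketch (not the authors' exact routine) of RGB + depth grasp targeting.
from typing import Callable, Tuple
import numpy as np

BBox = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max) in pixels


def grasp_target_from_rgbd(
    rgb: np.ndarray,                       # H x W x 3 RGB frame
    depth: np.ndarray,                     # H x W depth map in metres, aligned to rgb
    detect: Callable[[np.ndarray], BBox],  # object detector, e.g. a Mini-YOLOv3 wrapper (assumed)
    fx: float, fy: float, cx: float, cy: float,  # assumed pinhole camera intrinsics
) -> np.ndarray:
    """Return an (x, y, z) camera-frame point over which to position the hand."""
    x_min, y_min, x_max, y_max = detect(rgb)
    u = (x_min + x_max) // 2               # pixel at the centre of the detected object
    v = (y_min + y_max) // 2
    # Median over a small window rejects missing or noisy depth readings at the centre.
    win = depth[max(v - 2, 0):v + 3, max(u - 2, 0):u + 3]
    z = float(np.median(win[win > 0]))
    # Pinhole back-projection: pixel (u, v) at depth z -> 3D point in the camera frame.
    x = (u - cx) * z / fx
    y = (v - cy) * z / fy
    return np.array([x, y, z])
```

In practice the returned camera-frame point would still have to be transformed into the robot's base or arm frame before commanding the hand; that step is omitted here because it depends on the robot's kinematic model.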