Automated generation of training sets for object recognition in robotic applications

2014 23rd International Conference on Robotics in Alpe-Adria-Danube Region (RAAD) Pub Date : 2014-09-01 DOI:10.1109/RAAD.2014.7002247

Markus Schoeler, F. Wörgötter, Mohamad Javad Aein, T. Kulvicius

{"title":"Automated generation of training sets for object recognition in robotic applications","authors":"Markus Schoeler, F. Wörgötter, Mohamad Javad Aein, T. Kulvicius","doi":"10.1109/RAAD.2014.7002247","DOIUrl":null,"url":null,"abstract":"Object recognition plays an important role in robotics, since objects/tools first have to be identified in the scene before they can be manipulated/used. The performance of object recognition largely depends on the training dataset. Usually such training sets are gathered manually by a human operator, a tedious procedure, which ultimately limits the size of the dataset. One reason for manual selection of samples is that results returned by search engines often contain irrelevant images, mainly due to the problem of homographs (words spelled the same but with different meanings). In this paper we present an automated and unsupervised method, coined Trainingset Cleaning by Translation (TCT), for generation of training sets which are able to deal with the problem of homographs. For disambiguation, it uses the context provided by a command like “tighten the nut” together with a combination of public image searches, text searches and translation services. We compare our approach against plain Google image search qualitatively as well as in a classification task and demonstrate that our method indeed leads to a task-relevant training set, which results in an improvement of 24.1% in object recognition for 12 ambiguous classes. In addition, we present an application of our method to a real robot scenario.","PeriodicalId":205930,"journal":{"name":"2014 23rd International Conference on Robotics in Alpe-Adria-Danube Region (RAAD)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 23rd International Conference on Robotics in Alpe-Adria-Danube Region (RAAD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAAD.2014.7002247","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Object recognition plays an important role in robotics, since objects/tools first have to be identified in the scene before they can be manipulated/used. The performance of object recognition largely depends on the training dataset. Usually such training sets are gathered manually by a human operator, a tedious procedure, which ultimately limits the size of the dataset. One reason for manual selection of samples is that results returned by search engines often contain irrelevant images, mainly due to the problem of homographs (words spelled the same but with different meanings). In this paper we present an automated and unsupervised method, coined Trainingset Cleaning by Translation (TCT), for generation of training sets which are able to deal with the problem of homographs. For disambiguation, it uses the context provided by a command like “tighten the nut” together with a combination of public image searches, text searches and translation services. We compare our approach against plain Google image search qualitatively as well as in a classification task and demonstrate that our method indeed leads to a task-relevant training set, which results in an improvement of 24.1% in object recognition for 12 ambiguous classes. In addition, we present an application of our method to a real robot scenario.

查看原文本刊更多论文

机器人应用中物体识别训练集的自动生成

物体识别在机器人技术中扮演着重要的角色，因为物体/工具首先必须在场景中被识别，然后才能被操纵/使用。目标识别的性能很大程度上取决于训练数据集。通常这样的训练集是由人工操作员手动收集的，这是一个繁琐的过程，最终限制了数据集的大小。手动选择样本的一个原因是，搜索引擎返回的结果通常包含不相关的图像，主要是由于同音异义词的问题(拼写相同但含义不同的单词)。在本文中，我们提出了一种自动化和无监督的方法，即翻译训练集清洗(TCT)，用于生成能够处理同音异义词问题的训练集。为了消除歧义，它使用了“拧紧螺母”这样的命令提供的上下文，以及公共图像搜索、文本搜索和翻译服务的组合。我们将我们的方法与纯谷歌图像搜索进行了定性比较，并在分类任务中进行了比较，并证明我们的方法确实导致了任务相关的训练集，这使得12个模糊类的对象识别提高了24.1%。此外，我们还介绍了我们的方法在真实机器人场景中的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 23rd International Conference on Robotics in Alpe-Adria-Danube Region (RAAD)

自引率

0.00%

发文量