Visual detection of objects by mobile agents using CBVIR techniques of low complexity

2015 Federated Conference on Computer Science and Information Systems (FedCSIS) Pub Date : 2015-10-11 DOI:10.15439/2015F85

A. Sluzek

{"title":"Visual detection of objects by mobile agents using CBVIR techniques of low complexity","authors":"A. Sluzek","doi":"10.15439/2015F85","DOIUrl":null,"url":null,"abstract":"Visual search for objects of interest in complex environment is an important (and still challenging) problem in mobile robotics. In particular, the usage of content-based visual information retrieval (CBVIR) methods, which are a natural choice for such tasks, is often constrained by the real-time requirements, and the mobility of searching agents is sometimes not sufficiently exploited in the search model. In this paper, a CBVIR-based scheme is proposed, which takes into account motion of the searching agents to achieve a low-cost and high-speed detection of objects of interest in cluttered scenes, with good overall performances. We combine standard CBVIR tools, i.e. MSER detector and SIFT descriptor (quantized into sufficiently large vocabularies) assuming additionally that objects become objects of interest only when approached closely enough by the mobile agent, i.e. when seen at an adequately large scale. Thus, an object of interest is considered detected only if a sufficient number of keypoints from the current video-frame are matched (including the corresponding matches of scales) to the keypoints from the database images of the object. Preliminary experiments on a limited-size dataset confirm performances of the scheme, although in the classical task of video-frame retrieval the scheme cannot compete with more sophisticated CBVIR algorithms. The scheme can prospectively become more flexible if combined with a range-finding device so that the approximate distances to the scene components within the currently inspected part of the image can be used to proportionally modify the scale correspondences.","PeriodicalId":276884,"journal":{"name":"2015 Federated Conference on Computer Science and Information Systems (FedCSIS)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Federated Conference on Computer Science and Information Systems (FedCSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15439/2015F85","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Visual search for objects of interest in complex environment is an important (and still challenging) problem in mobile robotics. In particular, the usage of content-based visual information retrieval (CBVIR) methods, which are a natural choice for such tasks, is often constrained by the real-time requirements, and the mobility of searching agents is sometimes not sufficiently exploited in the search model. In this paper, a CBVIR-based scheme is proposed, which takes into account motion of the searching agents to achieve a low-cost and high-speed detection of objects of interest in cluttered scenes, with good overall performances. We combine standard CBVIR tools, i.e. MSER detector and SIFT descriptor (quantized into sufficiently large vocabularies) assuming additionally that objects become objects of interest only when approached closely enough by the mobile agent, i.e. when seen at an adequately large scale. Thus, an object of interest is considered detected only if a sufficient number of keypoints from the current video-frame are matched (including the corresponding matches of scales) to the keypoints from the database images of the object. Preliminary experiments on a limited-size dataset confirm performances of the scheme, although in the classical task of video-frame retrieval the scheme cannot compete with more sophisticated CBVIR algorithms. The scheme can prospectively become more flexible if combined with a range-finding device so that the approximate distances to the scene components within the currently inspected part of the image can be used to proportionally modify the scale correspondences.

查看原文本刊更多论文

使用低复杂度CBVIR技术的移动代理对目标进行视觉检测

复杂环境中感兴趣对象的视觉搜索是移动机器人研究中的一个重要问题。特别是，基于内容的视觉信息检索(CBVIR)方法的使用往往受到实时需求的限制，并且搜索代理的移动性有时在搜索模型中没有得到充分利用。本文提出了一种基于cbvir的方案，该方案考虑了搜索agent的运动特性，实现了低成本、高速的杂乱场景中感兴趣目标的检测，并具有良好的综合性能。我们结合了标准的CBVIR工具，即MSER检测器和SIFT描述符(量化为足够大的词汇表)，另外假设对象只有在移动代理足够接近时才会成为感兴趣的对象，即在足够大的范围内看到。因此，只有当当前视频帧中有足够数量的关键点与该对象的数据库图像中的关键点相匹配(包括相应的尺度匹配)时，才认为检测到感兴趣的对象。在有限大小的数据集上进行的初步实验证实了该方案的性能，尽管在视频帧检索的经典任务中，该方案无法与更复杂的CBVIR算法竞争。如果与测距设备相结合，该方案可以变得更加灵活，以便在当前被检查的图像部分内与场景组件的近似距离可以用于按比例修改比例对应。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 Federated Conference on Computer Science and Information Systems (FedCSIS)

自引率

0.00%

发文量