Active Vision Dataset Benchmark

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) Pub Date : 2018-06-01 DOI:10.1109/CVPRW.2018.00277

Phil Ammirato, A. Berg, J. Kosecka

引用次数: 6

Abstract

Several recent efforts in computer vision indicate a trend toward studying and understanding problems in larger scale environments, beyond single images, and focus on connections to tasks in navigation, mobile manipulation, and visual question answering. A common goal of these tasks is the capability of moving in the environment, acquiring novel views during perception and while performing a task. This capability comes easily in synthetic environments, however achieving the same effect with real images is much more laborious. We propose using the existing Active Vision Dataset to form a benchmark for such problems in a real-world settings with real images. The dataset is well suited for evaluating tasks of multiview active recognition, target driven navigation, and target search, and also can be effective for studying the transfer of strategies learned in simulation to real settings.

查看原文本刊更多论文

主动视觉数据集基准

最近在计算机视觉方面的一些努力表明了一种趋势，即在更大规模的环境中研究和理解问题，而不仅仅是单个图像，并关注与导航、移动操作和视觉问答任务的联系。这些任务的共同目标是在环境中移动的能力，在感知和执行任务时获得新的视图。这种功能在合成环境中很容易实现，但是在真实图像中实现相同的效果要费力得多。我们建议使用现有的主动视觉数据集在真实图像的现实环境中形成此类问题的基准。该数据集非常适合评估多视图主动识别、目标驱动导航和目标搜索任务，也可以有效地研究在模拟中学习到的策略到实际环境的迁移。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

自引率

0.00%

发文量