Spatio-Semantic ConvNet-Based Visual Place Recognition

Luis G. Camara, L. Preucil
{"title":"Spatio-Semantic ConvNet-Based Visual Place Recognition","authors":"Luis G. Camara, L. Preucil","doi":"10.1109/ECMR.2019.8870948","DOIUrl":null,"url":null,"abstract":"We present a Visual Place Recognition system that follows the two-stage format common to image retrieval pipelines. The system encodes images of places by employing the activations of different layers of a pre-trained, off-the-shelf, VGG16 Convolutional Neural Network (CNN) architecture. In the first stage of our method and given a query image of a place, a number of top candidate images is retrieved from a previously stored database of places. In the second stage, we propose an exhaustive comparison of the query image against these candidates by encoding semantic and spatial information in the form of CNN features. Results from our approach outperform by a large margin state-of-the-art visual place recognition methods on five of the most commonly used benchmark datasets. The performance gain is especially remarkable on the most challenging datasets, with more than a twofold recognition improvement with respect to the latest published work.","PeriodicalId":435630,"journal":{"name":"2019 European Conference on Mobile Robots (ECMR)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 European Conference on Mobile Robots (ECMR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECMR.2019.8870948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

We present a Visual Place Recognition system that follows the two-stage format common to image retrieval pipelines. The system encodes images of places using the activations of different layers of a pre-trained, off-the-shelf VGG16 Convolutional Neural Network (CNN). In the first stage, given a query image of a place, a number of top candidate images are retrieved from a previously stored database of places. In the second stage, we perform an exhaustive comparison of the query image against these candidates by encoding semantic and spatial information in the form of CNN features. Our approach outperforms state-of-the-art visual place recognition methods by a large margin on five of the most commonly used benchmark datasets. The performance gain is especially remarkable on the most challenging datasets, with more than a twofold recognition improvement with respect to the latest published work.
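The sketch below illustrates the two-stage structure described in the abstract: a global descriptor built from off-the-shelf VGG16 activations for fast candidate retrieval, followed by an exhaustive comparison of the query's spatial feature maps against the retrieved candidates. The choice of layer (conv5_3), the max-pooled global descriptor, and the cell-wise re-ranking step are illustrative assumptions for this sketch, not the authors' exact spatio-semantic encoding.

```python
# Minimal two-stage place-recognition sketch with pre-trained VGG16 activations.
# Layer choice, pooling, and the re-ranking score are assumptions, not the paper's method.
import torch
import torch.nn.functional as F
from torchvision import models, transforms
from PIL import Image

# Off-the-shelf VGG16; keep only the convolutional trunk for intermediate activations.
vgg = models.vgg16(weights=models.VGG16_Weights.IMAGENET1K_V1).features.eval()

preprocess = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

@torch.no_grad()
def conv_activations(path, layer_idx=29):
    """Return the (C, H, W) activation map of one layer (29 = conv5_3 + ReLU)."""
    x = preprocess(Image.open(path).convert("RGB")).unsqueeze(0)
    for i, module in enumerate(vgg):
        x = module(x)
        if i == layer_idx:
            return x.squeeze(0)
    raise ValueError("layer_idx out of range")

def global_descriptor(fmap):
    """Stage 1 descriptor: channel-wise max pooling, L2-normalised."""
    return F.normalize(fmap.amax(dim=(1, 2)), dim=0)

def retrieve_candidates(query_desc, db_descs, k=10):
    """Stage 1: cosine similarity of the query against the stored database."""
    sims = torch.stack(db_descs) @ query_desc
    return sims.topk(min(k, len(db_descs))).indices.tolist()

def rerank(query_fmap, candidate_fmaps):
    """Stage 2 (illustrative): exhaustive comparison of spatial feature maps.
    Each spatial cell keeps its channel descriptor, so both appearance
    (semantics) and location contribute to the score."""
    q = F.normalize(query_fmap.flatten(1), dim=0)
    scores = [(F.normalize(c.flatten(1), dim=0) * q).sum().item()
              for c in candidate_fmaps]
    return sorted(range(len(scores)), key=lambda i: -scores[i])
```

In use, the database images would be encoded once offline (feature maps plus global descriptors); at query time, stage 1 narrows the search to a handful of candidates and stage 2 re-orders only those, keeping the exhaustive comparison affordable.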