Compressed Holistic Convolutional Neural Network-based Descriptors for Scene Recognition

2019 4th International Conference on Robotics and Automation Engineering (ICRAE) Pub Date : 2019-11-01 DOI:10.1109/ICRAE48301.2019.9043837

Shuo Wang, Xudong Lv, D. Ye, Bing Li

{"title":"Compressed Holistic Convolutional Neural Network-based Descriptors for Scene Recognition","authors":"Shuo Wang, Xudong Lv, D. Ye, Bing Li","doi":"10.1109/ICRAE48301.2019.9043837","DOIUrl":null,"url":null,"abstract":"Deep convolutional neural networks (CNN) have recently been widely used in many computer vision and pattern recognition applications. With the help of high-level image description features provided by CNN, the deep architecture models perform significantly better than state-of-the-art solutions that use traditional hand-crafted features. In this paper, we concentrate on the scene recognition problem especially for changing environments, such as view angle changes, illumination variations, occlusion, different weather conditions and seasons. We propose a new scene recognition system using the deep residual convolutional neural network (ResNet) as the image feature extractor. The initial feature vectors are chosen from specific layers of the network and after a series of post-processes, we can obtain the final image descriptor vectors for scene similarity measurement. The performance of our proposed methods is evaluated on four popular open datasets by comparing it with the classic FabMap method and some other deep learning-based methods.","PeriodicalId":270665,"journal":{"name":"2019 4th International Conference on Robotics and Automation Engineering (ICRAE)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 4th International Conference on Robotics and Automation Engineering (ICRAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRAE48301.2019.9043837","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Deep convolutional neural networks (CNN) have recently been widely used in many computer vision and pattern recognition applications. With the help of high-level image description features provided by CNN, the deep architecture models perform significantly better than state-of-the-art solutions that use traditional hand-crafted features. In this paper, we concentrate on the scene recognition problem especially for changing environments, such as view angle changes, illumination variations, occlusion, different weather conditions and seasons. We propose a new scene recognition system using the deep residual convolutional neural network (ResNet) as the image feature extractor. The initial feature vectors are chosen from specific layers of the network and after a series of post-processes, we can obtain the final image descriptor vectors for scene similarity measurement. The performance of our proposed methods is evaluated on four popular open datasets by comparing it with the classic FabMap method and some other deep learning-based methods.

查看原文本刊更多论文

基于压缩整体卷积神经网络的场景识别描述符

近年来，深度卷积神经网络在计算机视觉和模式识别领域得到了广泛的应用。在CNN提供的高级图像描述特征的帮助下，深度架构模型的表现明显优于使用传统手工制作特征的最先进解决方案。在本文中，我们重点研究了变化环境下的场景识别问题，如视角变化、光照变化、遮挡、不同天气条件和季节。提出了一种基于深度残差卷积神经网络(ResNet)作为图像特征提取器的场景识别系统。从网络的特定层中选择初始特征向量，经过一系列后处理，我们可以得到最终的图像描述子向量，用于场景相似度测量。通过与经典的FabMap方法和其他一些基于深度学习的方法进行比较，在四个流行的开放数据集上评估了我们提出的方法的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 4th International Conference on Robotics and Automation Engineering (ICRAE)

自引率

0.00%

发文量