{"title":"Compressed Holistic Convolutional Neural Network-based Descriptors for Scene Recognition","authors":"Shuo Wang, Xudong Lv, D. Ye, Bing Li","doi":"10.1109/ICRAE48301.2019.9043837","DOIUrl":null,"url":null,"abstract":"Deep convolutional neural networks (CNN) have recently been widely used in many computer vision and pattern recognition applications. With the help of high-level image description features provided by CNN, the deep architecture models perform significantly better than state-of-the-art solutions that use traditional hand-crafted features. In this paper, we concentrate on the scene recognition problem especially for changing environments, such as view angle changes, illumination variations, occlusion, different weather conditions and seasons. We propose a new scene recognition system using the deep residual convolutional neural network (ResNet) as the image feature extractor. The initial feature vectors are chosen from specific layers of the network and after a series of post-processes, we can obtain the final image descriptor vectors for scene similarity measurement. The performance of our proposed methods is evaluated on four popular open datasets by comparing it with the classic FabMap method and some other deep learning-based methods.","PeriodicalId":270665,"journal":{"name":"2019 4th International Conference on Robotics and Automation Engineering (ICRAE)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 4th International Conference on Robotics and Automation Engineering (ICRAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRAE48301.2019.9043837","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Deep convolutional neural networks (CNN) have recently been widely used in many computer vision and pattern recognition applications. With the help of high-level image description features provided by CNN, the deep architecture models perform significantly better than state-of-the-art solutions that use traditional hand-crafted features. In this paper, we concentrate on the scene recognition problem especially for changing environments, such as view angle changes, illumination variations, occlusion, different weather conditions and seasons. We propose a new scene recognition system using the deep residual convolutional neural network (ResNet) as the image feature extractor. The initial feature vectors are chosen from specific layers of the network and after a series of post-processes, we can obtain the final image descriptor vectors for scene similarity measurement. The performance of our proposed methods is evaluated on four popular open datasets by comparing it with the classic FabMap method and some other deep learning-based methods.