基于多尺度卷积神经网络的遥感场景分类

2018 IEEE International Conference on Electro/Information Technology (EIT) Pub Date : 2018-05-03 DOI:10.1109/EIT.2018.8500107

H. Alhichri, N. Alajlan, Y. Bazi, T. Rabczuk

{"title":"基于多尺度卷积神经网络的遥感场景分类","authors":"H. Alhichri, N. Alajlan, Y. Bazi, T. Rabczuk","doi":"10.1109/EIT.2018.8500107","DOIUrl":null,"url":null,"abstract":"In recent years the problem of scene classification in remote sensing has attracted a considerable amount of attention. Solution for this important problem based on deep convolutional neural networks (CNN) are currently state-of-the-art. So far all CNNs used images of fixed size (typically $224\\times 224$ which commonly used in other fields of computer vision). In this paper, we propose a multi-scale deep CNN architecture that can accept variable image sizes. We achieve this by using multiple CNN, that share some or all parameters, followed by a merge layer, fully connected layers, and finally a softmax layer for classification. In each epoch we train the network with a batch of images of all scales. We have implemented this architecture using three SqueezeNet CNNs trained on three different scales of scene images. Then we carried out experiments on three well know datasets, namely UC Merced, KSA, and AID datasets. Preliminary results show that this multi-scale CNN do converge just as the traditional single-scale training, and leads to better testing accuracy.","PeriodicalId":188414,"journal":{"name":"2018 IEEE International Conference on Electro/Information Technology (EIT)","volume":"187 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Multi-Scale Convolutional Neural Network for Remote Sensing Scene Classification\",\"authors\":\"H. Alhichri, N. Alajlan, Y. Bazi, T. Rabczuk\",\"doi\":\"10.1109/EIT.2018.8500107\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years the problem of scene classification in remote sensing has attracted a considerable amount of attention. Solution for this important problem based on deep convolutional neural networks (CNN) are currently state-of-the-art. So far all CNNs used images of fixed size (typically $224\\\\times 224$ which commonly used in other fields of computer vision). In this paper, we propose a multi-scale deep CNN architecture that can accept variable image sizes. We achieve this by using multiple CNN, that share some or all parameters, followed by a merge layer, fully connected layers, and finally a softmax layer for classification. In each epoch we train the network with a batch of images of all scales. We have implemented this architecture using three SqueezeNet CNNs trained on three different scales of scene images. Then we carried out experiments on three well know datasets, namely UC Merced, KSA, and AID datasets. Preliminary results show that this multi-scale CNN do converge just as the traditional single-scale training, and leads to better testing accuracy.\",\"PeriodicalId\":188414,\"journal\":{\"name\":\"2018 IEEE International Conference on Electro/Information Technology (EIT)\",\"volume\":\"187 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE International Conference on Electro/Information Technology (EIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIT.2018.8500107\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Electro/Information Technology (EIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIT.2018.8500107","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

摘要

近年来，遥感场景分类问题引起了广泛的关注。基于深度卷积神经网络(CNN)的解决方案是目前最先进的。到目前为止，所有cnn都使用固定大小的图像(通常是224 × 224美元，这在其他计算机视觉领域常用)。在本文中，我们提出了一种可以接受可变图像尺寸的多尺度深度CNN架构。我们通过使用多个共享部分或全部参数的CNN来实现这一点，然后是合并层，完全连接层，最后是用于分类的softmax层。在每个历元中，我们用一批所有尺度的图像来训练网络。我们使用三个在三种不同尺度的场景图像上训练的SqueezeNet cnn实现了这个架构。然后我们在UC Merced, KSA和AID三个众所周知的数据集上进行了实验。初步结果表明，这种多尺度CNN与传统的单尺度训练一样具有收敛性，并且具有更好的测试精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multi-Scale Convolutional Neural Network for Remote Sensing Scene Classification

In recent years the problem of scene classification in remote sensing has attracted a considerable amount of attention. Solution for this important problem based on deep convolutional neural networks (CNN) are currently state-of-the-art. So far all CNNs used images of fixed size (typically $224\times 224$ which commonly used in other fields of computer vision). In this paper, we propose a multi-scale deep CNN architecture that can accept variable image sizes. We achieve this by using multiple CNN, that share some or all parameters, followed by a merge layer, fully connected layers, and finally a softmax layer for classification. In each epoch we train the network with a batch of images of all scales. We have implemented this architecture using three SqueezeNet CNNs trained on three different scales of scene images. Then we carried out experiments on three well know datasets, namely UC Merced, KSA, and AID datasets. Preliminary results show that this multi-scale CNN do converge just as the traditional single-scale training, and leads to better testing accuracy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 IEEE International Conference on Electro/Information Technology (EIT)

自引率

0.00%

发文量