{"title":"基于多尺度卷积神经网络的遥感场景分类","authors":"H. Alhichri, N. Alajlan, Y. Bazi, T. Rabczuk","doi":"10.1109/EIT.2018.8500107","DOIUrl":null,"url":null,"abstract":"In recent years the problem of scene classification in remote sensing has attracted a considerable amount of attention. Solution for this important problem based on deep convolutional neural networks (CNN) are currently state-of-the-art. So far all CNNs used images of fixed size (typically $224\\times 224$ which commonly used in other fields of computer vision). In this paper, we propose a multi-scale deep CNN architecture that can accept variable image sizes. We achieve this by using multiple CNN, that share some or all parameters, followed by a merge layer, fully connected layers, and finally a softmax layer for classification. In each epoch we train the network with a batch of images of all scales. We have implemented this architecture using three SqueezeNet CNNs trained on three different scales of scene images. Then we carried out experiments on three well know datasets, namely UC Merced, KSA, and AID datasets. Preliminary results show that this multi-scale CNN do converge just as the traditional single-scale training, and leads to better testing accuracy.","PeriodicalId":188414,"journal":{"name":"2018 IEEE International Conference on Electro/Information Technology (EIT)","volume":"187 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Multi-Scale Convolutional Neural Network for Remote Sensing Scene Classification\",\"authors\":\"H. Alhichri, N. Alajlan, Y. Bazi, T. Rabczuk\",\"doi\":\"10.1109/EIT.2018.8500107\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years the problem of scene classification in remote sensing has attracted a considerable amount of attention. Solution for this important problem based on deep convolutional neural networks (CNN) are currently state-of-the-art. So far all CNNs used images of fixed size (typically $224\\\\times 224$ which commonly used in other fields of computer vision). In this paper, we propose a multi-scale deep CNN architecture that can accept variable image sizes. We achieve this by using multiple CNN, that share some or all parameters, followed by a merge layer, fully connected layers, and finally a softmax layer for classification. In each epoch we train the network with a batch of images of all scales. We have implemented this architecture using three SqueezeNet CNNs trained on three different scales of scene images. Then we carried out experiments on three well know datasets, namely UC Merced, KSA, and AID datasets. Preliminary results show that this multi-scale CNN do converge just as the traditional single-scale training, and leads to better testing accuracy.\",\"PeriodicalId\":188414,\"journal\":{\"name\":\"2018 IEEE International Conference on Electro/Information Technology (EIT)\",\"volume\":\"187 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE International Conference on Electro/Information Technology (EIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIT.2018.8500107\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Electro/Information Technology (EIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIT.2018.8500107","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-Scale Convolutional Neural Network for Remote Sensing Scene Classification
In recent years the problem of scene classification in remote sensing has attracted a considerable amount of attention. Solution for this important problem based on deep convolutional neural networks (CNN) are currently state-of-the-art. So far all CNNs used images of fixed size (typically $224\times 224$ which commonly used in other fields of computer vision). In this paper, we propose a multi-scale deep CNN architecture that can accept variable image sizes. We achieve this by using multiple CNN, that share some or all parameters, followed by a merge layer, fully connected layers, and finally a softmax layer for classification. In each epoch we train the network with a batch of images of all scales. We have implemented this architecture using three SqueezeNet CNNs trained on three different scales of scene images. Then we carried out experiments on three well know datasets, namely UC Merced, KSA, and AID datasets. Preliminary results show that this multi-scale CNN do converge just as the traditional single-scale training, and leads to better testing accuracy.