Mayar A. Shafaey, M. A. Salem, H. M. Ebeid, M. Al-Berry, M. Tolba
{"title":"Comparison of CNNs for Remote Sensing Scene Classification","authors":"Mayar A. Shafaey, M. A. Salem, H. M. Ebeid, M. Al-Berry, M. Tolba","doi":"10.1109/ICCES.2018.8639467","DOIUrl":null,"url":null,"abstract":"Nowadays, deep learning are used widely in many applications related to remote sensing i.e. earth observation, urban planning, earth’s scene classification, and so on. The deep learning manner, especially CNNs, has proved its accuracy for these practical applications. Hence, in this article, CNNs models are reviewed and its five different architectures are applied for comparisons; namely, AlexNet, VGGNet, GoogleNet, Inception-V3, and ResNet-101. These models are carried out on seven different remote-sensing image datasets for image scene classification purpose; namely, WHU-RS19, UC-Merced Land Use, SIRI-WHU, RSSCN7, AID, PatternNet, and NWPU-RESISC45. These datasets have different spatial resolutions, ranging from 0.2 to 30, to differentiate the classification accuracy of the low and high resolution images. As well, the classification accuracy of each model is assessed by trying five different classifiers; namely, Naïve Bayes, Decision Tree, Random Forest, K-Nearest Neighbor (KNN), and Support Vector Machine (SVM). The best accuracy credits to ResNet-101 model with SVM classifier; it has reached about 98.6±0.02 % of the high resolution dataset, PatternNet.","PeriodicalId":113848,"journal":{"name":"2018 13th International Conference on Computer Engineering and Systems (ICCES)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 13th International Conference on Computer Engineering and Systems (ICCES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCES.2018.8639467","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Nowadays, deep learning are used widely in many applications related to remote sensing i.e. earth observation, urban planning, earth’s scene classification, and so on. The deep learning manner, especially CNNs, has proved its accuracy for these practical applications. Hence, in this article, CNNs models are reviewed and its five different architectures are applied for comparisons; namely, AlexNet, VGGNet, GoogleNet, Inception-V3, and ResNet-101. These models are carried out on seven different remote-sensing image datasets for image scene classification purpose; namely, WHU-RS19, UC-Merced Land Use, SIRI-WHU, RSSCN7, AID, PatternNet, and NWPU-RESISC45. These datasets have different spatial resolutions, ranging from 0.2 to 30, to differentiate the classification accuracy of the low and high resolution images. As well, the classification accuracy of each model is assessed by trying five different classifiers; namely, Naïve Bayes, Decision Tree, Random Forest, K-Nearest Neighbor (KNN), and Support Vector Machine (SVM). The best accuracy credits to ResNet-101 model with SVM classifier; it has reached about 98.6±0.02 % of the high resolution dataset, PatternNet.
目前,深度学习被广泛应用于与遥感相关的许多应用中,如对地观测、城市规划、地球场景分类等。深度学习方式,尤其是cnn,已经在这些实际应用中证明了它的准确性。因此,本文回顾了cnn模型,并应用其五种不同的架构进行比较;即AlexNet、VGGNet、GoogleNet、Inception-V3和ResNet-101。这些模型在7个不同的遥感图像数据集上进行图像场景分类;即WHU-RS19、UC-Merced Land Use、SIRI-WHU、RSSCN7、AID、PatternNet和NWPU-RESISC45。这些数据集具有不同的空间分辨率,从0.2到30不等,以区分低分辨率和高分辨率图像的分类精度。并且,通过尝试五种不同的分类器来评估每个模型的分类精度;即Naïve贝叶斯、决策树、随机森林、k近邻(KNN)和支持向量机(SVM)。基于SVM分类器的ResNet-101模型准确率最高;它达到了高分辨率数据集PatternNet的98.6%±0.02%。