RS-DeepSuperLearner:融合CNN集成的遥感场景分类

IF 3.3 Q1 GEOGRAPHY

Annals of GIS Pub Date : 2023-01-02 DOI:10.1080/19475683.2023.2165544

H. Alhichri

{"title":"RS-DeepSuperLearner:融合CNN集成的遥感场景分类","authors":"H. Alhichri","doi":"10.1080/19475683.2023.2165544","DOIUrl":null,"url":null,"abstract":"ABSTRACT Scene classification is an important problem in remote sensing (RS) and has attracted a lot of research in the past decade. Nowadays, most proposed methods are based on deep convolutional neural network (CNN) models, and many pretrained CNN models have been investigated. Ensemble techniques are well studied in the machine learning community; however, few works have used them in RS scene classification. In this work, we propose an ensemble approach, called RS-DeepSuperLearner, that fuses the outputs of five advanced CNN models, namely, VGG16, Inception-V3, DenseNet121, InceptionResNet-V2, and EfficientNet-B3. First, we improve the architecture of the five CNN models by attaching an auxiliary branch at specific layer locations. In other words, the models now have two output layers producing predictions each and the final prediction is the average of the two. The RS-DeepSuperLearner method starts by fine-tuning the five CNN models using the training data. Then, it employs a deep neural network (DNN) SuperLearner to learn the best way for fusing the outputs of the five CNN models by training it on the predicted probability outputs and the cross-validation accuracies (per class) of the individual models. The proposed methodology was assessed on six publicly available RS datasets: UC Merced, KSA, RSSCN7, Optimal31, AID, and NWPU-RSC45. The experimental results demonstrate its superior capabilities when compared to state-of-the-art methods in the literature.","PeriodicalId":46270,"journal":{"name":"Annals of GIS","volume":"9 1","pages":"121 - 142"},"PeriodicalIF":3.3000,"publicationDate":"2023-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"RS-DeepSuperLearner: fusion of CNN ensemble for remote sensing scene classification\",\"authors\":\"H. Alhichri\",\"doi\":\"10.1080/19475683.2023.2165544\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT Scene classification is an important problem in remote sensing (RS) and has attracted a lot of research in the past decade. Nowadays, most proposed methods are based on deep convolutional neural network (CNN) models, and many pretrained CNN models have been investigated. Ensemble techniques are well studied in the machine learning community; however, few works have used them in RS scene classification. In this work, we propose an ensemble approach, called RS-DeepSuperLearner, that fuses the outputs of five advanced CNN models, namely, VGG16, Inception-V3, DenseNet121, InceptionResNet-V2, and EfficientNet-B3. First, we improve the architecture of the five CNN models by attaching an auxiliary branch at specific layer locations. In other words, the models now have two output layers producing predictions each and the final prediction is the average of the two. The RS-DeepSuperLearner method starts by fine-tuning the five CNN models using the training data. Then, it employs a deep neural network (DNN) SuperLearner to learn the best way for fusing the outputs of the five CNN models by training it on the predicted probability outputs and the cross-validation accuracies (per class) of the individual models. The proposed methodology was assessed on six publicly available RS datasets: UC Merced, KSA, RSSCN7, Optimal31, AID, and NWPU-RSC45. The experimental results demonstrate its superior capabilities when compared to state-of-the-art methods in the literature.\",\"PeriodicalId\":46270,\"journal\":{\"name\":\"Annals of GIS\",\"volume\":\"9 1\",\"pages\":\"121 - 142\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2023-01-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of GIS\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/19475683.2023.2165544\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOGRAPHY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of GIS","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/19475683.2023.2165544","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY","Score":null,"Total":0}

引用次数: 1

摘要

场景分类是遥感中的一个重要问题，在过去的十年中引起了大量的研究。目前，大多数提出的方法都是基于深度卷积神经网络(CNN)模型，并且已经研究了许多预训练的CNN模型。集成技术在机器学习社区得到了很好的研究;然而，很少有研究将其用于遥感场景分类。在这项工作中，我们提出了一种称为RS-DeepSuperLearner的集成方法，它融合了五个高级CNN模型的输出，即VGG16、Inception-V3、DenseNet121、inception - resnet - v2和EfficientNet-B3。首先，我们通过在特定层位置附加辅助分支来改进五个CNN模型的架构。换句话说，模型现在有两个输出层，每个输出层都产生预测，最终的预测是两个输出层的平均值。RS-DeepSuperLearner方法首先使用训练数据对五个CNN模型进行微调。然后，它使用深度神经网络(DNN)超级学习者，通过对预测的概率输出和单个模型的交叉验证精度(每类)进行训练，学习融合五个CNN模型输出的最佳方法。该方法在六个公开可用的RS数据集上进行了评估:UC Merced、KSA、RSSCN7、Optimal31、AID和NWPU-RSC45。实验结果表明，与文献中最先进的方法相比，它具有优越的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

RS-DeepSuperLearner: fusion of CNN ensemble for remote sensing scene classification

ABSTRACT Scene classification is an important problem in remote sensing (RS) and has attracted a lot of research in the past decade. Nowadays, most proposed methods are based on deep convolutional neural network (CNN) models, and many pretrained CNN models have been investigated. Ensemble techniques are well studied in the machine learning community; however, few works have used them in RS scene classification. In this work, we propose an ensemble approach, called RS-DeepSuperLearner, that fuses the outputs of five advanced CNN models, namely, VGG16, Inception-V3, DenseNet121, InceptionResNet-V2, and EfficientNet-B3. First, we improve the architecture of the five CNN models by attaching an auxiliary branch at specific layer locations. In other words, the models now have two output layers producing predictions each and the final prediction is the average of the two. The RS-DeepSuperLearner method starts by fine-tuning the five CNN models using the training data. Then, it employs a deep neural network (DNN) SuperLearner to learn the best way for fusing the outputs of the five CNN models by training it on the predicted probability outputs and the cross-validation accuracies (per class) of the individual models. The proposed methodology was assessed on six publicly available RS datasets: UC Merced, KSA, RSSCN7, Optimal31, AID, and NWPU-RSC45. The experimental results demonstrate its superior capabilities when compared to state-of-the-art methods in the literature.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Annals of GIS Multiple-

CiteScore

8.30

自引率

2.00%

发文量