Chiagoziem C. Ukwuoma , Dongsheng Cai , Oluwatoyosi Bamisile , Chibueze D. Ukwuoma , Chinedu I. Otuka , Nnadozie O. Anyanwu , Chidera O. Ukwuoma , Qi Huang
{"title":"Optimised multi-hierarchical feature fusion with multi-kernel CNN and spectral-spatial convolutions for remote sensing image classification","authors":"Chiagoziem C. Ukwuoma , Dongsheng Cai , Oluwatoyosi Bamisile , Chibueze D. Ukwuoma , Chinedu I. Otuka , Nnadozie O. Anyanwu , Chidera O. Ukwuoma , Qi Huang","doi":"10.1016/j.rsase.2025.101727","DOIUrl":null,"url":null,"abstract":"<div><div>Remote sensing image classification is a central process in the interpretation of Earth observation data for applications such as land use mapping, environmental monitoring, and urban planning. Despite significant progress brought about by deep learning, particularly Convolutional Neural Networks (CNNs), existing models are prone to failing to represent complex spatial patterns, multi-scale object variation, and rich spectral dependencies characteristic of high-resolution remote sensing images. In addition, most models are suffering from greater computational complexity and poor generalisation on diverse datasets. To address these issues, we propose an Optimised Multi-Hierarchical Feature Fusion Framework, a deep learning model that integrates multi-kernel convolution, spectral-spatial depthwise convolution, and residual learning into a ResNet-50 backbone. The model learns a rich set of spatial textures and spectral features at different hierarchical layers, leading to enhanced feature representation and classification robustness. We evaluated the proposed method on six remote sensing datasets: AID, EUROSAT, NWPU-RESISC45, RSSCN7, UC Merced, and WHU-RS19 with standard performance metrics like accuracy, precision, recall, specificity, and F1-score. The proposed method achieved an average accuracy of 99.00 % for AID, 99.51 % for EUROSAT, 99.56 % for RESISC45, 95.71 % for RSSCN7, 99.14 % for UC Merced, and 95.79 % for WHU-RS19. Moreover, qualitative visualisation techniques such as Class Activation Mapping (CAM) and LIME were employed to provide explanations of model decisions by identifying the most influential image regions to the predictions. These visual explanations confirmed that the model is based on semantically meaningful regions, further improving its interpretability and trustworthiness. The superior accuracy and robust performance on multiple datasets verify the efficiency and generalisation ability of the designed model, indicating it is a strong candidate for practical remote sensing classification tasks. <span><span>https://github.com/chiagoziemchima/Multi-Hierarchical-Feature-Fusion-/tree/main</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":53227,"journal":{"name":"Remote Sensing Applications-Society and Environment","volume":"40 ","pages":"Article 101727"},"PeriodicalIF":4.5000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing Applications-Society and Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352938525002800","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Remote sensing image classification is a central process in the interpretation of Earth observation data for applications such as land use mapping, environmental monitoring, and urban planning. Despite significant progress brought about by deep learning, particularly Convolutional Neural Networks (CNNs), existing models are prone to failing to represent complex spatial patterns, multi-scale object variation, and rich spectral dependencies characteristic of high-resolution remote sensing images. In addition, most models are suffering from greater computational complexity and poor generalisation on diverse datasets. To address these issues, we propose an Optimised Multi-Hierarchical Feature Fusion Framework, a deep learning model that integrates multi-kernel convolution, spectral-spatial depthwise convolution, and residual learning into a ResNet-50 backbone. The model learns a rich set of spatial textures and spectral features at different hierarchical layers, leading to enhanced feature representation and classification robustness. We evaluated the proposed method on six remote sensing datasets: AID, EUROSAT, NWPU-RESISC45, RSSCN7, UC Merced, and WHU-RS19 with standard performance metrics like accuracy, precision, recall, specificity, and F1-score. The proposed method achieved an average accuracy of 99.00 % for AID, 99.51 % for EUROSAT, 99.56 % for RESISC45, 95.71 % for RSSCN7, 99.14 % for UC Merced, and 95.79 % for WHU-RS19. Moreover, qualitative visualisation techniques such as Class Activation Mapping (CAM) and LIME were employed to provide explanations of model decisions by identifying the most influential image regions to the predictions. These visual explanations confirmed that the model is based on semantically meaningful regions, further improving its interpretability and trustworthiness. The superior accuracy and robust performance on multiple datasets verify the efficiency and generalisation ability of the designed model, indicating it is a strong candidate for practical remote sensing classification tasks. https://github.com/chiagoziemchima/Multi-Hierarchical-Feature-Fusion-/tree/main.
期刊介绍:
The journal ''Remote Sensing Applications: Society and Environment'' (RSASE) focuses on remote sensing studies that address specific topics with an emphasis on environmental and societal issues - regional / local studies with global significance. Subjects are encouraged to have an interdisciplinary approach and include, but are not limited by: " -Global and climate change studies addressing the impact of increasing concentrations of greenhouse gases, CO2 emission, carbon balance and carbon mitigation, energy system on social and environmental systems -Ecological and environmental issues including biodiversity, ecosystem dynamics, land degradation, atmospheric and water pollution, urban footprint, ecosystem management and natural hazards (e.g. earthquakes, typhoons, floods, landslides) -Natural resource studies including land-use in general, biomass estimation, forests, agricultural land, plantation, soils, coral reefs, wetland and water resources -Agriculture, food production systems and food security outcomes -Socio-economic issues including urban systems, urban growth, public health, epidemics, land-use transition and land use conflicts -Oceanography and coastal zone studies, including sea level rise projections, coastlines changes and the ocean-land interface -Regional challenges for remote sensing application techniques, monitoring and analysis, such as cloud screening and atmospheric correction for tropical regions -Interdisciplinary studies combining remote sensing, household survey data, field measurements and models to address environmental, societal and sustainability issues -Quantitative and qualitative analysis that documents the impact of using remote sensing studies in social, political, environmental or economic systems