{"title":"Adaptive feature extraction and attention-based segmentation network for remote sensing imagery","authors":"Aneeqah Azmat , Basim Azam , Farrukh Aziz Bhatti , Sheheryar Khan","doi":"10.1016/j.rsase.2025.101679","DOIUrl":null,"url":null,"abstract":"<div><div>Semantic segmentation of land cover types is a pivotal task in remote sensing, essential for applications in urban planning, environmental monitoring, disaster management, and agriculture. Accurate segmentation is challenged by imbalanced class distributions and ambiguous boundaries. This paper introduces AdaptiveFusionNet, a novel architecture designed to address heterogeneous complexities in remote sensory image, by leveraging adaptive, multi-scale feature extraction and efficient fusion mechanisms. The architecture comprises three core modules: the Adaptive Pixel Encoder (APE), which enhances pixel-level feature extraction across multiple scales; the Fusion Atrous Pooling (FAP), which effectively integrates contextual information using atrous convolutions; and the Parallel Attention Decoder (PAD), which refines segmentation boundaries through attention-enhanced upsampling. Evaluated on the high-resolution Gaofen 2 dataset, AdaptiveFusionNet demonstrates substantial improvements in key performance metrics, achieving an overall Intersection over Union (IoU) of 71% and excelling in Precision, Recall, and F1 score across various land cover classes, including urban areas, vegetation, water bodies, and infrastructure. An ablation study is presented to validate AdaptiveFusionNet’s superiority over existing architectures. The results establish AdaptiveFusionNet as an improved architecture for high-resolution land cover segmentation in terms of both accuracy and computational efficiency.</div></div>","PeriodicalId":53227,"journal":{"name":"Remote Sensing Applications-Society and Environment","volume":"39 ","pages":"Article 101679"},"PeriodicalIF":4.5000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing Applications-Society and Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352938525002320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Semantic segmentation of land cover types is a pivotal task in remote sensing, essential for applications in urban planning, environmental monitoring, disaster management, and agriculture. Accurate segmentation is challenged by imbalanced class distributions and ambiguous boundaries. This paper introduces AdaptiveFusionNet, a novel architecture designed to address heterogeneous complexities in remote sensory image, by leveraging adaptive, multi-scale feature extraction and efficient fusion mechanisms. The architecture comprises three core modules: the Adaptive Pixel Encoder (APE), which enhances pixel-level feature extraction across multiple scales; the Fusion Atrous Pooling (FAP), which effectively integrates contextual information using atrous convolutions; and the Parallel Attention Decoder (PAD), which refines segmentation boundaries through attention-enhanced upsampling. Evaluated on the high-resolution Gaofen 2 dataset, AdaptiveFusionNet demonstrates substantial improvements in key performance metrics, achieving an overall Intersection over Union (IoU) of 71% and excelling in Precision, Recall, and F1 score across various land cover classes, including urban areas, vegetation, water bodies, and infrastructure. An ablation study is presented to validate AdaptiveFusionNet’s superiority over existing architectures. The results establish AdaptiveFusionNet as an improved architecture for high-resolution land cover segmentation in terms of both accuracy and computational efficiency.
期刊介绍:
The journal ''Remote Sensing Applications: Society and Environment'' (RSASE) focuses on remote sensing studies that address specific topics with an emphasis on environmental and societal issues - regional / local studies with global significance. Subjects are encouraged to have an interdisciplinary approach and include, but are not limited by: " -Global and climate change studies addressing the impact of increasing concentrations of greenhouse gases, CO2 emission, carbon balance and carbon mitigation, energy system on social and environmental systems -Ecological and environmental issues including biodiversity, ecosystem dynamics, land degradation, atmospheric and water pollution, urban footprint, ecosystem management and natural hazards (e.g. earthquakes, typhoons, floods, landslides) -Natural resource studies including land-use in general, biomass estimation, forests, agricultural land, plantation, soils, coral reefs, wetland and water resources -Agriculture, food production systems and food security outcomes -Socio-economic issues including urban systems, urban growth, public health, epidemics, land-use transition and land use conflicts -Oceanography and coastal zone studies, including sea level rise projections, coastlines changes and the ocean-land interface -Regional challenges for remote sensing application techniques, monitoring and analysis, such as cloud screening and atmospheric correction for tropical regions -Interdisciplinary studies combining remote sensing, household survey data, field measurements and models to address environmental, societal and sustainability issues -Quantitative and qualitative analysis that documents the impact of using remote sensing studies in social, political, environmental or economic systems