Kequan Shi , Qi Li , Hengchao Li , Pan Xu , Peng Zhang , Sen Yang , Hongna Zhu
{"title":"LMFAN: Lightweight multi-scale feature aggregation network with channel pruning and knowledge distillation for ship detection in remote sensing images","authors":"Kequan Shi , Qi Li , Hengchao Li , Pan Xu , Peng Zhang , Sen Yang , Hongna Zhu","doi":"10.1016/j.rsase.2025.101692","DOIUrl":null,"url":null,"abstract":"<div><div>Ship detection in remote sensing images, especially in synthetic aperture radar (SAR) images holds significant application value in maritime traffic monitoring, military reconnaissance, and related domains. To address the accuracy degradation caused by complex maritime background interference, multi-scale target coexistence, and weak feature representation of small targets in SAR images, we propose a lightweight multi-scale feature aggregation network (LMFAN) based on YOLO11. Firstly, a gram polynomial-driven dynamic convolution module that employs differentiable Gram-Kolmogorov basis functions is designed to expand the receptive field of convolutional kernels, enhancing coarse-grained feature representation. Secondly, we employ an enhanced spatial-orientation pyramid utilizing an Omni-Kernel to fuse global, large-local, and local detailed features, significantly improving feature responses for small targets. Thirdly, we design a multi-scale detail-sharing decoder incorporating detail-enhanced and shared convolution to preserve contextual information while reducing computational overhead. Moreover, a dynamic channel pruning strategy with channel-wise cascaded optimization and a channel-wise knowledge distillation with the Kullback–Leibler divergence loss function is introduced to resolves the feature coupling issue in dense target scenarios. Experimental results demonstrate that compared to baseline YOLO11n model, LMFAN achieves mAP<span><math><mi>@</mi></math></span>0.5:0.95 improvements of 3.6%, 4.9%, and 1.5% on the SSDD, HRSID, and SAR-Ship-Dataset, respectively. Additional validation confirms the model’s superior generalization capability. Compared with state-of-the-art methods, the LMFAN can achieve better accuracy and fewer parameters.</div></div>","PeriodicalId":53227,"journal":{"name":"Remote Sensing Applications-Society and Environment","volume":"39 ","pages":"Article 101692"},"PeriodicalIF":4.5000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing Applications-Society and Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352938525002459","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Ship detection in remote sensing images, especially in synthetic aperture radar (SAR) images holds significant application value in maritime traffic monitoring, military reconnaissance, and related domains. To address the accuracy degradation caused by complex maritime background interference, multi-scale target coexistence, and weak feature representation of small targets in SAR images, we propose a lightweight multi-scale feature aggregation network (LMFAN) based on YOLO11. Firstly, a gram polynomial-driven dynamic convolution module that employs differentiable Gram-Kolmogorov basis functions is designed to expand the receptive field of convolutional kernels, enhancing coarse-grained feature representation. Secondly, we employ an enhanced spatial-orientation pyramid utilizing an Omni-Kernel to fuse global, large-local, and local detailed features, significantly improving feature responses for small targets. Thirdly, we design a multi-scale detail-sharing decoder incorporating detail-enhanced and shared convolution to preserve contextual information while reducing computational overhead. Moreover, a dynamic channel pruning strategy with channel-wise cascaded optimization and a channel-wise knowledge distillation with the Kullback–Leibler divergence loss function is introduced to resolves the feature coupling issue in dense target scenarios. Experimental results demonstrate that compared to baseline YOLO11n model, LMFAN achieves mAP0.5:0.95 improvements of 3.6%, 4.9%, and 1.5% on the SSDD, HRSID, and SAR-Ship-Dataset, respectively. Additional validation confirms the model’s superior generalization capability. Compared with state-of-the-art methods, the LMFAN can achieve better accuracy and fewer parameters.
期刊介绍:
The journal ''Remote Sensing Applications: Society and Environment'' (RSASE) focuses on remote sensing studies that address specific topics with an emphasis on environmental and societal issues - regional / local studies with global significance. Subjects are encouraged to have an interdisciplinary approach and include, but are not limited by: " -Global and climate change studies addressing the impact of increasing concentrations of greenhouse gases, CO2 emission, carbon balance and carbon mitigation, energy system on social and environmental systems -Ecological and environmental issues including biodiversity, ecosystem dynamics, land degradation, atmospheric and water pollution, urban footprint, ecosystem management and natural hazards (e.g. earthquakes, typhoons, floods, landslides) -Natural resource studies including land-use in general, biomass estimation, forests, agricultural land, plantation, soils, coral reefs, wetland and water resources -Agriculture, food production systems and food security outcomes -Socio-economic issues including urban systems, urban growth, public health, epidemics, land-use transition and land use conflicts -Oceanography and coastal zone studies, including sea level rise projections, coastlines changes and the ocean-land interface -Regional challenges for remote sensing application techniques, monitoring and analysis, such as cloud screening and atmospheric correction for tropical regions -Interdisciplinary studies combining remote sensing, household survey data, field measurements and models to address environmental, societal and sustainability issues -Quantitative and qualitative analysis that documents the impact of using remote sensing studies in social, political, environmental or economic systems