LMFAN: Lightweight multi-scale feature aggregation network with channel pruning and knowledge distillation for ship detection in remote sensing images

IF 4.5 Q2 ENVIRONMENTAL SCIENCES
Kequan Shi , Qi Li , Hengchao Li , Pan Xu , Peng Zhang , Sen Yang , Hongna Zhu
{"title":"LMFAN: Lightweight multi-scale feature aggregation network with channel pruning and knowledge distillation for ship detection in remote sensing images","authors":"Kequan Shi ,&nbsp;Qi Li ,&nbsp;Hengchao Li ,&nbsp;Pan Xu ,&nbsp;Peng Zhang ,&nbsp;Sen Yang ,&nbsp;Hongna Zhu","doi":"10.1016/j.rsase.2025.101692","DOIUrl":null,"url":null,"abstract":"<div><div>Ship detection in remote sensing images, especially in synthetic aperture radar (SAR) images holds significant application value in maritime traffic monitoring, military reconnaissance, and related domains. To address the accuracy degradation caused by complex maritime background interference, multi-scale target coexistence, and weak feature representation of small targets in SAR images, we propose a lightweight multi-scale feature aggregation network (LMFAN) based on YOLO11. Firstly, a gram polynomial-driven dynamic convolution module that employs differentiable Gram-Kolmogorov basis functions is designed to expand the receptive field of convolutional kernels, enhancing coarse-grained feature representation. Secondly, we employ an enhanced spatial-orientation pyramid utilizing an Omni-Kernel to fuse global, large-local, and local detailed features, significantly improving feature responses for small targets. Thirdly, we design a multi-scale detail-sharing decoder incorporating detail-enhanced and shared convolution to preserve contextual information while reducing computational overhead. Moreover, a dynamic channel pruning strategy with channel-wise cascaded optimization and a channel-wise knowledge distillation with the Kullback–Leibler divergence loss function is introduced to resolves the feature coupling issue in dense target scenarios. Experimental results demonstrate that compared to baseline YOLO11n model, LMFAN achieves mAP<span><math><mi>@</mi></math></span>0.5:0.95 improvements of 3.6%, 4.9%, and 1.5% on the SSDD, HRSID, and SAR-Ship-Dataset, respectively. Additional validation confirms the model’s superior generalization capability. Compared with state-of-the-art methods, the LMFAN can achieve better accuracy and fewer parameters.</div></div>","PeriodicalId":53227,"journal":{"name":"Remote Sensing Applications-Society and Environment","volume":"39 ","pages":"Article 101692"},"PeriodicalIF":4.5000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing Applications-Society and Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352938525002459","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Ship detection in remote sensing images, especially in synthetic aperture radar (SAR) images holds significant application value in maritime traffic monitoring, military reconnaissance, and related domains. To address the accuracy degradation caused by complex maritime background interference, multi-scale target coexistence, and weak feature representation of small targets in SAR images, we propose a lightweight multi-scale feature aggregation network (LMFAN) based on YOLO11. Firstly, a gram polynomial-driven dynamic convolution module that employs differentiable Gram-Kolmogorov basis functions is designed to expand the receptive field of convolutional kernels, enhancing coarse-grained feature representation. Secondly, we employ an enhanced spatial-orientation pyramid utilizing an Omni-Kernel to fuse global, large-local, and local detailed features, significantly improving feature responses for small targets. Thirdly, we design a multi-scale detail-sharing decoder incorporating detail-enhanced and shared convolution to preserve contextual information while reducing computational overhead. Moreover, a dynamic channel pruning strategy with channel-wise cascaded optimization and a channel-wise knowledge distillation with the Kullback–Leibler divergence loss function is introduced to resolves the feature coupling issue in dense target scenarios. Experimental results demonstrate that compared to baseline YOLO11n model, LMFAN achieves mAP@0.5:0.95 improvements of 3.6%, 4.9%, and 1.5% on the SSDD, HRSID, and SAR-Ship-Dataset, respectively. Additional validation confirms the model’s superior generalization capability. Compared with state-of-the-art methods, the LMFAN can achieve better accuracy and fewer parameters.
LMFAN:基于信道修剪和知识蒸馏的轻型多尺度特征聚合网络
遥感图像中的船舶检测,特别是合成孔径雷达(SAR)图像中的船舶检测,在海上交通监控、军事侦察等领域具有重要的应用价值。针对复杂海洋背景干扰、多尺度目标共存以及SAR图像中小目标特征表示弱等问题,提出了一种基于YOLO11的轻型多尺度特征聚合网络(LMFAN)。首先,利用可微的gram - kolmogorov基函数设计了一个克多项式驱动的动态卷积模块,扩展了卷积核的接受域,增强了粗粒度特征表示;其次,我们采用增强的空间取向金字塔,利用omnio - kernel融合全局、大局部和局部细节特征,显著改善了小目标的特征响应。第三,我们设计了一个多尺度细节共享解码器,结合细节增强和共享卷积来保留上下文信息,同时减少计算开销。此外,引入了一种基于通道级联优化的动态通道修剪策略和基于Kullback-Leibler散度损失函数的通道知识精馏策略来解决密集目标场景下的特征耦合问题。实验结果表明,与基线YOLO11n模型相比,LMFAN在SSDD、HRSID和sar - ship数据集上分别实现了3.6%、4.9%和1.5%的mAP@0.5:0.95的改进。额外的验证证实了该模型优越的泛化能力。与现有的方法相比,LMFAN可以获得更高的精度和更少的参数。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.00
自引率
8.50%
发文量
204
审稿时长
65 days
期刊介绍: The journal ''Remote Sensing Applications: Society and Environment'' (RSASE) focuses on remote sensing studies that address specific topics with an emphasis on environmental and societal issues - regional / local studies with global significance. Subjects are encouraged to have an interdisciplinary approach and include, but are not limited by: " -Global and climate change studies addressing the impact of increasing concentrations of greenhouse gases, CO2 emission, carbon balance and carbon mitigation, energy system on social and environmental systems -Ecological and environmental issues including biodiversity, ecosystem dynamics, land degradation, atmospheric and water pollution, urban footprint, ecosystem management and natural hazards (e.g. earthquakes, typhoons, floods, landslides) -Natural resource studies including land-use in general, biomass estimation, forests, agricultural land, plantation, soils, coral reefs, wetland and water resources -Agriculture, food production systems and food security outcomes -Socio-economic issues including urban systems, urban growth, public health, epidemics, land-use transition and land use conflicts -Oceanography and coastal zone studies, including sea level rise projections, coastlines changes and the ocean-land interface -Regional challenges for remote sensing application techniques, monitoring and analysis, such as cloud screening and atmospheric correction for tropical regions -Interdisciplinary studies combining remote sensing, household survey data, field measurements and models to address environmental, societal and sustainability issues -Quantitative and qualitative analysis that documents the impact of using remote sensing studies in social, political, environmental or economic systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信