A Lightweight Hybrid Network for Object Detection in Remote Sensing Images Balancing Global and Local Information

IF 4.4
Shuting Huang;Ge Zhang;Huanzun Zhang;Hui Xu;Guangzhen Yao;Sandong Zhu;Long Zhang;Jun Kong
{"title":"A Lightweight Hybrid Network for Object Detection in Remote Sensing Images Balancing Global and Local Information","authors":"Shuting Huang;Ge Zhang;Huanzun Zhang;Hui Xu;Guangzhen Yao;Sandong Zhu;Long Zhang;Jun Kong","doi":"10.1109/LGRS.2025.3588788","DOIUrl":null,"url":null,"abstract":"In recent years, hybrid convolutional neural networks (CNNs) and Transformer-based object detection technologies have achieved remarkable success. In the field of remote sensing image detection, since remote sensing systems rely on the large-scale deployment of edge devices, detection models need to be lightweight with low parameter complexity to adapt to resource-constrained environments. However, existing lightweight models often struggle with an imbalance in extracting low-frequency global and high-frequency local information. In particular, when processing high-frequency local information (such as edges, textures, and fine structures), these models often lack in-depth analysis, leading to insufficient extraction of local features and reduced detection accuracy. To address the imbalance between low-frequency global information and high-frequency local information in lightweight remote sensing models, we propose an efficient and lightweight hybrid network detection framework, which mainly consists of the global–local balance (GLB) module and the detail-aware feature fusion (DAFF) module. The GLB module adopts dynamic weight adjustment and context-aware mechanisms to effectively aggregate high-frequency local information in the image. The DAFF module further enhances feature fusion and detail refinement, improving the model’s performance and generalization ability. Experimental results on remote sensing datasets, including RSOD, NWPU VHR-10, and LEVIR datasets, demonstrate that our proposed method achieves a well-balanced tradeoff between model size and detection accuracy, reaching state-of-the-art performance.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11079653/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, hybrid convolutional neural networks (CNNs) and Transformer-based object detection technologies have achieved remarkable success. In the field of remote sensing image detection, since remote sensing systems rely on the large-scale deployment of edge devices, detection models need to be lightweight with low parameter complexity to adapt to resource-constrained environments. However, existing lightweight models often struggle with an imbalance in extracting low-frequency global and high-frequency local information. In particular, when processing high-frequency local information (such as edges, textures, and fine structures), these models often lack in-depth analysis, leading to insufficient extraction of local features and reduced detection accuracy. To address the imbalance between low-frequency global information and high-frequency local information in lightweight remote sensing models, we propose an efficient and lightweight hybrid network detection framework, which mainly consists of the global–local balance (GLB) module and the detail-aware feature fusion (DAFF) module. The GLB module adopts dynamic weight adjustment and context-aware mechanisms to effectively aggregate high-frequency local information in the image. The DAFF module further enhances feature fusion and detail refinement, improving the model’s performance and generalization ability. Experimental results on remote sensing datasets, including RSOD, NWPU VHR-10, and LEVIR datasets, demonstrate that our proposed method achieves a well-balanced tradeoff between model size and detection accuracy, reaching state-of-the-art performance.
一种平衡全局和局部信息的遥感图像目标检测轻量级混合网络
近年来,混合卷积神经网络(cnn)和基于transformer的目标检测技术取得了显著的成功。在遥感图像检测领域,由于遥感系统依赖于边缘设备的大规模部署,检测模型需要轻量化和低参数复杂度,以适应资源受限的环境。然而,现有的轻量级模型在提取低频全局信息和高频局部信息时往往存在不平衡的问题。特别是在处理高频局部信息(如边缘、纹理、精细结构)时,这些模型往往缺乏深入分析,导致局部特征提取不足,检测精度降低。为解决轻量化遥感模型中低频全局信息与高频局部信息不平衡的问题,提出了一种高效、轻量化的混合网络检测框架,该框架主要由全局-局部平衡(GLB)模块和细节感知特征融合(DAFF)模块组成。GLB模块采用动态权重调整和上下文感知机制,有效聚合图像中的高频局部信息。DAFF模块进一步增强了特征融合和细节细化,提高了模型的性能和泛化能力。在RSOD、NWPU VHR-10和LEVIR等遥感数据集上的实验结果表明,我们提出的方法在模型大小和检测精度之间取得了很好的平衡,达到了最先进的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信