Frequency-Aware Contextual Feature Pyramid Network for Infrared Small-Target Detection

Shu Cai;Jinfu Yang;Tao Xiang;Jinglei Bai
{"title":"Frequency-Aware Contextual Feature Pyramid Network for Infrared Small-Target Detection","authors":"Shu Cai;Jinfu Yang;Tao Xiang;Jinglei Bai","doi":"10.1109/LGRS.2025.3560340","DOIUrl":null,"url":null,"abstract":"Due to the absence of detailed information, such as texture, shape, and color, detecting infrared small targets remains a challenging problem. While existing model-driven and data-driven approaches have made some progress, they still struggle to effectively exploit global contextual information and frequency-specific details. In this letter, we introduce a frequency-aware contextual feature pyramid network (FACFPNet) to address these limitations in infrared small-target detection. Specifically, we first estimate the correlation between high- and low-frequency feature representations within an encoder-decoder framework based on the ResNet-18 backbone. This is achieved through the contextual fine-grained block (CFGB), which effectively combines local fine-grained features with global semantic information for enhanced contextual feature modeling. Next, we propose a frequency-aware attention module (FAAM) to address the underutilization of prior frequency knowledge in infrared small targets, thereby improving the preservation of these features. This module enhances global contextual representation by more effectively extracting high- and low-frequency information. Finally, during the decoding stage, shallow fine-structure information is interactively fused with deep semantic features through the asymmetric enhancement fusion module (AEFM), which strengthens the representation of small targets and improves information retention. Experimental results on three publicly available datasets, SIRST-Aug, MdvsFA, and IRSTD-1K, demonstrate that our method achieves superior detection performance.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10964277/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Due to the absence of detailed information, such as texture, shape, and color, detecting infrared small targets remains a challenging problem. While existing model-driven and data-driven approaches have made some progress, they still struggle to effectively exploit global contextual information and frequency-specific details. In this letter, we introduce a frequency-aware contextual feature pyramid network (FACFPNet) to address these limitations in infrared small-target detection. Specifically, we first estimate the correlation between high- and low-frequency feature representations within an encoder-decoder framework based on the ResNet-18 backbone. This is achieved through the contextual fine-grained block (CFGB), which effectively combines local fine-grained features with global semantic information for enhanced contextual feature modeling. Next, we propose a frequency-aware attention module (FAAM) to address the underutilization of prior frequency knowledge in infrared small targets, thereby improving the preservation of these features. This module enhances global contextual representation by more effectively extracting high- and low-frequency information. Finally, during the decoding stage, shallow fine-structure information is interactively fused with deep semantic features through the asymmetric enhancement fusion module (AEFM), which strengthens the representation of small targets and improves information retention. Experimental results on three publicly available datasets, SIRST-Aug, MdvsFA, and IRSTD-1K, demonstrate that our method achieves superior detection performance.
红外小目标检测的频率感知上下文特征金字塔网络
由于缺乏纹理、形状和颜色等详细信息,红外小目标的检测仍然是一个具有挑战性的问题。虽然现有的模型驱动和数据驱动的方法已经取得了一些进展,但它们仍然难以有效地利用全局上下文信息和特定频率的细节。在这封信中,我们介绍了一个频率感知上下文特征金字塔网络(FACFPNet)来解决红外小目标检测中的这些限制。具体来说,我们首先在基于ResNet-18主干的编码器-解码器框架中估计高频和低频特征表示之间的相关性。这是通过上下文细粒度块(CFGB)实现的,它有效地将局部细粒度特征与全局语义信息结合起来,以增强上下文特征建模。接下来,我们提出了一种频率感知关注模块(FAAM),以解决红外小目标中先验频率知识利用不足的问题,从而提高这些特征的保存。该模块通过更有效地提取高频和低频信息来增强全局上下文表示。最后,在解码阶段,通过非对称增强融合模块(AEFM)将浅层精细结构信息与深层语义特征进行交互融合,增强了小目标的表征,提高了信息保留率。在SIRST-Aug、MdvsFA和IRSTD-1K三个公开数据集上的实验结果表明,该方法具有较好的检测性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信