用于红外小目标探测的轻型目标全向增强网络

IF 3.4 3区物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION

Infrared Physics & Technology Pub Date : 2025-08-11 DOI:10.1016/j.infrared.2025.106058

Yichuan Li, Feng He, Qiran Zhang, Wei Zhang

{"title":"用于红外小目标探测的轻型目标全向增强网络","authors":"Yichuan Li, Feng He, Qiran Zhang, Wei Zhang","doi":"10.1016/j.infrared.2025.106058","DOIUrl":null,"url":null,"abstract":"<div><div>Due to the limited number of pixels and weak features of small targets in infrared images, detecting such targets in complex backgrounds remains a highly challenging task. It is worthwhile to explore how prior knowledge can be used to compensate for the insufficient inherent information in the original images, thereby assisting deep learning methods in learning more effectively. Inspired by human visual perception, areas with greater local changes tend to attract more attention. In infrared images, while there is some grayscale gradient at the boundary between small targets and the background, background regions also exhibit grayscale variations.To address these issues and make better use of grayscale gradient information as prior knowledge, it is necessary to distinguish the gradients around small targets from those in complex background regions. Therefore, we propose a Target Omnidirectional Enhancement Network (TODENet). The network first uses a Target Enhancement Module to focus on the inherent prior knowledge of infrared images, amplifying the grayscale gradient at the boundary between small targets and the background, while suppressing gradient variations within the background. This approach reduces clutter interference from complex backgrounds and highlights small targets within the image. Building on this, we constructed an Inter-layer Feature Fusion Module based on transposed convolution, which effectively minimizes the loss of high-frequency information of small targets during upsampling. It also makes full use of the semantic information from deep feature maps and the spatial location information from shallow feature maps. Additionally, we developed a Dilated Convolution Module that adjusts the receptive field size to filter out background clutter and then extract fine features of small targets, addressing the problem of losing small target features in the deeper layers of network. Extensive experiments show that TODENet achieves state-of-the-art performance on the NUAA-SIRST, NUDT-SIRST, and IRSTD-1k datasets, with target-level detection rates (Pd) of 97.710%, 99.649%, and 94.218%, respectively. The source code of our work is available at <span><span>https://github.com/LYC-1021/TODE-Net</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"151 ","pages":"Article 106058"},"PeriodicalIF":3.4000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lightweight Target Omni-Directional Enhancement Network for infrared small target detection\",\"authors\":\"Yichuan Li, Feng He, Qiran Zhang, Wei Zhang\",\"doi\":\"10.1016/j.infrared.2025.106058\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Due to the limited number of pixels and weak features of small targets in infrared images, detecting such targets in complex backgrounds remains a highly challenging task. It is worthwhile to explore how prior knowledge can be used to compensate for the insufficient inherent information in the original images, thereby assisting deep learning methods in learning more effectively. Inspired by human visual perception, areas with greater local changes tend to attract more attention. In infrared images, while there is some grayscale gradient at the boundary between small targets and the background, background regions also exhibit grayscale variations.To address these issues and make better use of grayscale gradient information as prior knowledge, it is necessary to distinguish the gradients around small targets from those in complex background regions. Therefore, we propose a Target Omnidirectional Enhancement Network (TODENet). The network first uses a Target Enhancement Module to focus on the inherent prior knowledge of infrared images, amplifying the grayscale gradient at the boundary between small targets and the background, while suppressing gradient variations within the background. This approach reduces clutter interference from complex backgrounds and highlights small targets within the image. Building on this, we constructed an Inter-layer Feature Fusion Module based on transposed convolution, which effectively minimizes the loss of high-frequency information of small targets during upsampling. It also makes full use of the semantic information from deep feature maps and the spatial location information from shallow feature maps. Additionally, we developed a Dilated Convolution Module that adjusts the receptive field size to filter out background clutter and then extract fine features of small targets, addressing the problem of losing small target features in the deeper layers of network. Extensive experiments show that TODENet achieves state-of-the-art performance on the NUAA-SIRST, NUDT-SIRST, and IRSTD-1k datasets, with target-level detection rates (Pd) of 97.710%, 99.649%, and 94.218%, respectively. The source code of our work is available at <span><span>https://github.com/LYC-1021/TODE-Net</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":13549,\"journal\":{\"name\":\"Infrared Physics & Technology\",\"volume\":\"151 \",\"pages\":\"Article 106058\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-08-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Infrared Physics & Technology\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1350449525003512\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INSTRUMENTS & INSTRUMENTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449525003512","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}

引用次数: 0

摘要

由于红外图像中小目标的像素数量有限且特征较弱，因此在复杂背景下检测小目标仍然是一项极具挑战性的任务。如何利用先验知识来弥补原始图像中固有信息的不足，从而帮助深度学习方法更有效地学习，是值得探讨的问题。受人类视觉感知的启发，局部变化较大的区域往往会吸引更多的注意力。在红外图像中，小目标与背景边界处存在一定的灰度梯度，背景区域也呈现灰度变化。为了解决这些问题，更好地利用灰度梯度信息作为先验知识，有必要将小目标周围的梯度与复杂背景区域的梯度区分开来。为此，我们提出了一种目标全向增强网络（TODENet）。该网络首先利用目标增强模块聚焦红外图像固有的先验知识，放大小目标与背景边界处的灰度梯度，同时抑制背景内的梯度变化。这种方法减少了来自复杂背景的杂波干扰，并突出了图像中的小目标。在此基础上，构建了基于转置卷积的层间特征融合模块，有效地减少了上采样过程中小目标高频信息的丢失。它还充分利用了深层特征图的语义信息和浅层特征图的空间位置信息。此外，我们还开发了一个扩展卷积模块，该模块通过调整接收场大小来过滤背景杂波，然后提取小目标的精细特征，解决了网络更深层中丢失小目标特征的问题。大量实验表明，TODENet在NUAA-SIRST、NUDT-SIRST和IRSTD-1k数据集上取得了最先进的性能，目标水平检测率（Pd）分别为97.710%、99.649%和94.218%。我们工作的源代码可在https://github.com/LYC-1021/TODE-Net上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Lightweight Target Omni-Directional Enhancement Network for infrared small target detection

Due to the limited number of pixels and weak features of small targets in infrared images, detecting such targets in complex backgrounds remains a highly challenging task. It is worthwhile to explore how prior knowledge can be used to compensate for the insufficient inherent information in the original images, thereby assisting deep learning methods in learning more effectively. Inspired by human visual perception, areas with greater local changes tend to attract more attention. In infrared images, while there is some grayscale gradient at the boundary between small targets and the background, background regions also exhibit grayscale variations.To address these issues and make better use of grayscale gradient information as prior knowledge, it is necessary to distinguish the gradients around small targets from those in complex background regions. Therefore, we propose a Target Omnidirectional Enhancement Network (TODENet). The network first uses a Target Enhancement Module to focus on the inherent prior knowledge of infrared images, amplifying the grayscale gradient at the boundary between small targets and the background, while suppressing gradient variations within the background. This approach reduces clutter interference from complex backgrounds and highlights small targets within the image. Building on this, we constructed an Inter-layer Feature Fusion Module based on transposed convolution, which effectively minimizes the loss of high-frequency information of small targets during upsampling. It also makes full use of the semantic information from deep feature maps and the spatial location information from shallow feature maps. Additionally, we developed a Dilated Convolution Module that adjusts the receptive field size to filter out background clutter and then extract fine features of small targets, addressing the problem of losing small target features in the deeper layers of network. Extensive experiments show that TODENet achieves state-of-the-art performance on the NUAA-SIRST, NUDT-SIRST, and IRSTD-1k datasets, with target-level detection rates (Pd) of 97.710%, 99.649%, and 94.218%, respectively. The source code of our work is available at https://github.com/LYC-1021/TODE-Net.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Infrared Physics & Technology 物理-光学

CiteScore

5.70

自引率

12.10%

发文量

400

审稿时长

67 days

期刊介绍： The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region. Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine. Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.