Peichao Wang, Jiabao Wang, Yao Chen, Rui Zhang, Yang Li, Zhuang Miao
{"title":"Paying more attention to local contrast: Improving infrared small target detection performance via prior knowledge","authors":"Peichao Wang, Jiabao Wang, Yao Chen, Rui Zhang, Yang Li, Zhuang Miao","doi":"10.1016/j.engappai.2025.110244","DOIUrl":null,"url":null,"abstract":"<div><div>The data-driven methods for InfraRed Small Target Detection (IRSTD) have achieved promising results. However, these methods typically incorporate modules with high computational complexity, which enhance performance at the expense of computational efficiency. Utilizing human expert knowledge to assist data-driven methods in better learning with less costs is worthy of exploration. To effectively guide the model to focus on targets’ spatial features, this paper proposes the Local Contrast Attention Enhanced infrared small target detection Network (LCAE-Net), combining prior knowledge with data-driven deep learning methods. LCAE-Net is a U-shaped neural network model which consists of two developed modules: a Local Contrast Enhancement (LCE) module and a Channel Attention Enhancement (CAE) module. The LCE module takes advantage of prior knowledge, leveraging handcrafted convolution operators to acquire Local Contrast Attention (LCA), which could realize background suppression while enhancing the potential target region, thus guiding the neural network to pay more attention to potential infrared small targets’ location information. To effectively utilize the response information throughout the downsampling progresses, the CAE module is proposed to achieve the information fusion among feature maps’ different channels. Experimental results indicate that our LCAE-Net outperforms comparison methods on the three public datasets, and its detection speed could reach up to 70 Frames Per Second (FPS). Meanwhile, our model has a parameter count and Floating-Point Operations (FLOPs) of 1.945 Million (M) and 4.862 Giga (G) respectively, which is suitable for deployment on edge devices. Our code will be available at <span><span>https://github.com/boa2004plaust/LCAENet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"146 ","pages":"Article 110244"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625002441","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The data-driven methods for InfraRed Small Target Detection (IRSTD) have achieved promising results. However, these methods typically incorporate modules with high computational complexity, which enhance performance at the expense of computational efficiency. Utilizing human expert knowledge to assist data-driven methods in better learning with less costs is worthy of exploration. To effectively guide the model to focus on targets’ spatial features, this paper proposes the Local Contrast Attention Enhanced infrared small target detection Network (LCAE-Net), combining prior knowledge with data-driven deep learning methods. LCAE-Net is a U-shaped neural network model which consists of two developed modules: a Local Contrast Enhancement (LCE) module and a Channel Attention Enhancement (CAE) module. The LCE module takes advantage of prior knowledge, leveraging handcrafted convolution operators to acquire Local Contrast Attention (LCA), which could realize background suppression while enhancing the potential target region, thus guiding the neural network to pay more attention to potential infrared small targets’ location information. To effectively utilize the response information throughout the downsampling progresses, the CAE module is proposed to achieve the information fusion among feature maps’ different channels. Experimental results indicate that our LCAE-Net outperforms comparison methods on the three public datasets, and its detection speed could reach up to 70 Frames Per Second (FPS). Meanwhile, our model has a parameter count and Floating-Point Operations (FLOPs) of 1.945 Million (M) and 4.862 Giga (G) respectively, which is suitable for deployment on edge devices. Our code will be available at https://github.com/boa2004plaust/LCAENet.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.