{"title":"AER-Net:用于红外小目标探测的自适应特征增强和分层细化网络","authors":"Fuqing Zhang;Jing Yang;Shen Deng;Anning Pan;Yang Yang;Chengjiang Zhou","doi":"10.1109/TIM.2024.3485456","DOIUrl":null,"url":null,"abstract":"The infrared small target detection (IRSTD) has significant value in many civilian and military applications. However, most current methods still struggle to maintain tiny targets with low contrast and are prone to false alarms caused by confusing noise similar to the targets. Furthermore, challenges remain in accurately segmenting target shapes due to the difficulty in acquiring sufficient fine-grained details. To address these issues, we propose the adaptive feature enhancement and hierarchical refinement network (AER-Net) to extract discriminative features of small targets while leveraging the distinct semantics of different layers for reducing false alarms and preserving edge contours of targets. Specifically, we present a U-shaped architecture tailored for IRSTD by a feature representation enhancement module (FREM), which combines the local correlation of convolution neural networks (CNNs) and the global correlation of Transformers to enhance features while preventing the loss of tiny targets. Based on this, we further devise a daisy-shaped correlation-aware module (DCM) to capture sufficient global contexts for mitigating clutter interference in deeper layers. Subsequently, a gated refinement module (GRM) is designed to refine low-level features from the encoder utilizing high-level features as guidance for preserving local details and further removing noise in shallower layers. Moreover, we integrate our method with the monitoring device to extend the application of IRSTD to automatic monitoring of birds in plateau lakes and wetlands, while contributing an infrared bird dataset named BSIRST_v1 with targets exhibiting high morphological uncertainty to advance research in this field. Extensive experiments on three public and the proposed datasets have demonstrated that our method achieves a balance between detection capability and segmentation accuracy. Notably, our approach has significant improvements in the mean intersection over union (IoU) metric on the challenging benchmark NUDT-SIRST and IRSTD-1k datasets with 4.46% and 3.86%, respectively. The source code and dataset are available at: \n<uri>https://github.com/fuqingzhang/AER-Net</uri>\n.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"73 ","pages":"1-18"},"PeriodicalIF":5.6000,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AER-Net: Adaptive Feature Enhancement and Hierarchical Refinement Network for Infrared Small Target Detection\",\"authors\":\"Fuqing Zhang;Jing Yang;Shen Deng;Anning Pan;Yang Yang;Chengjiang Zhou\",\"doi\":\"10.1109/TIM.2024.3485456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The infrared small target detection (IRSTD) has significant value in many civilian and military applications. However, most current methods still struggle to maintain tiny targets with low contrast and are prone to false alarms caused by confusing noise similar to the targets. Furthermore, challenges remain in accurately segmenting target shapes due to the difficulty in acquiring sufficient fine-grained details. To address these issues, we propose the adaptive feature enhancement and hierarchical refinement network (AER-Net) to extract discriminative features of small targets while leveraging the distinct semantics of different layers for reducing false alarms and preserving edge contours of targets. Specifically, we present a U-shaped architecture tailored for IRSTD by a feature representation enhancement module (FREM), which combines the local correlation of convolution neural networks (CNNs) and the global correlation of Transformers to enhance features while preventing the loss of tiny targets. Based on this, we further devise a daisy-shaped correlation-aware module (DCM) to capture sufficient global contexts for mitigating clutter interference in deeper layers. Subsequently, a gated refinement module (GRM) is designed to refine low-level features from the encoder utilizing high-level features as guidance for preserving local details and further removing noise in shallower layers. Moreover, we integrate our method with the monitoring device to extend the application of IRSTD to automatic monitoring of birds in plateau lakes and wetlands, while contributing an infrared bird dataset named BSIRST_v1 with targets exhibiting high morphological uncertainty to advance research in this field. Extensive experiments on three public and the proposed datasets have demonstrated that our method achieves a balance between detection capability and segmentation accuracy. Notably, our approach has significant improvements in the mean intersection over union (IoU) metric on the challenging benchmark NUDT-SIRST and IRSTD-1k datasets with 4.46% and 3.86%, respectively. The source code and dataset are available at: \\n<uri>https://github.com/fuqingzhang/AER-Net</uri>\\n.\",\"PeriodicalId\":13341,\"journal\":{\"name\":\"IEEE Transactions on Instrumentation and Measurement\",\"volume\":\"73 \",\"pages\":\"1-18\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2024-10-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Instrumentation and Measurement\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10731950/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10731950/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
AER-Net: Adaptive Feature Enhancement and Hierarchical Refinement Network for Infrared Small Target Detection
The infrared small target detection (IRSTD) has significant value in many civilian and military applications. However, most current methods still struggle to maintain tiny targets with low contrast and are prone to false alarms caused by confusing noise similar to the targets. Furthermore, challenges remain in accurately segmenting target shapes due to the difficulty in acquiring sufficient fine-grained details. To address these issues, we propose the adaptive feature enhancement and hierarchical refinement network (AER-Net) to extract discriminative features of small targets while leveraging the distinct semantics of different layers for reducing false alarms and preserving edge contours of targets. Specifically, we present a U-shaped architecture tailored for IRSTD by a feature representation enhancement module (FREM), which combines the local correlation of convolution neural networks (CNNs) and the global correlation of Transformers to enhance features while preventing the loss of tiny targets. Based on this, we further devise a daisy-shaped correlation-aware module (DCM) to capture sufficient global contexts for mitigating clutter interference in deeper layers. Subsequently, a gated refinement module (GRM) is designed to refine low-level features from the encoder utilizing high-level features as guidance for preserving local details and further removing noise in shallower layers. Moreover, we integrate our method with the monitoring device to extend the application of IRSTD to automatic monitoring of birds in plateau lakes and wetlands, while contributing an infrared bird dataset named BSIRST_v1 with targets exhibiting high morphological uncertainty to advance research in this field. Extensive experiments on three public and the proposed datasets have demonstrated that our method achieves a balance between detection capability and segmentation accuracy. Notably, our approach has significant improvements in the mean intersection over union (IoU) metric on the challenging benchmark NUDT-SIRST and IRSTD-1k datasets with 4.46% and 3.86%, respectively. The source code and dataset are available at:
https://github.com/fuqingzhang/AER-Net
.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.