单帧红外图像中的点源目标检测与定位

Daniel C. Stumpp, Andrew J. Byrne, Alan D. George
{"title":"单帧红外图像中的点源目标检测与定位","authors":"Daniel C. Stumpp, Andrew J. Byrne, Alan D. George","doi":"10.1109/AERO55745.2023.10115662","DOIUrl":null,"url":null,"abstract":"Potential military threats often manifest as dim point-source targets embedded in complex clutter and noise back-grounds, which makes threat detection a significant challenge. A variety of machine-learning architectures have been developed in recent years for performing small-object segmentation in single frames of infrared imagery. Evaluation and comparison of these techniques has been hampered by a lack of reliably labeled data and the use of different evaluation metrics. In this research, we leverage the Air Force Institute of Technology Sensor and Scene Emulation Tool (ASSET) to generate a dataset containing independent frames with unique background and target characteristics. We introduce a standardized method for generating ground-truth segmentation masks for point-source targets that eliminates the risk of manual labeling errors that exist in other small-target segmentation datasets. A local peak signal-to-clutter-and-noise ratio (pSCNR) is also introduced and shown to be strongly correlated to probability of detection. Results show that with the use of the generated dataset, existing state-of-the-art small-object segmentation networks can be adapted specifically to the point-source target detection task. A probability of detection (Pd) greater than 80% is consistently achieved while maintaining low false alarm rates. In addition to the task of target detection, we address the problem of target subpixel localization in a single frame. Accurate subpixel localization is important due to the large physical area included in a single pixel. Existing work commonly overlooks this problem or takes the predicted target mask centroid as the subpixel location. In this research, we introduce a transformer-based subpixel localization technique that uses both the predicted target mask and the local pixel intensity to compute an accurate subpixel location. The proposed architecture reduces mean localization error by up to 72% compared to other single-frame methods for target subpixel localization.","PeriodicalId":344285,"journal":{"name":"2023 IEEE Aerospace Conference","volume":"297 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Point-Source Target Detection and Localization in Single-Frame Infrared Imagery\",\"authors\":\"Daniel C. Stumpp, Andrew J. Byrne, Alan D. George\",\"doi\":\"10.1109/AERO55745.2023.10115662\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Potential military threats often manifest as dim point-source targets embedded in complex clutter and noise back-grounds, which makes threat detection a significant challenge. A variety of machine-learning architectures have been developed in recent years for performing small-object segmentation in single frames of infrared imagery. Evaluation and comparison of these techniques has been hampered by a lack of reliably labeled data and the use of different evaluation metrics. In this research, we leverage the Air Force Institute of Technology Sensor and Scene Emulation Tool (ASSET) to generate a dataset containing independent frames with unique background and target characteristics. We introduce a standardized method for generating ground-truth segmentation masks for point-source targets that eliminates the risk of manual labeling errors that exist in other small-target segmentation datasets. A local peak signal-to-clutter-and-noise ratio (pSCNR) is also introduced and shown to be strongly correlated to probability of detection. Results show that with the use of the generated dataset, existing state-of-the-art small-object segmentation networks can be adapted specifically to the point-source target detection task. A probability of detection (Pd) greater than 80% is consistently achieved while maintaining low false alarm rates. In addition to the task of target detection, we address the problem of target subpixel localization in a single frame. Accurate subpixel localization is important due to the large physical area included in a single pixel. Existing work commonly overlooks this problem or takes the predicted target mask centroid as the subpixel location. In this research, we introduce a transformer-based subpixel localization technique that uses both the predicted target mask and the local pixel intensity to compute an accurate subpixel location. The proposed architecture reduces mean localization error by up to 72% compared to other single-frame methods for target subpixel localization.\",\"PeriodicalId\":344285,\"journal\":{\"name\":\"2023 IEEE Aerospace Conference\",\"volume\":\"297 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE Aerospace Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AERO55745.2023.10115662\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Aerospace Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AERO55745.2023.10115662","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

潜在的军事威胁通常表现为嵌入在复杂杂波和噪声背景中的微弱点源目标,这给威胁检测带来了重大挑战。近年来,人们开发了各种机器学习架构,用于在单帧红外图像中执行小目标分割。由于缺乏可靠的标记数据和使用不同的评价指标,这些技术的评价和比较受到阻碍。在这项研究中,我们利用空军技术研究所的传感器和场景仿真工具(ASSET)来生成一个包含具有独特背景和目标特征的独立帧的数据集。我们引入了一种标准化的方法,用于为点源目标生成真值分割掩码,从而消除了存在于其他小目标分割数据集中的人工标记错误的风险。本文还引入了局部峰值信杂噪比(pSCNR),并证明其与检测概率密切相关。结果表明,使用生成的数据集,现有的最先进的小目标分割网络可以专门用于点源目标检测任务。在保持低虚警率的同时,始终实现大于80%的检测概率(Pd)。除了目标检测任务外,我们还解决了单帧目标亚像素定位问题。由于单个像素中包含的物理区域很大,因此精确的亚像素定位非常重要。现有的工作通常忽略了这个问题,或者将预测的目标掩模质心作为亚像素位置。在本研究中,我们引入了一种基于变压器的亚像素定位技术,该技术使用预测的目标掩模和局部像素强度来计算精确的亚像素位置。与其他单帧目标亚像素定位方法相比,该方法的平均定位误差降低了72%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Point-Source Target Detection and Localization in Single-Frame Infrared Imagery
Potential military threats often manifest as dim point-source targets embedded in complex clutter and noise back-grounds, which makes threat detection a significant challenge. A variety of machine-learning architectures have been developed in recent years for performing small-object segmentation in single frames of infrared imagery. Evaluation and comparison of these techniques has been hampered by a lack of reliably labeled data and the use of different evaluation metrics. In this research, we leverage the Air Force Institute of Technology Sensor and Scene Emulation Tool (ASSET) to generate a dataset containing independent frames with unique background and target characteristics. We introduce a standardized method for generating ground-truth segmentation masks for point-source targets that eliminates the risk of manual labeling errors that exist in other small-target segmentation datasets. A local peak signal-to-clutter-and-noise ratio (pSCNR) is also introduced and shown to be strongly correlated to probability of detection. Results show that with the use of the generated dataset, existing state-of-the-art small-object segmentation networks can be adapted specifically to the point-source target detection task. A probability of detection (Pd) greater than 80% is consistently achieved while maintaining low false alarm rates. In addition to the task of target detection, we address the problem of target subpixel localization in a single frame. Accurate subpixel localization is important due to the large physical area included in a single pixel. Existing work commonly overlooks this problem or takes the predicted target mask centroid as the subpixel location. In this research, we introduce a transformer-based subpixel localization technique that uses both the predicted target mask and the local pixel intensity to compute an accurate subpixel location. The proposed architecture reduces mean localization error by up to 72% compared to other single-frame methods for target subpixel localization.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信