ISTDiffuser: Infrared small target image generation via conditional denoising diffusion with contrastive learning

IF 4.6 2区 物理与天体物理 Q1 OPTICS
Dongyuan Zang , Weihua Su , Yan Xu , Xu Dang , Ziyi Liu
{"title":"ISTDiffuser: Infrared small target image generation via conditional denoising diffusion with contrastive learning","authors":"Dongyuan Zang ,&nbsp;Weihua Su ,&nbsp;Yan Xu ,&nbsp;Xu Dang ,&nbsp;Ziyi Liu","doi":"10.1016/j.optlastec.2025.113557","DOIUrl":null,"url":null,"abstract":"<div><div>Infrared small target detection (IRSTD) technology is widely applied in modern military systems, such as missile guidance, night vision reconnaissance, target identification, and tracking. Due to the high costs in terms of time, labor, and finances, field experiments for IRSTD are limited and not comprehensive, making sample image collection challenging. Most publicly available datasets are synthesized and manually annotated, which is labor-intensive. Consequently, the number of images in these datasets is limited, and they fail to adequately cover the wide range of scene variations. To overcome these challenges, we propose ISTDiffuser, a novel single-frame infrared small target (SIRST) image generation method based on conditional diffusion and contrastive learning. This is the first time a diffusion model has been applied to SIRST image generation tasks, making this work pioneering in the field. In our approach, we introduce a scene-target interaction generation module that ensures the seamless integration of targets into their backgrounds, enhancing the realism and diversity of the synthetic data. Additionally, to better generate images consistent with the style of the instance images, we propose a style-type contrastive learning module. This module employs contrastive loss to guide the diffusion model, enhancing the realism and coherence of the generated images. Experimental results indicate that ISTDiffuser exhibits outstanding image generation capabilities. Furthermore, the generated images show high quality and stability in detection tasks, offering a promising solution to data scarcity in practice. The complete code will be made accessible via the repository at <span><span>https://github.com/Tianzishu/istdiffuser</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":19511,"journal":{"name":"Optics and Laser Technology","volume":"192 ","pages":"Article 113557"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics and Laser Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003039922501148X","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Infrared small target detection (IRSTD) technology is widely applied in modern military systems, such as missile guidance, night vision reconnaissance, target identification, and tracking. Due to the high costs in terms of time, labor, and finances, field experiments for IRSTD are limited and not comprehensive, making sample image collection challenging. Most publicly available datasets are synthesized and manually annotated, which is labor-intensive. Consequently, the number of images in these datasets is limited, and they fail to adequately cover the wide range of scene variations. To overcome these challenges, we propose ISTDiffuser, a novel single-frame infrared small target (SIRST) image generation method based on conditional diffusion and contrastive learning. This is the first time a diffusion model has been applied to SIRST image generation tasks, making this work pioneering in the field. In our approach, we introduce a scene-target interaction generation module that ensures the seamless integration of targets into their backgrounds, enhancing the realism and diversity of the synthetic data. Additionally, to better generate images consistent with the style of the instance images, we propose a style-type contrastive learning module. This module employs contrastive loss to guide the diffusion model, enhancing the realism and coherence of the generated images. Experimental results indicate that ISTDiffuser exhibits outstanding image generation capabilities. Furthermore, the generated images show high quality and stability in detection tasks, offering a promising solution to data scarcity in practice. The complete code will be made accessible via the repository at https://github.com/Tianzishu/istdiffuser.
ISTDiffuser:通过条件去噪扩散和对比学习生成红外小目标图像
红外小目标探测技术在导弹制导、夜视侦察、目标识别和跟踪等现代军事系统中有着广泛的应用。由于时间、人力和财力方面的高成本,IRSTD的现场实验有限且不全面,使得样本图像采集具有挑战性。大多数公开可用的数据集都是人工合成和注释的,这是一项劳动密集型工作。因此,这些数据集中的图像数量是有限的,并且它们无法充分覆盖大范围的场景变化。为了克服这些挑战,我们提出了一种基于条件扩散和对比学习的单帧红外小目标(SIRST)图像生成方法ISTDiffuser。这是第一次将扩散模型应用于SIRST图像生成任务,使这项工作在该领域具有开创性。在我们的方法中,我们引入了一个场景-目标交互生成模块,确保目标与背景的无缝集成,增强合成数据的真实感和多样性。此外,为了更好地生成与实例图像风格一致的图像,我们提出了风格-类型对比学习模块。该模块采用对比损失来指导扩散模型,增强生成图像的真实感和连贯性。实验结果表明,ISTDiffuser具有出色的图像生成能力。此外,生成的图像在检测任务中表现出高质量和稳定性,为解决实际应用中的数据稀缺性问题提供了一种有希望的解决方案。完整的代码可以通过https://github.com/Tianzishu/istdiffuser的存储库访问。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.50
自引率
10.00%
发文量
1060
审稿时长
3.4 months
期刊介绍: Optics & Laser Technology aims to provide a vehicle for the publication of a broad range of high quality research and review papers in those fields of scientific and engineering research appertaining to the development and application of the technology of optics and lasers. Papers describing original work in these areas are submitted to rigorous refereeing prior to acceptance for publication. The scope of Optics & Laser Technology encompasses, but is not restricted to, the following areas: •development in all types of lasers •developments in optoelectronic devices and photonics •developments in new photonics and optical concepts •developments in conventional optics, optical instruments and components •techniques of optical metrology, including interferometry and optical fibre sensors •LIDAR and other non-contact optical measurement techniques, including optical methods in heat and fluid flow •applications of lasers to materials processing, optical NDT display (including holography) and optical communication •research and development in the field of laser safety including studies of hazards resulting from the applications of lasers (laser safety, hazards of laser fume) •developments in optical computing and optical information processing •developments in new optical materials •developments in new optical characterization methods and techniques •developments in quantum optics •developments in light assisted micro and nanofabrication methods and techniques •developments in nanophotonics and biophotonics •developments in imaging processing and systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信