Dongyuan Zang , Weihua Su , Yan Xu , Xu Dang , Ziyi Liu
{"title":"ISTDiffuser: Infrared small target image generation via conditional denoising diffusion with contrastive learning","authors":"Dongyuan Zang , Weihua Su , Yan Xu , Xu Dang , Ziyi Liu","doi":"10.1016/j.optlastec.2025.113557","DOIUrl":null,"url":null,"abstract":"<div><div>Infrared small target detection (IRSTD) technology is widely applied in modern military systems, such as missile guidance, night vision reconnaissance, target identification, and tracking. Due to the high costs in terms of time, labor, and finances, field experiments for IRSTD are limited and not comprehensive, making sample image collection challenging. Most publicly available datasets are synthesized and manually annotated, which is labor-intensive. Consequently, the number of images in these datasets is limited, and they fail to adequately cover the wide range of scene variations. To overcome these challenges, we propose ISTDiffuser, a novel single-frame infrared small target (SIRST) image generation method based on conditional diffusion and contrastive learning. This is the first time a diffusion model has been applied to SIRST image generation tasks, making this work pioneering in the field. In our approach, we introduce a scene-target interaction generation module that ensures the seamless integration of targets into their backgrounds, enhancing the realism and diversity of the synthetic data. Additionally, to better generate images consistent with the style of the instance images, we propose a style-type contrastive learning module. This module employs contrastive loss to guide the diffusion model, enhancing the realism and coherence of the generated images. Experimental results indicate that ISTDiffuser exhibits outstanding image generation capabilities. Furthermore, the generated images show high quality and stability in detection tasks, offering a promising solution to data scarcity in practice. The complete code will be made accessible via the repository at <span><span>https://github.com/Tianzishu/istdiffuser</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":19511,"journal":{"name":"Optics and Laser Technology","volume":"192 ","pages":"Article 113557"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics and Laser Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003039922501148X","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Infrared small target detection (IRSTD) technology is widely applied in modern military systems, such as missile guidance, night vision reconnaissance, target identification, and tracking. Due to the high costs in terms of time, labor, and finances, field experiments for IRSTD are limited and not comprehensive, making sample image collection challenging. Most publicly available datasets are synthesized and manually annotated, which is labor-intensive. Consequently, the number of images in these datasets is limited, and they fail to adequately cover the wide range of scene variations. To overcome these challenges, we propose ISTDiffuser, a novel single-frame infrared small target (SIRST) image generation method based on conditional diffusion and contrastive learning. This is the first time a diffusion model has been applied to SIRST image generation tasks, making this work pioneering in the field. In our approach, we introduce a scene-target interaction generation module that ensures the seamless integration of targets into their backgrounds, enhancing the realism and diversity of the synthetic data. Additionally, to better generate images consistent with the style of the instance images, we propose a style-type contrastive learning module. This module employs contrastive loss to guide the diffusion model, enhancing the realism and coherence of the generated images. Experimental results indicate that ISTDiffuser exhibits outstanding image generation capabilities. Furthermore, the generated images show high quality and stability in detection tasks, offering a promising solution to data scarcity in practice. The complete code will be made accessible via the repository at https://github.com/Tianzishu/istdiffuser.
期刊介绍:
Optics & Laser Technology aims to provide a vehicle for the publication of a broad range of high quality research and review papers in those fields of scientific and engineering research appertaining to the development and application of the technology of optics and lasers. Papers describing original work in these areas are submitted to rigorous refereeing prior to acceptance for publication.
The scope of Optics & Laser Technology encompasses, but is not restricted to, the following areas:
•development in all types of lasers
•developments in optoelectronic devices and photonics
•developments in new photonics and optical concepts
•developments in conventional optics, optical instruments and components
•techniques of optical metrology, including interferometry and optical fibre sensors
•LIDAR and other non-contact optical measurement techniques, including optical methods in heat and fluid flow
•applications of lasers to materials processing, optical NDT display (including holography) and optical communication
•research and development in the field of laser safety including studies of hazards resulting from the applications of lasers (laser safety, hazards of laser fume)
•developments in optical computing and optical information processing
•developments in new optical materials
•developments in new optical characterization methods and techniques
•developments in quantum optics
•developments in light assisted micro and nanofabrication methods and techniques
•developments in nanophotonics and biophotonics
•developments in imaging processing and systems