Tingting Liu , Yujue Cai , Guiping Chen , Hongguang Wei , Junqi Bai , Yuan Liu , Xiubao Sui , Qian Chen
{"title":"基于全尺度特征融合和余弦对比学习的无监督红外图像着色对抗网络","authors":"Tingting Liu , Yujue Cai , Guiping Chen , Hongguang Wei , Junqi Bai , Yuan Liu , Xiubao Sui , Qian Chen","doi":"10.1016/j.neucom.2025.130713","DOIUrl":null,"url":null,"abstract":"<div><div>Thermal infrared images, unaffected by lighting and haze, are widely used in security surveillance, autonomous vehicles, and nighttime traffic monitoring. However, their grayscale nature lacks color and texture details, limiting applications in image recognition and object detection. Converting infrared images to daytime color enhances visual perception and broadens their utility. Despite advancements in infrared image colorization, challenges such as texture distortion, detail blurring, and poor image quality persist. To address these issues, a novel unsupervised learning framework, termed Cosine Contrastive Learning Generative Adversarial Network (CCLGAN), is proposed. Firstly, the traditional UNet architecture is improved by introducing full-scale skip connections and deep supervision. Full-scale skip connections integrate low-level details with high-level semantic features, while deep supervision aids in learning hierarchical feature maps. Additionally, a parameter-free neuron-based 3D attention mechanism is incorporated into the Mamba module to capture long-range dependencies and enable effective feature selection and fusion. Secondly, a novel contrastive loss function is designed, incorporating cosine distance metrics into the traditional contrastive loss framework. By maximizing cosine decision margins and normalizing, intra-class variance is minimized, and inter-class variance is maximized, ensuring consistency between input infrared image patches and output color image patches. Finally, extensive comparative analysis on common datasets demonstrates that the proposed method outperforms existing state-of-the-art techniques in colorization performance. This research advances infrared image processing and enhances the visual quality of converted images. The code is available at <span><span>https://github.com/LTTdouble/CCLGAN</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"649 ","pages":"Article 130713"},"PeriodicalIF":6.5000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adversarial network for unsupervised infrared image colorization based on full-scale feature fusion and cosine contrastive learning\",\"authors\":\"Tingting Liu , Yujue Cai , Guiping Chen , Hongguang Wei , Junqi Bai , Yuan Liu , Xiubao Sui , Qian Chen\",\"doi\":\"10.1016/j.neucom.2025.130713\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Thermal infrared images, unaffected by lighting and haze, are widely used in security surveillance, autonomous vehicles, and nighttime traffic monitoring. However, their grayscale nature lacks color and texture details, limiting applications in image recognition and object detection. Converting infrared images to daytime color enhances visual perception and broadens their utility. Despite advancements in infrared image colorization, challenges such as texture distortion, detail blurring, and poor image quality persist. To address these issues, a novel unsupervised learning framework, termed Cosine Contrastive Learning Generative Adversarial Network (CCLGAN), is proposed. Firstly, the traditional UNet architecture is improved by introducing full-scale skip connections and deep supervision. Full-scale skip connections integrate low-level details with high-level semantic features, while deep supervision aids in learning hierarchical feature maps. Additionally, a parameter-free neuron-based 3D attention mechanism is incorporated into the Mamba module to capture long-range dependencies and enable effective feature selection and fusion. Secondly, a novel contrastive loss function is designed, incorporating cosine distance metrics into the traditional contrastive loss framework. By maximizing cosine decision margins and normalizing, intra-class variance is minimized, and inter-class variance is maximized, ensuring consistency between input infrared image patches and output color image patches. Finally, extensive comparative analysis on common datasets demonstrates that the proposed method outperforms existing state-of-the-art techniques in colorization performance. This research advances infrared image processing and enhances the visual quality of converted images. The code is available at <span><span>https://github.com/LTTdouble/CCLGAN</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":19268,\"journal\":{\"name\":\"Neurocomputing\",\"volume\":\"649 \",\"pages\":\"Article 130713\"},\"PeriodicalIF\":6.5000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neurocomputing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0925231225013852\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225013852","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Adversarial network for unsupervised infrared image colorization based on full-scale feature fusion and cosine contrastive learning
Thermal infrared images, unaffected by lighting and haze, are widely used in security surveillance, autonomous vehicles, and nighttime traffic monitoring. However, their grayscale nature lacks color and texture details, limiting applications in image recognition and object detection. Converting infrared images to daytime color enhances visual perception and broadens their utility. Despite advancements in infrared image colorization, challenges such as texture distortion, detail blurring, and poor image quality persist. To address these issues, a novel unsupervised learning framework, termed Cosine Contrastive Learning Generative Adversarial Network (CCLGAN), is proposed. Firstly, the traditional UNet architecture is improved by introducing full-scale skip connections and deep supervision. Full-scale skip connections integrate low-level details with high-level semantic features, while deep supervision aids in learning hierarchical feature maps. Additionally, a parameter-free neuron-based 3D attention mechanism is incorporated into the Mamba module to capture long-range dependencies and enable effective feature selection and fusion. Secondly, a novel contrastive loss function is designed, incorporating cosine distance metrics into the traditional contrastive loss framework. By maximizing cosine decision margins and normalizing, intra-class variance is minimized, and inter-class variance is maximized, ensuring consistency between input infrared image patches and output color image patches. Finally, extensive comparative analysis on common datasets demonstrates that the proposed method outperforms existing state-of-the-art techniques in colorization performance. This research advances infrared image processing and enhances the visual quality of converted images. The code is available at https://github.com/LTTdouble/CCLGAN.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.