{"title":"LKAT-GAN: A GAN for Thermal Infrared Image Colorization Based on Large Kernel and AttentionUNet-Transformer","authors":"Youwei He;Xin Jin;Qian Jiang;Zien Cheng;Puming Wang;Wei Zhou","doi":"10.1109/TCE.2023.3280165","DOIUrl":null,"url":null,"abstract":"Because thermal infrared (TIR) images are not affected by light and foggy environments, which are widely used in various night traffic scenarios. Especially, thermal infrared images also play an important role in autonomous vehicles. However, low contrast and lack of chromaticity have always been their problems. Image colorization is a vital technique to improve the quality of TIR images, which is beneficial to human interpretation and downstream tasks. Despite thermal infrared image colorization methods have been rapidly improved, the detail blurriness and color distortion in colorized images remain under-addressed. Mostly because these methods cannot effectively extract the ambiguous feature information of TIR images. Hence, we propose a large kernel (LK) U-Net and Attention_U-Net-Transformer (ViT-Based) based generative adversarial network. An LK_U-Net is designed to extract the feature of TIR images. Then, a branch structure composed of Attention_U-Net and ViT-Based can provide the network with semantic information from different perspectives to decode features. In addition, a composite loss function is employed to ensure the network generates a high-quality colorized image. The proposed method is evaluated on KAIST and IRVI datasets. Experimental results demonstrate the superiority of the proposed LKAT-GAN over other methods for the task of thermal infrared image colorization. The code is available at \n<uri>https://github.com/jinxinhuo/LKAT-GAN</uri>\n.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"69 3","pages":"478-489"},"PeriodicalIF":10.9000,"publicationDate":"2023-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10153639/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Because thermal infrared (TIR) images are not affected by light and foggy environments, which are widely used in various night traffic scenarios. Especially, thermal infrared images also play an important role in autonomous vehicles. However, low contrast and lack of chromaticity have always been their problems. Image colorization is a vital technique to improve the quality of TIR images, which is beneficial to human interpretation and downstream tasks. Despite thermal infrared image colorization methods have been rapidly improved, the detail blurriness and color distortion in colorized images remain under-addressed. Mostly because these methods cannot effectively extract the ambiguous feature information of TIR images. Hence, we propose a large kernel (LK) U-Net and Attention_U-Net-Transformer (ViT-Based) based generative adversarial network. An LK_U-Net is designed to extract the feature of TIR images. Then, a branch structure composed of Attention_U-Net and ViT-Based can provide the network with semantic information from different perspectives to decode features. In addition, a composite loss function is employed to ensure the network generates a high-quality colorized image. The proposed method is evaluated on KAIST and IRVI datasets. Experimental results demonstrate the superiority of the proposed LKAT-GAN over other methods for the task of thermal infrared image colorization. The code is available at
https://github.com/jinxinhuo/LKAT-GAN
.
期刊介绍:
The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.