LKAT-GAN: A GAN for Thermal Infrared Image Colorization Based on Large Kernel and AttentionUNet-Transformer

IF 10.9 2区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Youwei He;Xin Jin;Qian Jiang;Zien Cheng;Puming Wang;Wei Zhou
{"title":"LKAT-GAN: A GAN for Thermal Infrared Image Colorization Based on Large Kernel and AttentionUNet-Transformer","authors":"Youwei He;Xin Jin;Qian Jiang;Zien Cheng;Puming Wang;Wei Zhou","doi":"10.1109/TCE.2023.3280165","DOIUrl":null,"url":null,"abstract":"Because thermal infrared (TIR) images are not affected by light and foggy environments, which are widely used in various night traffic scenarios. Especially, thermal infrared images also play an important role in autonomous vehicles. However, low contrast and lack of chromaticity have always been their problems. Image colorization is a vital technique to improve the quality of TIR images, which is beneficial to human interpretation and downstream tasks. Despite thermal infrared image colorization methods have been rapidly improved, the detail blurriness and color distortion in colorized images remain under-addressed. Mostly because these methods cannot effectively extract the ambiguous feature information of TIR images. Hence, we propose a large kernel (LK) U-Net and Attention_U-Net-Transformer (ViT-Based) based generative adversarial network. An LK_U-Net is designed to extract the feature of TIR images. Then, a branch structure composed of Attention_U-Net and ViT-Based can provide the network with semantic information from different perspectives to decode features. In addition, a composite loss function is employed to ensure the network generates a high-quality colorized image. The proposed method is evaluated on KAIST and IRVI datasets. Experimental results demonstrate the superiority of the proposed LKAT-GAN over other methods for the task of thermal infrared image colorization. The code is available at \n<uri>https://github.com/jinxinhuo/LKAT-GAN</uri>\n.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"69 3","pages":"478-489"},"PeriodicalIF":10.9000,"publicationDate":"2023-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10153639/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Because thermal infrared (TIR) images are not affected by light and foggy environments, which are widely used in various night traffic scenarios. Especially, thermal infrared images also play an important role in autonomous vehicles. However, low contrast and lack of chromaticity have always been their problems. Image colorization is a vital technique to improve the quality of TIR images, which is beneficial to human interpretation and downstream tasks. Despite thermal infrared image colorization methods have been rapidly improved, the detail blurriness and color distortion in colorized images remain under-addressed. Mostly because these methods cannot effectively extract the ambiguous feature information of TIR images. Hence, we propose a large kernel (LK) U-Net and Attention_U-Net-Transformer (ViT-Based) based generative adversarial network. An LK_U-Net is designed to extract the feature of TIR images. Then, a branch structure composed of Attention_U-Net and ViT-Based can provide the network with semantic information from different perspectives to decode features. In addition, a composite loss function is employed to ensure the network generates a high-quality colorized image. The proposed method is evaluated on KAIST and IRVI datasets. Experimental results demonstrate the superiority of the proposed LKAT-GAN over other methods for the task of thermal infrared image colorization. The code is available at https://github.com/jinxinhuo/LKAT-GAN .
LKAT-GAN:一种基于大核和注意力变换的热红外图像着色GAN
由于热红外(TIR)图像不受光和雾环境的影响,因此广泛应用于各种夜间交通场景。特别是热红外图像在自动驾驶汽车中也发挥着重要作用。然而,低对比度和缺乏色度一直是他们的问题。图像彩色化是提高TIR图像质量的重要技术,有利于人类的判读和下游任务。尽管热红外图像的彩色化方法得到了迅速的改进,但彩色图像中的细节模糊和颜色失真问题仍然没有得到解决。主要是因为这些方法不能有效地提取TIR图像的模糊特征信息。因此,我们提出了一种基于大内核(LK)U-Net和Attention_U-Net-Transformer(基于ViT)的生成对抗性网络。LK_U-Net被设计用于提取TIR图像的特征。然后,由Attention_U-Net和ViT-Based组成的分支结构可以从不同的角度为网络提供语义信息来解码特征。此外,还采用了复合损失函数来确保网络生成高质量的彩色图像。在KAIST和IRVI数据集上对所提出的方法进行了评估。实验结果证明了所提出的LKAT-GAN在热红外图像彩色化方面优于其他方法。代码可在https://github.com/jinxinhuo/LKAT-GAN.
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.70
自引率
9.30%
发文量
59
审稿时长
3.3 months
期刊介绍: The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信