{"title":"使用混合跳跃UNet和生成对抗网络的热红外图像着色","authors":"Hangying Liao;Qian Jiang;Xin Jin;Ling Liu;Lin Liu;Shin-Jye Lee;Wei Zhou","doi":"10.1109/TIV.2022.3218833","DOIUrl":null,"url":null,"abstract":"Common cameras cannot capture high quality image in the night or extreme weather conditions that without enough light, while thermal infrared(TIR) cameras are not limited in this situation. Hence, TIR imaging technique is widely used in military, surveillance, nighttime traffic and other scenarios. However, TIR images are monochromatic, and the majority of details of such images are lost, which are difficult for human or computer system to analyze. Translating TIR images into visible images is beneficial to subsequent observation or further processing. Though there are some advances to realize the transformation from TIR images to color visible images, edge distortion and semantic confusion remain to be solved. Therefore, we propose a Mixed-Skipping UNet(MS-UNet) based image colorization model joint Generative Adversarial Network, which is denoted by MUGAN. Firstly, the dense skip connections of UNet++ and full-scale skip connections of UNet 3+ are combined to form the MS-UNet, which is regarded as the generator. In addition, we design a feature extraction module in MS-UNet to effectively capture the multi-scale features in source image. Then, a novel attention mechanism module is designed for decoding stage which can help decoder of MS-UNet to focus on the important information. Moreover, we explore the effect of different loss functions in TIR image colorization task, and the loss function with excellent performance is selected to further optimize the training process of the model. Extensive experiments prove the superiority of our method in the task of TIR image colorization.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"8 4","pages":"2954-2969"},"PeriodicalIF":14.3000,"publicationDate":"2022-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"MUGAN: Thermal Infrared Image Colorization Using Mixed-Skipping UNet and Generative Adversarial Network\",\"authors\":\"Hangying Liao;Qian Jiang;Xin Jin;Ling Liu;Lin Liu;Shin-Jye Lee;Wei Zhou\",\"doi\":\"10.1109/TIV.2022.3218833\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Common cameras cannot capture high quality image in the night or extreme weather conditions that without enough light, while thermal infrared(TIR) cameras are not limited in this situation. Hence, TIR imaging technique is widely used in military, surveillance, nighttime traffic and other scenarios. However, TIR images are monochromatic, and the majority of details of such images are lost, which are difficult for human or computer system to analyze. Translating TIR images into visible images is beneficial to subsequent observation or further processing. Though there are some advances to realize the transformation from TIR images to color visible images, edge distortion and semantic confusion remain to be solved. Therefore, we propose a Mixed-Skipping UNet(MS-UNet) based image colorization model joint Generative Adversarial Network, which is denoted by MUGAN. Firstly, the dense skip connections of UNet++ and full-scale skip connections of UNet 3+ are combined to form the MS-UNet, which is regarded as the generator. In addition, we design a feature extraction module in MS-UNet to effectively capture the multi-scale features in source image. Then, a novel attention mechanism module is designed for decoding stage which can help decoder of MS-UNet to focus on the important information. Moreover, we explore the effect of different loss functions in TIR image colorization task, and the loss function with excellent performance is selected to further optimize the training process of the model. Extensive experiments prove the superiority of our method in the task of TIR image colorization.\",\"PeriodicalId\":36532,\"journal\":{\"name\":\"IEEE Transactions on Intelligent Vehicles\",\"volume\":\"8 4\",\"pages\":\"2954-2969\"},\"PeriodicalIF\":14.3000,\"publicationDate\":\"2022-11-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Intelligent Vehicles\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/9935274/\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Vehicles","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/9935274/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
MUGAN: Thermal Infrared Image Colorization Using Mixed-Skipping UNet and Generative Adversarial Network
Common cameras cannot capture high quality image in the night or extreme weather conditions that without enough light, while thermal infrared(TIR) cameras are not limited in this situation. Hence, TIR imaging technique is widely used in military, surveillance, nighttime traffic and other scenarios. However, TIR images are monochromatic, and the majority of details of such images are lost, which are difficult for human or computer system to analyze. Translating TIR images into visible images is beneficial to subsequent observation or further processing. Though there are some advances to realize the transformation from TIR images to color visible images, edge distortion and semantic confusion remain to be solved. Therefore, we propose a Mixed-Skipping UNet(MS-UNet) based image colorization model joint Generative Adversarial Network, which is denoted by MUGAN. Firstly, the dense skip connections of UNet++ and full-scale skip connections of UNet 3+ are combined to form the MS-UNet, which is regarded as the generator. In addition, we design a feature extraction module in MS-UNet to effectively capture the multi-scale features in source image. Then, a novel attention mechanism module is designed for decoding stage which can help decoder of MS-UNet to focus on the important information. Moreover, we explore the effect of different loss functions in TIR image colorization task, and the loss function with excellent performance is selected to further optimize the training process of the model. Extensive experiments prove the superiority of our method in the task of TIR image colorization.
期刊介绍:
The IEEE Transactions on Intelligent Vehicles (T-IV) is a premier platform for publishing peer-reviewed articles that present innovative research concepts, application results, significant theoretical findings, and application case studies in the field of intelligent vehicles. With a particular emphasis on automated vehicles within roadway environments, T-IV aims to raise awareness of pressing research and application challenges.
Our focus is on providing critical information to the intelligent vehicle community, serving as a dissemination vehicle for IEEE ITS Society members and others interested in learning about the state-of-the-art developments and progress in research and applications related to intelligent vehicles. Join us in advancing knowledge and innovation in this dynamic field.