{"title":"A Novel U-Shaped Hybrid Network for Single Image Dehazing","authors":"Zixin Zhang, Xin Li","doi":"10.1109/AINIT59027.2023.10212555","DOIUrl":null,"url":null,"abstract":"Image dehazing is a challenging problem due to its ill-posed parameter estimation. Despite the significant success of Convolutional Neural Network (CNNs), the inherent locality of CNNs remains a bottleneck for dehazing performance. Though Transformers mitigate the shortcomings of CNNs and have demonstrated promising performance in high-level vision task, the inherent computational complexity makes them infeasible for low-level vision task. In this work, an efficient U-shaped Convolution and Transformer hybrid network, called UCPformer, is proposed. Specifically, Channel Enhanced Transformer (CET) and Efficient Pixel Enhanced Transformer (EPET) is designed in this paper for efficient encoding and decoding of hazy image features. The CET inherits the local representation capability of CNN and general architecture of Transformer, extracting local information efficiently and treating different channels unequally. The EPET inherits the global context modeling capability of Transformer, treating different pixels unequally with linear complexity. Experiments demonstrate the proposed UCPformer achieve superior performance against other dehazing methods.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AINIT59027.2023.10212555","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Image dehazing is a challenging problem due to its ill-posed parameter estimation. Despite the significant success of Convolutional Neural Network (CNNs), the inherent locality of CNNs remains a bottleneck for dehazing performance. Though Transformers mitigate the shortcomings of CNNs and have demonstrated promising performance in high-level vision task, the inherent computational complexity makes them infeasible for low-level vision task. In this work, an efficient U-shaped Convolution and Transformer hybrid network, called UCPformer, is proposed. Specifically, Channel Enhanced Transformer (CET) and Efficient Pixel Enhanced Transformer (EPET) is designed in this paper for efficient encoding and decoding of hazy image features. The CET inherits the local representation capability of CNN and general architecture of Transformer, extracting local information efficiently and treating different channels unequally. The EPET inherits the global context modeling capability of Transformer, treating different pixels unequally with linear complexity. Experiments demonstrate the proposed UCPformer achieve superior performance against other dehazing methods.