{"title":"Unformer: A Transformer-Based Approach for Adaptive Multiscale Feature Aggregation in Underwater Image Enhancement","authors":"Yuhao Qing;Yueying Wang;Huaicheng Yan;Xiangpeng Xie;Zhengguang Wu","doi":"10.1109/TAI.2024.3508667","DOIUrl":null,"url":null,"abstract":"Underwater imaging is often compromised by light scattering and absorption, resulting in image degradation and distortion. This manifests as blurred details, color shifts, and diminished illumination and contrast, thereby hindering advancements in underwater research. To mitigate these issues, we propose Unformer, an innovative underwater image enhancement (UIE) technique that leverages a transformer-based architecture for multiscale adaptive feature aggregation. Our approach employs a multiscale feature fusion strategy that adaptively restores illumination and detail features. We reevaluate the relationship between convolution and transformer to develop a novel encoder structure. This structure effectively integrates both long-range and short-range dependencies, dynamically combines local and global features, and constructs a comprehensive global context. Furthermore, we propose a unique multibranch decoder architecture that enhances and efficiently extracts spatial context information through the transformer module. Extensive experiments on three datasets demonstrate that our proposed method outperforms other techniques in both subjective and objective evaluations. Compared with the latest methods, Unformer has improved the peak signal-to-noise ratio (PSNR) by 19.5% and 14.8% respectively on the LSUI and EUVP datasets. The code is available at: <uri>https://github.com/yhflq/Unformer</uri>.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 4","pages":"1024-1037"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10771743/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Underwater imaging is often compromised by light scattering and absorption, resulting in image degradation and distortion. This manifests as blurred details, color shifts, and diminished illumination and contrast, thereby hindering advancements in underwater research. To mitigate these issues, we propose Unformer, an innovative underwater image enhancement (UIE) technique that leverages a transformer-based architecture for multiscale adaptive feature aggregation. Our approach employs a multiscale feature fusion strategy that adaptively restores illumination and detail features. We reevaluate the relationship between convolution and transformer to develop a novel encoder structure. This structure effectively integrates both long-range and short-range dependencies, dynamically combines local and global features, and constructs a comprehensive global context. Furthermore, we propose a unique multibranch decoder architecture that enhances and efficiently extracts spatial context information through the transformer module. Extensive experiments on three datasets demonstrate that our proposed method outperforms other techniques in both subjective and objective evaluations. Compared with the latest methods, Unformer has improved the peak signal-to-noise ratio (PSNR) by 19.5% and 14.8% respectively on the LSUI and EUVP datasets. The code is available at: https://github.com/yhflq/Unformer.