{"title":"UWMamba: UnderWater Image Enhancement With State Space Model","authors":"Guanhua An;Ao He;Yudong Wang;Jichang Guo","doi":"10.1109/LSP.2024.3470752","DOIUrl":null,"url":null,"abstract":"Recently, state space models (SSM) with efficient design, i.e., Mamba, have shown great potential in modeling long-range dependencies with linear complexity. However, the pure SSM-based model yields sub-optimal underwater enhancement performance due to insufficient local details. Given the superiority of convolution in local perception, we propose a hybrid network, named UWMamba, which combines SSM and convolution for underwater image enhancement. We introduce a conv mamba layer (CML) as the foundation layer to combine the visual state space block (VSSB) with convolution. The convolution is used to capture local detailed features, while the VSSB is employed to capture long-range global features, which complement each other. Furthermore, considering underwater images suffer from severe and uneven degradation of spatial regions and color channels, we propose a Mamba Attention Fusion Module (MAFM), which fuses VSSB with an attention mechanism for better perception of channels and spatial regions. Extensive experiments on real-world underwater image datasets demonstrate the promising performance of our method in both objective metrics and subjective comparisons.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"31 ","pages":"2725-2729"},"PeriodicalIF":3.9000,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Signal Processing Letters","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10700679/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, state space models (SSM) with efficient design, i.e., Mamba, have shown great potential in modeling long-range dependencies with linear complexity. However, the pure SSM-based model yields sub-optimal underwater enhancement performance due to insufficient local details. Given the superiority of convolution in local perception, we propose a hybrid network, named UWMamba, which combines SSM and convolution for underwater image enhancement. We introduce a conv mamba layer (CML) as the foundation layer to combine the visual state space block (VSSB) with convolution. The convolution is used to capture local detailed features, while the VSSB is employed to capture long-range global features, which complement each other. Furthermore, considering underwater images suffer from severe and uneven degradation of spatial regions and color channels, we propose a Mamba Attention Fusion Module (MAFM), which fuses VSSB with an attention mechanism for better perception of channels and spatial regions. Extensive experiments on real-world underwater image datasets demonstrate the promising performance of our method in both objective metrics and subjective comparisons.
期刊介绍:
The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshop organized by the Signal Processing Society.