Shaohui Jin , Guangpeng Li , Ziqin Xu , Yanxin Zhang , Zhengguang Qin , Hao Liu , Mingliang Xu
{"title":"Hybrid attention triple branch transformer net for underwater image enhancement","authors":"Shaohui Jin , Guangpeng Li , Ziqin Xu , Yanxin Zhang , Zhengguang Qin , Hao Liu , Mingliang Xu","doi":"10.1016/j.patrec.2026.01.014","DOIUrl":null,"url":null,"abstract":"<div><div>In real underwater scenes, the complexity of the environment leads to issues like light attenuation, scattering, and color distortion, resulting in reduced image quality and loss of details. To resolve these problems, we propose a hybrid attention triple branch transformer network (HATBformer). The backbone network adopts a three-layer encoder-decoder structure, making full use of the spatial and channel feature information of underwater images, and improving the network’s focus on color information and spatial regions with higher levels of attenuation. The detail enhancement branch incorporates the coordinate information perception mechanism and feature integration strategy through three consecutive feature enhancement blocks, aiming to deeply repair and optimize image details and effectively improve the image reconstruction quality. In addition, we established an underwater image dataset NLOS-TW that contains different optical thicknesses, including rich targets and various underwater scenes. Extensive experiments demonstrate that our method significantly enhances image quality and surpasses current state-of-the-art methods both qualitatively and quantitatively.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"201 ","pages":"Pages 95-102"},"PeriodicalIF":3.3000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016786552600019X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/13 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In real underwater scenes, the complexity of the environment leads to issues like light attenuation, scattering, and color distortion, resulting in reduced image quality and loss of details. To resolve these problems, we propose a hybrid attention triple branch transformer network (HATBformer). The backbone network adopts a three-layer encoder-decoder structure, making full use of the spatial and channel feature information of underwater images, and improving the network’s focus on color information and spatial regions with higher levels of attenuation. The detail enhancement branch incorporates the coordinate information perception mechanism and feature integration strategy through three consecutive feature enhancement blocks, aiming to deeply repair and optimize image details and effectively improve the image reconstruction quality. In addition, we established an underwater image dataset NLOS-TW that contains different optical thicknesses, including rich targets and various underwater scenes. Extensive experiments demonstrate that our method significantly enhances image quality and surpasses current state-of-the-art methods both qualitatively and quantitatively.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.