Hybrid attention triple branch transformer net for underwater image enhancement

IF 3.3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Pattern Recognition Letters Pub Date : 2026-03-01 Epub Date: 2026-01-13 DOI:10.1016/j.patrec.2026.01.014
Shaohui Jin , Guangpeng Li , Ziqin Xu , Yanxin Zhang , Zhengguang Qin , Hao Liu , Mingliang Xu
{"title":"Hybrid attention triple branch transformer net for underwater image enhancement","authors":"Shaohui Jin ,&nbsp;Guangpeng Li ,&nbsp;Ziqin Xu ,&nbsp;Yanxin Zhang ,&nbsp;Zhengguang Qin ,&nbsp;Hao Liu ,&nbsp;Mingliang Xu","doi":"10.1016/j.patrec.2026.01.014","DOIUrl":null,"url":null,"abstract":"<div><div>In real underwater scenes, the complexity of the environment leads to issues like light attenuation, scattering, and color distortion, resulting in reduced image quality and loss of details. To resolve these problems, we propose a hybrid attention triple branch transformer network (HATBformer). The backbone network adopts a three-layer encoder-decoder structure, making full use of the spatial and channel feature information of underwater images, and improving the network’s focus on color information and spatial regions with higher levels of attenuation. The detail enhancement branch incorporates the coordinate information perception mechanism and feature integration strategy through three consecutive feature enhancement blocks, aiming to deeply repair and optimize image details and effectively improve the image reconstruction quality. In addition, we established an underwater image dataset NLOS-TW that contains different optical thicknesses, including rich targets and various underwater scenes. Extensive experiments demonstrate that our method significantly enhances image quality and surpasses current state-of-the-art methods both qualitatively and quantitatively.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"201 ","pages":"Pages 95-102"},"PeriodicalIF":3.3000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016786552600019X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/13 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

In real underwater scenes, the complexity of the environment leads to issues like light attenuation, scattering, and color distortion, resulting in reduced image quality and loss of details. To resolve these problems, we propose a hybrid attention triple branch transformer network (HATBformer). The backbone network adopts a three-layer encoder-decoder structure, making full use of the spatial and channel feature information of underwater images, and improving the network’s focus on color information and spatial regions with higher levels of attenuation. The detail enhancement branch incorporates the coordinate information perception mechanism and feature integration strategy through three consecutive feature enhancement blocks, aiming to deeply repair and optimize image details and effectively improve the image reconstruction quality. In addition, we established an underwater image dataset NLOS-TW that contains different optical thicknesses, including rich targets and various underwater scenes. Extensive experiments demonstrate that our method significantly enhances image quality and surpasses current state-of-the-art methods both qualitatively and quantitatively.
水下图像增强的混合关注三支路变压器网
在真实的水下场景中,环境的复杂性导致光线衰减、散射、色彩失真等问题,导致图像质量下降和细节丢失。为了解决这些问题,我们提出了一种混合关注三支路变压器网络(HATBformer)。骨干网采用三层编码器-解码器结构,充分利用了水下图像的空间和信道特征信息,提高了网络对颜色信息和衰减程度较高的空间区域的关注。细节增强分支通过三个连续的特征增强块,结合坐标信息感知机制和特征集成策略,对图像细节进行深度修复和优化,有效提高图像重建质量。此外,我们建立了包含不同光学厚度的水下图像数据集NLOS-TW,包括丰富的目标和各种水下场景。大量的实验表明,我们的方法显著提高了图像质量,在定性和定量上都超过了目前最先进的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Pattern Recognition Letters
Pattern Recognition Letters 工程技术-计算机:人工智能
CiteScore
12.40
自引率
5.90%
发文量
287
审稿时长
9.1 months
期刊介绍: Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition. Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书