基于细节聚焦和偏振引导的水下图像多模态融合

IF 8 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-07-19 DOI:10.1016/j.engappai.2025.111677

Mingze Yao , Huibing Wang , Yudong Li , Wenzhe Liu , Xianping Fu

{"title":"基于细节聚焦和偏振引导的水下图像多模态融合","authors":"Mingze Yao , Huibing Wang , Yudong Li , Wenzhe Liu , Xianping Fu","doi":"10.1016/j.engappai.2025.111677","DOIUrl":null,"url":null,"abstract":"<div><div>Underwater optics imaging typically suffer from the impurities scattering and light absorption in dynamic and complex underwater environment, which significantly effect the clarity and visibility of images. Existing cutting-edge Underwater Image Enhancement (UIE) methods mostly focus on color correction and contrast enhancement neglect the textual and detail information of objects, leading to imbalance exposure and edge features missing. To overcome these problems, we propose a novel detail-focused and polarization guided multi-modality fusion network (DFPG-Net), for enhancing underwater images. Unlike the previous methods, we first construct a Detail-Focused Convolution (DFC) block for extracting features from underwater multimodal images, which integrates difference convolutions to capture prior and edge information. Meanwhile, polarization information is introduced with a Multi-scale Polarization Guided (MPG) fusion module, which intends to maintain and enhance the texture and details information from the degree of polarization information and angle of polarization information obtained from the polarized image. Additionally, a parallel progressive attention network is designed to explore and combine the valuable and discriminative information in feature learning stage. Extensive experiments on the constructed underwater dataset validate the effectiveness and superior performance of the proposed DFPG-Net, which against state-of-the-art methods in both machine evaluation metrics and visual perception.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111677"},"PeriodicalIF":8.0000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detail-focused and polarization-guided multi-modality fusion for underwater image clarity enhancing\",\"authors\":\"Mingze Yao , Huibing Wang , Yudong Li , Wenzhe Liu , Xianping Fu\",\"doi\":\"10.1016/j.engappai.2025.111677\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Underwater optics imaging typically suffer from the impurities scattering and light absorption in dynamic and complex underwater environment, which significantly effect the clarity and visibility of images. Existing cutting-edge Underwater Image Enhancement (UIE) methods mostly focus on color correction and contrast enhancement neglect the textual and detail information of objects, leading to imbalance exposure and edge features missing. To overcome these problems, we propose a novel detail-focused and polarization guided multi-modality fusion network (DFPG-Net), for enhancing underwater images. Unlike the previous methods, we first construct a Detail-Focused Convolution (DFC) block for extracting features from underwater multimodal images, which integrates difference convolutions to capture prior and edge information. Meanwhile, polarization information is introduced with a Multi-scale Polarization Guided (MPG) fusion module, which intends to maintain and enhance the texture and details information from the degree of polarization information and angle of polarization information obtained from the polarized image. Additionally, a parallel progressive attention network is designed to explore and combine the valuable and discriminative information in feature learning stage. Extensive experiments on the constructed underwater dataset validate the effectiveness and superior performance of the proposed DFPG-Net, which against state-of-the-art methods in both machine evaluation metrics and visual perception.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"159 \",\"pages\":\"Article 111677\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-07-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625016793\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625016793","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

水下光学成像在动态复杂的水下环境中容易受到杂质散射和光吸收的影响，严重影响图像的清晰度和可见度。现有的前沿水下图像增强（UIE）方法多侧重于色彩校正和对比度增强，忽略了物体的文本和细节信息，导致曝光不平衡和边缘特征缺失。为了克服这些问题，我们提出了一种新的细节聚焦和偏振引导的多模态融合网络（DFPG-Net），用于增强水下图像。与之前的方法不同，我们首先构建了一个细节聚焦卷积（DFC）块来提取水下多模态图像的特征，该块集成了差分卷积来捕获先验信息和边缘信息。同时，引入偏振信息，采用多尺度偏振制导（MPG）融合模块，从极化图像中获得的偏振度信息和偏振角度信息中保持和增强纹理和细节信息。此外，设计了并行渐进式注意网络，探索和结合特征学习阶段的有价值信息和判别信息。在构建的水下数据集上进行的大量实验验证了所提出的dfpga - net的有效性和优越性能，该方法在机器评估指标和视觉感知方面都优于最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Detail-focused and polarization-guided multi-modality fusion for underwater image clarity enhancing

Underwater optics imaging typically suffer from the impurities scattering and light absorption in dynamic and complex underwater environment, which significantly effect the clarity and visibility of images. Existing cutting-edge Underwater Image Enhancement (UIE) methods mostly focus on color correction and contrast enhancement neglect the textual and detail information of objects, leading to imbalance exposure and edge features missing. To overcome these problems, we propose a novel detail-focused and polarization guided multi-modality fusion network (DFPG-Net), for enhancing underwater images. Unlike the previous methods, we first construct a Detail-Focused Convolution (DFC) block for extracting features from underwater multimodal images, which integrates difference convolutions to capture prior and edge information. Meanwhile, polarization information is introduced with a Multi-scale Polarization Guided (MPG) fusion module, which intends to maintain and enhance the texture and details information from the degree of polarization information and angle of polarization information obtained from the polarized image. Additionally, a parallel progressive attention network is designed to explore and combine the valuable and discriminative information in feature learning stage. Extensive experiments on the constructed underwater dataset validate the effectiveness and superior performance of the proposed DFPG-Net, which against state-of-the-art methods in both machine evaluation metrics and visual perception.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.