Mingze Yao , Huibing Wang , Yudong Li , Wenzhe Liu , Xianping Fu
{"title":"基于细节聚焦和偏振引导的水下图像多模态融合","authors":"Mingze Yao , Huibing Wang , Yudong Li , Wenzhe Liu , Xianping Fu","doi":"10.1016/j.engappai.2025.111677","DOIUrl":null,"url":null,"abstract":"<div><div>Underwater optics imaging typically suffer from the impurities scattering and light absorption in dynamic and complex underwater environment, which significantly effect the clarity and visibility of images. Existing cutting-edge Underwater Image Enhancement (UIE) methods mostly focus on color correction and contrast enhancement neglect the textual and detail information of objects, leading to imbalance exposure and edge features missing. To overcome these problems, we propose a novel detail-focused and polarization guided multi-modality fusion network (DFPG-Net), for enhancing underwater images. Unlike the previous methods, we first construct a Detail-Focused Convolution (DFC) block for extracting features from underwater multimodal images, which integrates difference convolutions to capture prior and edge information. Meanwhile, polarization information is introduced with a Multi-scale Polarization Guided (MPG) fusion module, which intends to maintain and enhance the texture and details information from the degree of polarization information and angle of polarization information obtained from the polarized image. Additionally, a parallel progressive attention network is designed to explore and combine the valuable and discriminative information in feature learning stage. Extensive experiments on the constructed underwater dataset validate the effectiveness and superior performance of the proposed DFPG-Net, which against state-of-the-art methods in both machine evaluation metrics and visual perception.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111677"},"PeriodicalIF":8.0000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detail-focused and polarization-guided multi-modality fusion for underwater image clarity enhancing\",\"authors\":\"Mingze Yao , Huibing Wang , Yudong Li , Wenzhe Liu , Xianping Fu\",\"doi\":\"10.1016/j.engappai.2025.111677\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Underwater optics imaging typically suffer from the impurities scattering and light absorption in dynamic and complex underwater environment, which significantly effect the clarity and visibility of images. Existing cutting-edge Underwater Image Enhancement (UIE) methods mostly focus on color correction and contrast enhancement neglect the textual and detail information of objects, leading to imbalance exposure and edge features missing. To overcome these problems, we propose a novel detail-focused and polarization guided multi-modality fusion network (DFPG-Net), for enhancing underwater images. Unlike the previous methods, we first construct a Detail-Focused Convolution (DFC) block for extracting features from underwater multimodal images, which integrates difference convolutions to capture prior and edge information. Meanwhile, polarization information is introduced with a Multi-scale Polarization Guided (MPG) fusion module, which intends to maintain and enhance the texture and details information from the degree of polarization information and angle of polarization information obtained from the polarized image. Additionally, a parallel progressive attention network is designed to explore and combine the valuable and discriminative information in feature learning stage. Extensive experiments on the constructed underwater dataset validate the effectiveness and superior performance of the proposed DFPG-Net, which against state-of-the-art methods in both machine evaluation metrics and visual perception.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"159 \",\"pages\":\"Article 111677\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-07-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625016793\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625016793","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Detail-focused and polarization-guided multi-modality fusion for underwater image clarity enhancing
Underwater optics imaging typically suffer from the impurities scattering and light absorption in dynamic and complex underwater environment, which significantly effect the clarity and visibility of images. Existing cutting-edge Underwater Image Enhancement (UIE) methods mostly focus on color correction and contrast enhancement neglect the textual and detail information of objects, leading to imbalance exposure and edge features missing. To overcome these problems, we propose a novel detail-focused and polarization guided multi-modality fusion network (DFPG-Net), for enhancing underwater images. Unlike the previous methods, we first construct a Detail-Focused Convolution (DFC) block for extracting features from underwater multimodal images, which integrates difference convolutions to capture prior and edge information. Meanwhile, polarization information is introduced with a Multi-scale Polarization Guided (MPG) fusion module, which intends to maintain and enhance the texture and details information from the degree of polarization information and angle of polarization information obtained from the polarized image. Additionally, a parallel progressive attention network is designed to explore and combine the valuable and discriminative information in feature learning stage. Extensive experiments on the constructed underwater dataset validate the effectiveness and superior performance of the proposed DFPG-Net, which against state-of-the-art methods in both machine evaluation metrics and visual perception.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.