{"title":"联合利用RGB和偏振模式的水下图像增强","authors":"Yushan Wang, Jiqing Zhang, Zixuan Wan, Xinbo Zhang, Yafei Wang, Xianping Fu","doi":"10.1016/j.displa.2025.103143","DOIUrl":null,"url":null,"abstract":"<div><div>Underwater images often suffer from severe quality degradation due to light absorption and scattering in the water medium. Most existing underwater image enhancement (UIE) methods rely solely on RGB inputs, which lack the capability to distinguish between scattered and reflected light, thus limiting their performance. In contrast, polarization imaging offers the potential to disentangle physical components and preserve surface structures by capturing the polarization state of light. This paper thus proposes a novel RGB-polarization multimodal fusion framework for UIE tasks. Specifically, we first present a Polarization Feature Extractor (PFE) to capture direction-dependent polarization responses via multi-dimensional interaction modeling. In addition, a cross-modal fusion module is introduced to effectively and adaptively combine meaningful cues from both RGB and polarization domains. The effectiveness is enforced by the channel attention mechanism and the spatial attention mechanism to improve feature representation; the adaptiveness is facilitated by a specifically designed weighting scheme that balances the contributions of the two domains. Extensive experiments show that the proposed approach outperforms state-of-the-art underwater image enhancement methods in terms of both full-reference and non-reference metrics. Furthermore, the contribution of each key component is validated through comprehensive ablation study.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"90 ","pages":"Article 103143"},"PeriodicalIF":3.4000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Underwater image enhancement by jointly exploiting RGB and polarization modalities\",\"authors\":\"Yushan Wang, Jiqing Zhang, Zixuan Wan, Xinbo Zhang, Yafei Wang, Xianping Fu\",\"doi\":\"10.1016/j.displa.2025.103143\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Underwater images often suffer from severe quality degradation due to light absorption and scattering in the water medium. Most existing underwater image enhancement (UIE) methods rely solely on RGB inputs, which lack the capability to distinguish between scattered and reflected light, thus limiting their performance. In contrast, polarization imaging offers the potential to disentangle physical components and preserve surface structures by capturing the polarization state of light. This paper thus proposes a novel RGB-polarization multimodal fusion framework for UIE tasks. Specifically, we first present a Polarization Feature Extractor (PFE) to capture direction-dependent polarization responses via multi-dimensional interaction modeling. In addition, a cross-modal fusion module is introduced to effectively and adaptively combine meaningful cues from both RGB and polarization domains. The effectiveness is enforced by the channel attention mechanism and the spatial attention mechanism to improve feature representation; the adaptiveness is facilitated by a specifically designed weighting scheme that balances the contributions of the two domains. Extensive experiments show that the proposed approach outperforms state-of-the-art underwater image enhancement methods in terms of both full-reference and non-reference metrics. Furthermore, the contribution of each key component is validated through comprehensive ablation study.</div></div>\",\"PeriodicalId\":50570,\"journal\":{\"name\":\"Displays\",\"volume\":\"90 \",\"pages\":\"Article 103143\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Displays\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0141938225001805\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225001805","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
Underwater image enhancement by jointly exploiting RGB and polarization modalities
Underwater images often suffer from severe quality degradation due to light absorption and scattering in the water medium. Most existing underwater image enhancement (UIE) methods rely solely on RGB inputs, which lack the capability to distinguish between scattered and reflected light, thus limiting their performance. In contrast, polarization imaging offers the potential to disentangle physical components and preserve surface structures by capturing the polarization state of light. This paper thus proposes a novel RGB-polarization multimodal fusion framework for UIE tasks. Specifically, we first present a Polarization Feature Extractor (PFE) to capture direction-dependent polarization responses via multi-dimensional interaction modeling. In addition, a cross-modal fusion module is introduced to effectively and adaptively combine meaningful cues from both RGB and polarization domains. The effectiveness is enforced by the channel attention mechanism and the spatial attention mechanism to improve feature representation; the adaptiveness is facilitated by a specifically designed weighting scheme that balances the contributions of the two domains. Extensive experiments show that the proposed approach outperforms state-of-the-art underwater image enhancement methods in terms of both full-reference and non-reference metrics. Furthermore, the contribution of each key component is validated through comprehensive ablation study.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.