A study of enhanced visual perception of marine biology images based on diffusion-GAN

IF 5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Feifan Yao, Huiying Zhang, Yifei Gong, Qinghua Zhang, Pan Xiao
{"title":"A study of enhanced visual perception of marine biology images based on diffusion-GAN","authors":"Feifan Yao, Huiying Zhang, Yifei Gong, Qinghua Zhang, Pan Xiao","doi":"10.1007/s40747-025-01832-w","DOIUrl":null,"url":null,"abstract":"<p>Aiming at the influence of factors such as the special optical characteristics of water bodies on the perceptual quality of generated images, this paper proposes the DifSG2-CCL model for reducing the special optical characteristics of water bodies and the DPL-SG2 model for introducing perceptual loss. Combining the ideas of cyclic consistency and style migration, this paper builds the Underwater Cycle Consistency Loss (U-CCL) module. The DifSG2-CCL model is based on the method of image reconstruction, which converts the underwater image into the style of the land image to reduce the influence of the water body factors. VGG16 is introduced as a perceptual loss into the DPL-SG2 to enhance the visual perception of the image by feature extraction with different layers and tonal weighting. Furthermore, in addition to the already disclosed SA dataset, a T dataset with a resolution of 256 × 256 in 9.366k sheets is provided in this paper. The experimental results show that DifSG2-CCL and DPL-SG2 can effectively enhance the perceptual quality of the images. The unique underwater image generation of DifSG2-CCL produces excellent results in qualitative analysis and reduces its FID value to 8.97. DPL-SG2 is more outstanding in the training of T dataset, and its FID value is reduced to 5.39. Therefore, the U-CCL and VGG16 can be applied as an innovative approach to enhance visual perception of underwater images. The experimental code with pre-trained models will be published shortly at https://github.com/yff0428/DPL-SG2/tree/main.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"71 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01832-w","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Aiming at the influence of factors such as the special optical characteristics of water bodies on the perceptual quality of generated images, this paper proposes the DifSG2-CCL model for reducing the special optical characteristics of water bodies and the DPL-SG2 model for introducing perceptual loss. Combining the ideas of cyclic consistency and style migration, this paper builds the Underwater Cycle Consistency Loss (U-CCL) module. The DifSG2-CCL model is based on the method of image reconstruction, which converts the underwater image into the style of the land image to reduce the influence of the water body factors. VGG16 is introduced as a perceptual loss into the DPL-SG2 to enhance the visual perception of the image by feature extraction with different layers and tonal weighting. Furthermore, in addition to the already disclosed SA dataset, a T dataset with a resolution of 256 × 256 in 9.366k sheets is provided in this paper. The experimental results show that DifSG2-CCL and DPL-SG2 can effectively enhance the perceptual quality of the images. The unique underwater image generation of DifSG2-CCL produces excellent results in qualitative analysis and reduces its FID value to 8.97. DPL-SG2 is more outstanding in the training of T dataset, and its FID value is reduced to 5.39. Therefore, the U-CCL and VGG16 can be applied as an innovative approach to enhance visual perception of underwater images. The experimental code with pre-trained models will be published shortly at https://github.com/yff0428/DPL-SG2/tree/main.

针对水体特殊光学特性等因素对生成图像感知质量的影响,本文提出了减少水体特殊光学特性的 DifSG2-CCL 模型和引入感知损失的 DPL-SG2 模型。结合循环一致性和风格迁移的思想,本文建立了水下循环一致性损失(U-CCL)模块。DifSG2-CCL 模型基于图像重构的方法,将水下图像转换成陆地图像的样式,以减少水体因素的影响。在 DPL-SG2 中引入了 VGG16 作为感知损失,通过不同层次的特征提取和色调加权来增强图像的视觉感知。此外,除已公开的 SA 数据集外,本文还提供了分辨率为 256 × 256、9.366k 张的 T 数据集。实验结果表明,DifSG2-CCL 和 DPL-SG2 能有效提高图像的感知质量。DifSG2-CCL 独有的水下图像生成功能在定性分析中产生了极佳的效果,并将其 FID 值降至 8.97。DPL-SG2 在 T 数据集的训练中表现更为突出,其 FID 值降至 5.39。因此,U-CCL 和 VGG16 可以作为一种创新方法用于增强水下图像的视觉感知。预训练模型的实验代码将于近期发布在 https://github.com/yff0428/DPL-SG2/tree/main 网站上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Complex & Intelligent Systems
Complex & Intelligent Systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-
CiteScore
9.60
自引率
10.30%
发文量
297
期刊介绍: Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信