Feifan Yao, Huiying Zhang, Yifei Gong, Qinghua Zhang, Pan Xiao
{"title":"A study of enhanced visual perception of marine biology images based on diffusion-GAN","authors":"Feifan Yao, Huiying Zhang, Yifei Gong, Qinghua Zhang, Pan Xiao","doi":"10.1007/s40747-025-01832-w","DOIUrl":null,"url":null,"abstract":"<p>Aiming at the influence of factors such as the special optical characteristics of water bodies on the perceptual quality of generated images, this paper proposes the DifSG2-CCL model for reducing the special optical characteristics of water bodies and the DPL-SG2 model for introducing perceptual loss. Combining the ideas of cyclic consistency and style migration, this paper builds the Underwater Cycle Consistency Loss (U-CCL) module. The DifSG2-CCL model is based on the method of image reconstruction, which converts the underwater image into the style of the land image to reduce the influence of the water body factors. VGG16 is introduced as a perceptual loss into the DPL-SG2 to enhance the visual perception of the image by feature extraction with different layers and tonal weighting. Furthermore, in addition to the already disclosed SA dataset, a T dataset with a resolution of 256 × 256 in 9.366k sheets is provided in this paper. The experimental results show that DifSG2-CCL and DPL-SG2 can effectively enhance the perceptual quality of the images. The unique underwater image generation of DifSG2-CCL produces excellent results in qualitative analysis and reduces its FID value to 8.97. DPL-SG2 is more outstanding in the training of T dataset, and its FID value is reduced to 5.39. Therefore, the U-CCL and VGG16 can be applied as an innovative approach to enhance visual perception of underwater images. The experimental code with pre-trained models will be published shortly at https://github.com/yff0428/DPL-SG2/tree/main.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"71 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01832-w","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Aiming at the influence of factors such as the special optical characteristics of water bodies on the perceptual quality of generated images, this paper proposes the DifSG2-CCL model for reducing the special optical characteristics of water bodies and the DPL-SG2 model for introducing perceptual loss. Combining the ideas of cyclic consistency and style migration, this paper builds the Underwater Cycle Consistency Loss (U-CCL) module. The DifSG2-CCL model is based on the method of image reconstruction, which converts the underwater image into the style of the land image to reduce the influence of the water body factors. VGG16 is introduced as a perceptual loss into the DPL-SG2 to enhance the visual perception of the image by feature extraction with different layers and tonal weighting. Furthermore, in addition to the already disclosed SA dataset, a T dataset with a resolution of 256 × 256 in 9.366k sheets is provided in this paper. The experimental results show that DifSG2-CCL and DPL-SG2 can effectively enhance the perceptual quality of the images. The unique underwater image generation of DifSG2-CCL produces excellent results in qualitative analysis and reduces its FID value to 8.97. DPL-SG2 is more outstanding in the training of T dataset, and its FID value is reduced to 5.39. Therefore, the U-CCL and VGG16 can be applied as an innovative approach to enhance visual perception of underwater images. The experimental code with pre-trained models will be published shortly at https://github.com/yff0428/DPL-SG2/tree/main.
期刊介绍:
Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.