Image enhancement with art design: a visual feature approach with a CNN-transformer fusion model.

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science Pub Date : 2024-11-13 eCollection Date: 2024-01-01 DOI:10.7717/peerj-cs.2417

Ming Xu, Jinwei Cui, Xiaoyu Ma, Zhiyi Zou, Zhisheng Xin, Muhammad Bilal

{"title":"Image enhancement with art design: a visual feature approach with a CNN-transformer fusion model.","authors":"Ming Xu, Jinwei Cui, Xiaoyu Ma, Zhiyi Zou, Zhisheng Xin, Muhammad Bilal","doi":"10.7717/peerj-cs.2417","DOIUrl":null,"url":null,"abstract":"<p><p>Graphic design, as a product of the burgeoning new media era, has seen its users' requirements for images continuously evolve. However, external factors such as light and noise often cause graphic design images to become distorted during acquisition. To enhance the definition of these images, this paper introduces a novel image enhancement model based on visual features. Initially, a histogram equalization (HE) algorithm is applied to enhance the graphic design images. Subsequently, image feature extraction is performed using a dual-flow network comprising convolutional neural network (CNN) and Transformer architectures. The CNN employs a residual dense block (RDB) to embed spatial local structure information with varying receptive fields. An improved attention mechanism module, attention feature fusion (AFF), is then introduced to integrate the image features extracted from the dual-flow network. Finally, through image perception quality guided adversarial learning, the model adjusts the initial enhanced image's color and recovers more details. Experimental results demonstrate that the proposed algorithm model achieves enhancement effects exceeding 90% on two large image datasets, which represents a 5%-10% improvement over other models. Furthermore, the algorithm exhibits superior performance in terms of peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) image quality evaluation metrics. Our findings indicate that the fusion model significantly enhances image quality, thereby advancing the field of graphic design and showcasing its potential in cultural and creative product design.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2417"},"PeriodicalIF":3.5000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623052/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.7717/peerj-cs.2417","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Graphic design, as a product of the burgeoning new media era, has seen its users' requirements for images continuously evolve. However, external factors such as light and noise often cause graphic design images to become distorted during acquisition. To enhance the definition of these images, this paper introduces a novel image enhancement model based on visual features. Initially, a histogram equalization (HE) algorithm is applied to enhance the graphic design images. Subsequently, image feature extraction is performed using a dual-flow network comprising convolutional neural network (CNN) and Transformer architectures. The CNN employs a residual dense block (RDB) to embed spatial local structure information with varying receptive fields. An improved attention mechanism module, attention feature fusion (AFF), is then introduced to integrate the image features extracted from the dual-flow network. Finally, through image perception quality guided adversarial learning, the model adjusts the initial enhanced image's color and recovers more details. Experimental results demonstrate that the proposed algorithm model achieves enhancement effects exceeding 90% on two large image datasets, which represents a 5%-10% improvement over other models. Furthermore, the algorithm exhibits superior performance in terms of peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) image quality evaluation metrics. Our findings indicate that the fusion model significantly enhances image quality, thereby advancing the field of graphic design and showcasing its potential in cultural and creative product design.

查看原文本刊更多论文

艺术设计的图像增强：CNN-transformer融合模型的视觉特征方法。

平面设计作为蓬勃发展的新媒体时代的产物，其用户对图像的需求也在不断演变。然而，光线、噪音等外部因素往往会导致平面设计图像在获取过程中发生畸变。为了增强这些图像的清晰度，本文引入了一种新的基于视觉特征的图像增强模型。首先，采用直方图均衡化（HE）算法对平面设计图像进行增强。随后，使用包含卷积神经网络（CNN）和Transformer架构的双流网络进行图像特征提取。CNN采用残差密集块（RDB）来嵌入具有不同接受域的空间局部结构信息。然后引入改进的注意机制模块——注意特征融合（AFF），对双流网络提取的图像特征进行融合。最后，通过图像感知质量引导的对抗学习，调整初始增强图像的颜色，恢复更多细节。实验结果表明，该算法模型在两个大型图像数据集上的增强效果超过90%，比其他模型提高了5% ~ 10%。此外，该算法在峰值信噪比（PSNR）和结构相似指数度量（SSIM）图像质量评价指标方面表现出优异的性能。我们的研究结果表明，融合模型显著提高了图像质量，从而推动了平面设计领域的发展，并展示了其在文化创意产品设计中的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

PeerJ Computer Science Computer Science-General Computer Science

CiteScore

6.10

自引率

5.30%

发文量

332

审稿时长

10 weeks

期刊介绍： PeerJ Computer Science is the new open access journal covering all subject areas in computer science, with the backing of a prestigious advisory board and more than 300 academic editors.