基于CNN分类器的生成对抗网络预测图像风格迁移得分

IF 7.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Soft Computing Pub Date : 2025-05-19 DOI:10.1016/j.asoc.2025.113303

L.Mary Gladence , Yi-Wei Lai , Fu-Ti Lee , Mu-Yen Chen , Hsin-Te Wu

{"title":"基于CNN分类器的生成对抗网络预测图像风格迁移得分","authors":"L.Mary Gladence , Yi-Wei Lai , Fu-Ti Lee , Mu-Yen Chen , Hsin-Te Wu","doi":"10.1016/j.asoc.2025.113303","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, image style transfer has emerged as an increasingly popular research theme in the field of computer vision. Image style transfer seeks to convert the style of an image or mix it with other styles to produce an image with stylistic features not found in the original image. In the past, image style conversion was mainly implemented using feature conversion or filters, processes which require a significant degree of manual design, thus limiting these approaches to performing at most a single style conversion. However, with the development of deep learning technologies, an increasing number of studies have begun to break through these research challenges, achieving significant progress. The study proposes generating image data attribute tags from the classification dataset, constructing a GAN architecture for multi-style conversion tasks, and proposes a Classifier Style Scores GAN (CSS-GAN) model. First, a CNN classifier is used to train on the classification dataset. Once its stability is verified, the classifier is used to make predictions and its output layer features are extracted as attribute labels. These labels are subjected to different types of pre-processing to assess the performance difference between smoothed labels and binary classification labels. Finally, the resulting labels are used to train a multi-style transfer GAN. Experiments are conducted using a facial attribute dataset to compare the labeling method with the proposed model architecture. The results indicate that using classifier-predicted features and applying feature smoothing as attribute labels for training the GAN can effectively enhance the quality and stability of images generated for the style transfer task. Additionally, this approach allows for better control over the degree of transformation and improves the overall performance of style transfer.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"178 ","pages":"Article 113303"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generative adversarial network based on CNN classifier predicted scores for image style transfer\",\"authors\":\"L.Mary Gladence , Yi-Wei Lai , Fu-Ti Lee , Mu-Yen Chen , Hsin-Te Wu\",\"doi\":\"10.1016/j.asoc.2025.113303\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In recent years, image style transfer has emerged as an increasingly popular research theme in the field of computer vision. Image style transfer seeks to convert the style of an image or mix it with other styles to produce an image with stylistic features not found in the original image. In the past, image style conversion was mainly implemented using feature conversion or filters, processes which require a significant degree of manual design, thus limiting these approaches to performing at most a single style conversion. However, with the development of deep learning technologies, an increasing number of studies have begun to break through these research challenges, achieving significant progress. The study proposes generating image data attribute tags from the classification dataset, constructing a GAN architecture for multi-style conversion tasks, and proposes a Classifier Style Scores GAN (CSS-GAN) model. First, a CNN classifier is used to train on the classification dataset. Once its stability is verified, the classifier is used to make predictions and its output layer features are extracted as attribute labels. These labels are subjected to different types of pre-processing to assess the performance difference between smoothed labels and binary classification labels. Finally, the resulting labels are used to train a multi-style transfer GAN. Experiments are conducted using a facial attribute dataset to compare the labeling method with the proposed model architecture. The results indicate that using classifier-predicted features and applying feature smoothing as attribute labels for training the GAN can effectively enhance the quality and stability of images generated for the style transfer task. Additionally, this approach allows for better control over the degree of transformation and improves the overall performance of style transfer.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"178 \",\"pages\":\"Article 113303\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625006143\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625006143","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

近年来，图像风格迁移已成为计算机视觉领域一个日益热门的研究主题。图像风格转换是指将图像的风格进行转换，或将其与其他风格混合，以产生具有原始图像中没有的风格特征的图像。过去，图像样式转换主要使用特征转换或过滤器来实现，这些过程需要很大程度的手工设计，从而限制了这些方法最多只能执行单个样式转换。然而，随着深度学习技术的发展，越来越多的研究开始突破这些研究挑战，取得了重大进展。该研究提出了从分类数据集中生成图像数据属性标签，构建多风格转换任务的GAN架构，并提出了分类器风格评分GAN （CSS-GAN）模型。首先，使用CNN分类器对分类数据集进行训练。一旦其稳定性得到验证，分类器就被用来进行预测，并将其输出层特征提取为属性标签。对这些标签进行不同类型的预处理，以评估平滑标签和二元分类标签之间的性能差异。最后，将得到的标签用于训练多样式转移GAN。使用人脸属性数据集进行了实验，比较了所提出的模型架构和标记方法。结果表明，使用分类器预测特征并将特征平滑作为属性标签进行GAN训练，可以有效地提高风格迁移任务生成图像的质量和稳定性。此外，这种方法可以更好地控制转换的程度，并提高风格转换的整体性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Generative adversarial network based on CNN classifier predicted scores for image style transfer

In recent years, image style transfer has emerged as an increasingly popular research theme in the field of computer vision. Image style transfer seeks to convert the style of an image or mix it with other styles to produce an image with stylistic features not found in the original image. In the past, image style conversion was mainly implemented using feature conversion or filters, processes which require a significant degree of manual design, thus limiting these approaches to performing at most a single style conversion. However, with the development of deep learning technologies, an increasing number of studies have begun to break through these research challenges, achieving significant progress. The study proposes generating image data attribute tags from the classification dataset, constructing a GAN architecture for multi-style conversion tasks, and proposes a Classifier Style Scores GAN (CSS-GAN) model. First, a CNN classifier is used to train on the classification dataset. Once its stability is verified, the classifier is used to make predictions and its output layer features are extracted as attribute labels. These labels are subjected to different types of pre-processing to assess the performance difference between smoothed labels and binary classification labels. Finally, the resulting labels are used to train a multi-style transfer GAN. Experiments are conducted using a facial attribute dataset to compare the labeling method with the proposed model architecture. The results indicate that using classifier-predicted features and applying feature smoothing as attribute labels for training the GAN can effectively enhance the quality and stability of images generated for the style transfer task. Additionally, this approach allows for better control over the degree of transformation and improves the overall performance of style transfer.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.