L.Mary Gladence , Yi-Wei Lai , Fu-Ti Lee , Mu-Yen Chen , Hsin-Te Wu
{"title":"基于CNN分类器的生成对抗网络预测图像风格迁移得分","authors":"L.Mary Gladence , Yi-Wei Lai , Fu-Ti Lee , Mu-Yen Chen , Hsin-Te Wu","doi":"10.1016/j.asoc.2025.113303","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, image style transfer has emerged as an increasingly popular research theme in the field of computer vision. Image style transfer seeks to convert the style of an image or mix it with other styles to produce an image with stylistic features not found in the original image. In the past, image style conversion was mainly implemented using feature conversion or filters, processes which require a significant degree of manual design, thus limiting these approaches to performing at most a single style conversion. However, with the development of deep learning technologies, an increasing number of studies have begun to break through these research challenges, achieving significant progress. The study proposes generating image data attribute tags from the classification dataset, constructing a GAN architecture for multi-style conversion tasks, and proposes a Classifier Style Scores GAN (CSS-GAN) model. First, a CNN classifier is used to train on the classification dataset. Once its stability is verified, the classifier is used to make predictions and its output layer features are extracted as attribute labels. These labels are subjected to different types of pre-processing to assess the performance difference between smoothed labels and binary classification labels. Finally, the resulting labels are used to train a multi-style transfer GAN. Experiments are conducted using a facial attribute dataset to compare the labeling method with the proposed model architecture. The results indicate that using classifier-predicted features and applying feature smoothing as attribute labels for training the GAN can effectively enhance the quality and stability of images generated for the style transfer task. Additionally, this approach allows for better control over the degree of transformation and improves the overall performance of style transfer.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"178 ","pages":"Article 113303"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generative adversarial network based on CNN classifier predicted scores for image style transfer\",\"authors\":\"L.Mary Gladence , Yi-Wei Lai , Fu-Ti Lee , Mu-Yen Chen , Hsin-Te Wu\",\"doi\":\"10.1016/j.asoc.2025.113303\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In recent years, image style transfer has emerged as an increasingly popular research theme in the field of computer vision. Image style transfer seeks to convert the style of an image or mix it with other styles to produce an image with stylistic features not found in the original image. In the past, image style conversion was mainly implemented using feature conversion or filters, processes which require a significant degree of manual design, thus limiting these approaches to performing at most a single style conversion. However, with the development of deep learning technologies, an increasing number of studies have begun to break through these research challenges, achieving significant progress. The study proposes generating image data attribute tags from the classification dataset, constructing a GAN architecture for multi-style conversion tasks, and proposes a Classifier Style Scores GAN (CSS-GAN) model. First, a CNN classifier is used to train on the classification dataset. Once its stability is verified, the classifier is used to make predictions and its output layer features are extracted as attribute labels. These labels are subjected to different types of pre-processing to assess the performance difference between smoothed labels and binary classification labels. Finally, the resulting labels are used to train a multi-style transfer GAN. Experiments are conducted using a facial attribute dataset to compare the labeling method with the proposed model architecture. The results indicate that using classifier-predicted features and applying feature smoothing as attribute labels for training the GAN can effectively enhance the quality and stability of images generated for the style transfer task. Additionally, this approach allows for better control over the degree of transformation and improves the overall performance of style transfer.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"178 \",\"pages\":\"Article 113303\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625006143\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625006143","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Generative adversarial network based on CNN classifier predicted scores for image style transfer
In recent years, image style transfer has emerged as an increasingly popular research theme in the field of computer vision. Image style transfer seeks to convert the style of an image or mix it with other styles to produce an image with stylistic features not found in the original image. In the past, image style conversion was mainly implemented using feature conversion or filters, processes which require a significant degree of manual design, thus limiting these approaches to performing at most a single style conversion. However, with the development of deep learning technologies, an increasing number of studies have begun to break through these research challenges, achieving significant progress. The study proposes generating image data attribute tags from the classification dataset, constructing a GAN architecture for multi-style conversion tasks, and proposes a Classifier Style Scores GAN (CSS-GAN) model. First, a CNN classifier is used to train on the classification dataset. Once its stability is verified, the classifier is used to make predictions and its output layer features are extracted as attribute labels. These labels are subjected to different types of pre-processing to assess the performance difference between smoothed labels and binary classification labels. Finally, the resulting labels are used to train a multi-style transfer GAN. Experiments are conducted using a facial attribute dataset to compare the labeling method with the proposed model architecture. The results indicate that using classifier-predicted features and applying feature smoothing as attribute labels for training the GAN can effectively enhance the quality and stability of images generated for the style transfer task. Additionally, this approach allows for better control over the degree of transformation and improves the overall performance of style transfer.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.