Aesthetic Image Synthesis Using Multiple-Aesthetic-Aware GAN

Yaya Setiyadi, J. Santoso, K. Surendro
{"title":"Aesthetic Image Synthesis Using Multiple-Aesthetic-Aware GAN","authors":"Yaya Setiyadi, J. Santoso, K. Surendro","doi":"10.1145/3587828.3587852","DOIUrl":null,"url":null,"abstract":"The use of synthesized images in fields that prioritize visual appearance, particularly the field of art, places a high value on aesthetics, with the system's output images required to adhere to several aesthetic rules. There are two approaches to aesthetic image synthesis with architecture based on GAN. The first strategy is to modify the loss function on the GAN so that, in addition to the loss from the GAN architecture, aesthetic loss and content/semantic loss are also calculated and the total loss is the sum of the three loss calculations. However, the outcomes still fall short of the expected natural image appearance. The second strategy involves modifying the GAN architecture by adding a new layer to the GAN generator and discriminator network, while the loss function calculation remains unchanged. The results of this second approach to image synthesis have not been optimized in terms of producing meaningful images across multiple semantic classes. This study proposes a method for increasing the aesthetic value of the synthesized image by modifying the two approaches and employing the multiple-aesthetic-aware GAN method. The proposed method takes conditional semantic information and conditional aesthetic information into account not only in the GAN architecture, but also in the loss function value calculation. The proposed method is the result of ongoing research and will be evaluated using the Inception Score (IS), the Frechet Inception Distance (FID), and an aesthetic value metric.","PeriodicalId":340917,"journal":{"name":"Proceedings of the 2023 12th International Conference on Software and Computer Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 12th International Conference on Software and Computer Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3587828.3587852","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The use of synthesized images in fields that prioritize visual appearance, particularly the field of art, places a high value on aesthetics, with the system's output images required to adhere to several aesthetic rules. There are two approaches to aesthetic image synthesis with architecture based on GAN. The first strategy is to modify the loss function on the GAN so that, in addition to the loss from the GAN architecture, aesthetic loss and content/semantic loss are also calculated and the total loss is the sum of the three loss calculations. However, the outcomes still fall short of the expected natural image appearance. The second strategy involves modifying the GAN architecture by adding a new layer to the GAN generator and discriminator network, while the loss function calculation remains unchanged. The results of this second approach to image synthesis have not been optimized in terms of producing meaningful images across multiple semantic classes. This study proposes a method for increasing the aesthetic value of the synthesized image by modifying the two approaches and employing the multiple-aesthetic-aware GAN method. The proposed method takes conditional semantic information and conditional aesthetic information into account not only in the GAN architecture, but also in the loss function value calculation. The proposed method is the result of ongoing research and will be evaluated using the Inception Score (IS), the Frechet Inception Distance (FID), and an aesthetic value metric.
基于多重审美感知GAN的美学图像合成
在优先考虑视觉外观的领域,特别是艺术领域,使用合成图像对美学有很高的价值,系统的输出图像需要遵守几个美学规则。基于GAN的建筑美学图像合成有两种方法。第一种策略是修改GAN上的损失函数,这样,除了GAN架构的损失外,还计算美学损失和内容/语义损失,总损失是三种损失计算的总和。然而,结果仍然没有达到预期的自然图像外观。第二种策略是在损失函数计算不变的情况下,通过在GAN生成器和鉴别器网络中添加新层来修改GAN结构。就跨多个语义类生成有意义的图像而言,第二种图像合成方法的结果尚未得到优化。本研究提出了一种通过修改这两种方法并采用多重审美感知GAN方法来提高合成图像美学价值的方法。该方法不仅在GAN结构中考虑了条件语义信息和条件美学信息,而且在损失函数值计算中也考虑了条件语义信息和条件美学信息。提出的方法是正在进行的研究的结果,并将使用盗梦评分(is), Frechet盗梦距离(FID)和美学价值度量来评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信