基于多重审美感知GAN的美学图像合成

Proceedings of the 2023 12th International Conference on Software and Computer Applications Pub Date : 2023-02-23 DOI:10.1145/3587828.3587852

Yaya Setiyadi, J. Santoso, K. Surendro

{"title":"基于多重审美感知GAN的美学图像合成","authors":"Yaya Setiyadi, J. Santoso, K. Surendro","doi":"10.1145/3587828.3587852","DOIUrl":null,"url":null,"abstract":"The use of synthesized images in fields that prioritize visual appearance, particularly the field of art, places a high value on aesthetics, with the system's output images required to adhere to several aesthetic rules. There are two approaches to aesthetic image synthesis with architecture based on GAN. The first strategy is to modify the loss function on the GAN so that, in addition to the loss from the GAN architecture, aesthetic loss and content/semantic loss are also calculated and the total loss is the sum of the three loss calculations. However, the outcomes still fall short of the expected natural image appearance. The second strategy involves modifying the GAN architecture by adding a new layer to the GAN generator and discriminator network, while the loss function calculation remains unchanged. The results of this second approach to image synthesis have not been optimized in terms of producing meaningful images across multiple semantic classes. This study proposes a method for increasing the aesthetic value of the synthesized image by modifying the two approaches and employing the multiple-aesthetic-aware GAN method. The proposed method takes conditional semantic information and conditional aesthetic information into account not only in the GAN architecture, but also in the loss function value calculation. The proposed method is the result of ongoing research and will be evaluated using the Inception Score (IS), the Frechet Inception Distance (FID), and an aesthetic value metric.","PeriodicalId":340917,"journal":{"name":"Proceedings of the 2023 12th International Conference on Software and Computer Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Aesthetic Image Synthesis Using Multiple-Aesthetic-Aware GAN\",\"authors\":\"Yaya Setiyadi, J. Santoso, K. Surendro\",\"doi\":\"10.1145/3587828.3587852\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of synthesized images in fields that prioritize visual appearance, particularly the field of art, places a high value on aesthetics, with the system's output images required to adhere to several aesthetic rules. There are two approaches to aesthetic image synthesis with architecture based on GAN. The first strategy is to modify the loss function on the GAN so that, in addition to the loss from the GAN architecture, aesthetic loss and content/semantic loss are also calculated and the total loss is the sum of the three loss calculations. However, the outcomes still fall short of the expected natural image appearance. The second strategy involves modifying the GAN architecture by adding a new layer to the GAN generator and discriminator network, while the loss function calculation remains unchanged. The results of this second approach to image synthesis have not been optimized in terms of producing meaningful images across multiple semantic classes. This study proposes a method for increasing the aesthetic value of the synthesized image by modifying the two approaches and employing the multiple-aesthetic-aware GAN method. The proposed method takes conditional semantic information and conditional aesthetic information into account not only in the GAN architecture, but also in the loss function value calculation. The proposed method is the result of ongoing research and will be evaluated using the Inception Score (IS), the Frechet Inception Distance (FID), and an aesthetic value metric.\",\"PeriodicalId\":340917,\"journal\":{\"name\":\"Proceedings of the 2023 12th International Conference on Software and Computer Applications\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 12th International Conference on Software and Computer Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3587828.3587852\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 12th International Conference on Software and Computer Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3587828.3587852","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在优先考虑视觉外观的领域，特别是艺术领域，使用合成图像对美学有很高的价值，系统的输出图像需要遵守几个美学规则。基于GAN的建筑美学图像合成有两种方法。第一种策略是修改GAN上的损失函数，这样，除了GAN架构的损失外，还计算美学损失和内容/语义损失，总损失是三种损失计算的总和。然而，结果仍然没有达到预期的自然图像外观。第二种策略是在损失函数计算不变的情况下，通过在GAN生成器和鉴别器网络中添加新层来修改GAN结构。就跨多个语义类生成有意义的图像而言，第二种图像合成方法的结果尚未得到优化。本研究提出了一种通过修改这两种方法并采用多重审美感知GAN方法来提高合成图像美学价值的方法。该方法不仅在GAN结构中考虑了条件语义信息和条件美学信息，而且在损失函数值计算中也考虑了条件语义信息和条件美学信息。提出的方法是正在进行的研究的结果，并将使用盗梦评分(is)， Frechet盗梦距离(FID)和美学价值度量来评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Aesthetic Image Synthesis Using Multiple-Aesthetic-Aware GAN

The use of synthesized images in fields that prioritize visual appearance, particularly the field of art, places a high value on aesthetics, with the system's output images required to adhere to several aesthetic rules. There are two approaches to aesthetic image synthesis with architecture based on GAN. The first strategy is to modify the loss function on the GAN so that, in addition to the loss from the GAN architecture, aesthetic loss and content/semantic loss are also calculated and the total loss is the sum of the three loss calculations. However, the outcomes still fall short of the expected natural image appearance. The second strategy involves modifying the GAN architecture by adding a new layer to the GAN generator and discriminator network, while the loss function calculation remains unchanged. The results of this second approach to image synthesis have not been optimized in terms of producing meaningful images across multiple semantic classes. This study proposes a method for increasing the aesthetic value of the synthesized image by modifying the two approaches and employing the multiple-aesthetic-aware GAN method. The proposed method takes conditional semantic information and conditional aesthetic information into account not only in the GAN architecture, but also in the loss function value calculation. The proposed method is the result of ongoing research and will be evaluated using the Inception Score (IS), the Frechet Inception Distance (FID), and an aesthetic value metric.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2023 12th International Conference on Software and Computer Applications

自引率

0.00%

发文量