潜在空间维数对合成人脸图像质量的影响

IF 0.7 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of Communications Software and Systems Pub Date : 2021-01-01 DOI:10.24138/JCOMSS-2021-0035

Ivana Marin, S. Gotovac, M. Russo, Dunja Božić-Štulić

{"title":"潜在空间维数对合成人脸图像质量的影响","authors":"Ivana Marin, S. Gotovac, M. Russo, Dunja Božić-Štulić","doi":"10.24138/JCOMSS-2021-0035","DOIUrl":null,"url":null,"abstract":"In recent years Generative Adversarial Networks (GANs) have achieved remarkable results in the task of realistic image synthesis. Despite their continued success and advances, there still lacks a thorough understanding of how precisely GANs map random latent vectors to realistic-looking images and how the priors set on the latent space affect the learned mapping. In this work, we analyze the effect of the chosen latent dimension on the final quality of synthesized images of human faces and learned data representations. We show that GANs can generate images plausibly even with latent dimensions significantly smaller than the standard dimensions like 100 or 512. Although one might expect that larger latent dimensions encourage the generation of more diverse and enhanced quality images, we show that an increase of latent dimension after some point does not lead to visible improvements in perceptual image quality nor in quantitative estimates of its generalization abilities.","PeriodicalId":38910,"journal":{"name":"Journal of Communications Software and Systems","volume":"1 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"The Effect of Latent Space Dimension on the Quality of Synthesized Human Face Images\",\"authors\":\"Ivana Marin, S. Gotovac, M. Russo, Dunja Božić-Štulić\",\"doi\":\"10.24138/JCOMSS-2021-0035\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years Generative Adversarial Networks (GANs) have achieved remarkable results in the task of realistic image synthesis. Despite their continued success and advances, there still lacks a thorough understanding of how precisely GANs map random latent vectors to realistic-looking images and how the priors set on the latent space affect the learned mapping. In this work, we analyze the effect of the chosen latent dimension on the final quality of synthesized images of human faces and learned data representations. We show that GANs can generate images plausibly even with latent dimensions significantly smaller than the standard dimensions like 100 or 512. Although one might expect that larger latent dimensions encourage the generation of more diverse and enhanced quality images, we show that an increase of latent dimension after some point does not lead to visible improvements in perceptual image quality nor in quantitative estimates of its generalization abilities.\",\"PeriodicalId\":38910,\"journal\":{\"name\":\"Journal of Communications Software and Systems\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Communications Software and Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.24138/JCOMSS-2021-0035\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Communications Software and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24138/JCOMSS-2021-0035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 6

摘要

近年来，生成对抗网络(GANs)在真实感图像合成任务中取得了显著的成果。尽管它们不断取得成功和进步，但对于gan如何精确地将随机潜在向量映射到逼真的图像，以及在潜在空间上设置的先验如何影响学习映射，仍然缺乏透彻的理解。在这项工作中，我们分析了所选择的潜在维度对人脸合成图像和学习数据表示的最终质量的影响。我们表明，即使潜在维数明显小于标准维数(如100或512)，gan也可以合理地生成图像。尽管人们可能会期望更大的潜在维数会鼓励生成更多样化和更高质量的图像，但我们表明，在某个点之后增加潜在维数不会导致感知图像质量的明显改善，也不会导致其泛化能力的定量估计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

The Effect of Latent Space Dimension on the Quality of Synthesized Human Face Images

In recent years Generative Adversarial Networks (GANs) have achieved remarkable results in the task of realistic image synthesis. Despite their continued success and advances, there still lacks a thorough understanding of how precisely GANs map random latent vectors to realistic-looking images and how the priors set on the latent space affect the learned mapping. In this work, we analyze the effect of the chosen latent dimension on the final quality of synthesized images of human faces and learned data representations. We show that GANs can generate images plausibly even with latent dimensions significantly smaller than the standard dimensions like 100 or 512. Although one might expect that larger latent dimensions encourage the generation of more diverse and enhanced quality images, we show that an increase of latent dimension after some point does not lead to visible improvements in perceptual image quality nor in quantitative estimates of its generalization abilities.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Communications Software and Systems Engineering-Electrical and Electronic Engineering

CiteScore

2.00

自引率

14.30%

发文量

审稿时长

8 weeks