{"title":"Generative adversarial networks with data augmentation and multiple penalty areas for image synthesis","authors":"Li Chen, H. Chan","doi":"10.34028/iajit/20/3/15","DOIUrl":null,"url":null,"abstract":"The quality of generated images is one of the significant criteria for Generative Adversarial Networks (GANs) evaluation in image synthesis research. Previous researches proposed a great many tricks to modify the model structure or loss functions. However, seldom of them consider the effect of combination of data augmentation and multiple penalty areas on image quality improvement. This research introduces a GAN architecture based on data augmentation, in order to make the model fulfill 1-Lipschitz constraints, it proposes to consider these additional data included into the penalty areas which can improve ability of discriminator and generator. With the help of these techniques, compared with previous model Deep Convolutional GAN (DCGAN) and Wasserstein GAN with gradient penalty (WGAN-GP), the model proposed in this research can get lower Frechet Inception Distance score (FID) score 2.973 and 2.941 on celebA and LSUN towers at 64×64 resolution respectively which proves that this model can produce high visual quality results.","PeriodicalId":13624,"journal":{"name":"Int. Arab J. Inf. Technol.","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. Arab J. Inf. Technol.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34028/iajit/20/3/15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The quality of generated images is one of the significant criteria for Generative Adversarial Networks (GANs) evaluation in image synthesis research. Previous researches proposed a great many tricks to modify the model structure or loss functions. However, seldom of them consider the effect of combination of data augmentation and multiple penalty areas on image quality improvement. This research introduces a GAN architecture based on data augmentation, in order to make the model fulfill 1-Lipschitz constraints, it proposes to consider these additional data included into the penalty areas which can improve ability of discriminator and generator. With the help of these techniques, compared with previous model Deep Convolutional GAN (DCGAN) and Wasserstein GAN with gradient penalty (WGAN-GP), the model proposed in this research can get lower Frechet Inception Distance score (FID) score 2.973 and 2.941 on celebA and LSUN towers at 64×64 resolution respectively which proves that this model can produce high visual quality results.
在图像合成研究中,生成图像的质量是评价生成对抗网络(GANs)的重要标准之一。以往的研究提出了许多修改模型结构或损失函数的技巧。但是,很少考虑数据增强和多惩罚区域相结合对图像质量提高的影响。本研究引入了一种基于数据增强的GAN结构,为了使模型满足1-Lipschitz约束,提出将这些附加数据纳入罚域,以提高鉴别器和生成器的能力。在这些技术的帮助下,与之前的模型Deep Convolutional GAN (DCGAN)和Wasserstein GAN With gradient penalty (WGAN-GP)相比,本研究提出的模型在celebA和LSUN塔上分别获得了较低的Frechet Inception Distance score (FID),分别为2.973和2.941,分辨率分别为64×64,证明了该模型可以产生较高的视觉质量结果。