Enhancing the Performance of Generative Adversarial Networks with Identity Blocks and Revised Loss Function to Improve Training Stability

IJCI. International Journal of Computers and Information Pub Date : 2023-09-01 DOI:10.21608/ijci.2023.200402.1101

Mohamed Salem, Mohamed Sakr, Sherif Eletriby

{"title":"Enhancing the Performance of Generative Adversarial Networks with Identity Blocks and Revised Loss Function to Improve Training Stability","authors":"Mohamed Salem, Mohamed Sakr, Sherif Eletriby","doi":"10.21608/ijci.2023.200402.1101","DOIUrl":null,"url":null,"abstract":"Generative adversarial networks (GANs) are a powerful deep learning model for synthesizing realistic images; however, they can be difficult to train and are prone to instability and mode collapse. This paper presents a modified deep learning model called Identity Generative Adversarial Network (IGAN) to address the challenges of training and instability faced by generative adversarial models in synthesizing realistic images. The IGAN model includes three modifications to improve the performance of DCGAN: a non-linear identity block to ease complex data fitting and reduce training time; a modified loss function with label smoothing to smooth the standard GAN loss function; and minibatch training to use other examples from the same minibatch as side information for better quality and variety of generated images. The effectiveness of IGAN was evaluated and compared with other state-of-the-art generative models using the inception score (IS) and Fréchet inception distance (FID) on CelebA and stacked MNIST datasets. The experiments demonstrated that IGAN outperformed the other models in terms of convergence speed, stability, and diversity of results. Specifically, in 200 epochs, IGAN achieved an IS of 13.6 and an FID of 46.2. Furthermore, the IGAN collapsed modes were compared with other generative models using a stacked MNIST dataset, showing the superiority of IGAN in producing all the modes while the other models failed to do so. These results demonstrate that the modifications implemented in IGAN can significantly enhance the performance of GANs in synthesizing realistic images, providing a more stable, high-quality, and diverse output.","PeriodicalId":137729,"journal":{"name":"IJCI. International Journal of Computers and Information","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IJCI. International Journal of Computers and Information","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21608/ijci.2023.200402.1101","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Generative adversarial networks (GANs) are a powerful deep learning model for synthesizing realistic images; however, they can be difficult to train and are prone to instability and mode collapse. This paper presents a modified deep learning model called Identity Generative Adversarial Network (IGAN) to address the challenges of training and instability faced by generative adversarial models in synthesizing realistic images. The IGAN model includes three modifications to improve the performance of DCGAN: a non-linear identity block to ease complex data fitting and reduce training time; a modified loss function with label smoothing to smooth the standard GAN loss function; and minibatch training to use other examples from the same minibatch as side information for better quality and variety of generated images. The effectiveness of IGAN was evaluated and compared with other state-of-the-art generative models using the inception score (IS) and Fréchet inception distance (FID) on CelebA and stacked MNIST datasets. The experiments demonstrated that IGAN outperformed the other models in terms of convergence speed, stability, and diversity of results. Specifically, in 200 epochs, IGAN achieved an IS of 13.6 and an FID of 46.2. Furthermore, the IGAN collapsed modes were compared with other generative models using a stacked MNIST dataset, showing the superiority of IGAN in producing all the modes while the other models failed to do so. These results demonstrate that the modifications implemented in IGAN can significantly enhance the performance of GANs in synthesizing realistic images, providing a more stable, high-quality, and diverse output.

查看原文本刊更多论文

利用身份块和修正损失函数增强生成对抗网络的性能以提高训练稳定性

生成对抗网络(GANs)是一种强大的深度学习模型，用于合成逼真的图像;然而，它们很难训练，而且容易出现不稳定和模式崩溃。本文提出了一种改进的深度学习模型，称为身份生成对抗网络(IGAN)，以解决生成对抗模型在合成真实图像时面临的训练和不稳定性挑战。IGAN模型包括三个改进以提高DCGAN的性能:非线性身份块以简化复杂的数据拟合并减少训练时间;一个带有标签平滑的修正损失函数来平滑标准GAN损失函数;以及minibatch训练，使用来自同一minibatch的其他示例作为侧信息，以获得更好的质量和各种生成的图像。在CelebA和堆叠的MNIST数据集上，使用初始分数(IS)和fr起始距离(FID)来评估IGAN的有效性，并与其他最先进的生成模型进行比较。实验表明，IGAN在收敛速度、稳定性和结果多样性方面优于其他模型。具体来说，在200个epoch中，IGAN实现了13.6的IS和46.2的FID。此外，使用堆叠的MNIST数据集将IGAN的折叠模式与其他生成模型进行了比较，结果表明IGAN在产生所有模式方面具有优势，而其他模型则无法产生所有模式。这些结果表明，在IGAN中实施的修改可以显着提高gan合成逼真图像的性能，提供更稳定，高质量和多样化的输出。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IJCI. International Journal of Computers and Information

自引率

0.00%

发文量