Generation of Human Images with Clothing using Advanced Conditional Generative Adversarial Networks

Sheela Raju Kurupathi, Pramod Murthy, D. Stricker
DOI: 10.5220/0009832200300041 (https://doi.org/10.5220/0009832200300041)
Published: 2020-01-01, pages 30-41
Journal (as listed by the indexing site): News. Phi Delta Epsilon, volume "40 1"
Citations: 2

Abstract

Generating a person together with accurate pose and clothing details is one of the main challenges in human-image generation, and it remains difficult because of challenging backgrounds and appearance variance. Recently, deep learning models such as Stacked Hourglass networks, Variational Autoencoders (VAEs), and Generative Adversarial Networks (GANs) have been applied to this problem, but qualitatively they still do not generalize well to real-world human-image generation. Our main goal is to use Spectral Normalization (SN) when training a GAN so that it synthesizes the human image with accurate pose and appearance details. In this paper, we investigate how Conditional GANs with SN can synthesize a new image of a target person given a source image of that person and the desired target (novel) pose. The model represents human poses with 2D keypoints. We also use an adversarial hinge loss and present an ablation study. The proposed model variants produce promising results on both the Market-1501 and DeepFashion datasets. We support our claims by benchmarking the proposed model against recent state-of-the-art models. Finally, we show how SN influences the process of human-image synthesis.
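The abstract names two training ingredients, the adversarial hinge loss and Spectral Normalization, without giving formulas. The sketch below illustrates the commonly used forms of both in plain Python; it is not the authors' implementation. The function names, the power-iteration count, and the toy weight matrix are assumptions made for illustration, and a real model would apply these inside a deep learning framework to each discriminator layer.

```python
import math
import random


def d_hinge_loss(real_scores, fake_scores):
    """Discriminator hinge loss: mean(max(0, 1 - D(x))) + mean(max(0, 1 + D(G(z))))."""
    real_term = sum(max(0.0, 1.0 - s) for s in real_scores) / len(real_scores)
    fake_term = sum(max(0.0, 1.0 + s) for s in fake_scores) / len(fake_scores)
    return real_term + fake_term


def g_hinge_loss(fake_scores):
    """Generator hinge loss: -mean(D(G(z)))."""
    return -sum(fake_scores) / len(fake_scores)


def _matvec(W, v):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def _normalize(v):
    # Scale a vector to unit Euclidean length (left unchanged if it is all zeros).
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def spectral_normalize(W, n_iters=50, seed=0):
    """Divide W by a power-iteration estimate of its largest singular value.

    Assumes W is a non-zero matrix given as a list of equal-length rows.
    """
    rng = random.Random(seed)
    Wt = [list(col) for col in zip(*W)]  # transpose of W
    u = _normalize([rng.gauss(0.0, 1.0) for _ in W])
    for _ in range(n_iters):
        v = _normalize(_matvec(Wt, u))
        u = _normalize(_matvec(W, v))
    sigma = sum(ui * wi for ui, wi in zip(u, _matvec(W, v)))
    return [[w / sigma for w in row] for row in W], sigma


if __name__ == "__main__":
    # Hypothetical discriminator scores for two real and two generated images.
    print(d_hinge_loss([2.0, 0.5], [-2.0, 0.0]))
    print(g_hinge_loss([1.0, -3.0]))
    # Toy weight matrix whose largest singular value is 3.
    print(spectral_normalize([[3.0, 0.0], [0.0, 1.0]]))
```

Dividing each weight matrix by its largest singular value bounds how strongly that layer can stretch its input, which is the usual motivation for applying SN to a GAN discriminator.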