Implementation of a WGAN-GP for Human Pose Transfer using a 3-channel pose representation

2021 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT) Pub Date : 2021-09-29 DOI:10.1109/3ICT53449.2021.9581386

Tamal Das, Saurav Sutradhar, Mrinmoy Das, Simantini Chakraborty, S. Deb

{"title":"Implementation of a WGAN-GP for Human Pose Transfer using a 3-channel pose representation","authors":"Tamal Das, Saurav Sutradhar, Mrinmoy Das, Simantini Chakraborty, S. Deb","doi":"10.1109/3ICT53449.2021.9581386","DOIUrl":null,"url":null,"abstract":"The computational problem of Human Pose Transfer (HPT) is addressed in this paper. HPT in recent days have become an emerging research topic which can be used in fields like fashion design, media production, animation, virtual reality. Given the image of a human subject and a target pose, the goal of HPT is to generate a new image of the human subject with the novel pose. That is, the pose of the target pose is transferred to the human subject. HPT has been carried out in two stages. In stage 1, a rough estimate is generated and in stage 2, the rough estimate is refined with a generative adversarial network. The novelty of this work is the way pose information is represented. Earlier methods used computationally expensive pose representations like 3D DensePose and 18-channel pose heatmaps. This work uses a 3-channel colour image of a stick figure to represent human pose. Different body parts are encoded with different colours. The convolutional neural networks will now have to recognize colours only, and since these colours encode body parts, eventually the network will also learn about the position of the body parts.","PeriodicalId":133021,"journal":{"name":"2021 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/3ICT53449.2021.9581386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The computational problem of Human Pose Transfer (HPT) is addressed in this paper. HPT in recent days have become an emerging research topic which can be used in fields like fashion design, media production, animation, virtual reality. Given the image of a human subject and a target pose, the goal of HPT is to generate a new image of the human subject with the novel pose. That is, the pose of the target pose is transferred to the human subject. HPT has been carried out in two stages. In stage 1, a rough estimate is generated and in stage 2, the rough estimate is refined with a generative adversarial network. The novelty of this work is the way pose information is represented. Earlier methods used computationally expensive pose representations like 3D DensePose and 18-channel pose heatmaps. This work uses a 3-channel colour image of a stick figure to represent human pose. Different body parts are encoded with different colours. The convolutional neural networks will now have to recognize colours only, and since these colours encode body parts, eventually the network will also learn about the position of the body parts.

查看原文本刊更多论文

基于3通道姿态表示的人体姿态传递WGAN-GP实现

研究了人体姿态转换的计算问题。近年来，HPT已经成为一个新兴的研究课题，可以应用于服装设计、媒体制作、动画、虚拟现实等领域。给定人体主体的图像和目标姿态，HPT的目标是生成具有新姿态的人体主体的新图像。即将目标姿态的姿态传递给人体主体。HPT分两个阶段进行。在阶段1中，生成粗略估计，在阶段2中，使用生成对抗网络对粗略估计进行细化。这项工作的新颖之处在于姿态信息的表示方式。早期的方法使用像3D DensePose和18通道姿态热图这样计算代价昂贵的姿态表示。这个作品使用了一个3通道的彩色图像的简笔画来代表人类的姿势。不同的身体部位用不同的颜色编码。卷积神经网络现在只能识别颜色，由于这些颜色编码身体部位，最终网络也将了解身体部位的位置。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT)

自引率

0.00%

发文量