Abdullah Azeem, Waqar Riaz, Abubakar Siddique, Tahir Junaid Saifullah
... International Workshop on Pattern Recognition in NeuroImaging, vol. 415(14), pp. 115260I - 115260I-5. Published 2020-06-24. DOI: 10.1117/12.2574615
Human video synthesis using generative adversarial networks
In this work, a video synthesis model based on Generative Adversarial Networks (Human GAN) is proposed, whose objective is to generate photorealistic output by learning the mapping from an input source to an output video. While image-to-image generation is a well-studied problem, video synthesis remains comparatively unexplored: directly applying existing image-generation methods without accounting for temporal dynamics frequently produces temporally incoherent output of low visual quality. The proposed approach addresses this by carefully designing generators and discriminators combined with a spatio-temporal adversarial objective. Compared against strong baselines on public benchmarks, the proposed model generates temporally coherent videos with very few artifacts, and its results are more realistic on both quantitative and qualitative measures than existing baseline techniques.
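The paper does not publish an implementation, but the idea of a spatio-temporal adversarial objective is typically realized by combining a per-frame (spatial) discriminator term with a clip-level (temporal) discriminator term in the generator's loss. The NumPy sketch below is only an illustration under that assumption: `D_spatial`, `D_temporal`, and the weight `lambda_t` are hypothetical placeholders standing in for the learned networks, not the authors' actual model.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Hypothetical stand-in for a learned per-frame discriminator:
# scores one frame for photorealism.
def D_spatial(frame):
    return sigmoid(frame.mean())

# Hypothetical stand-in for a learned clip-level discriminator:
# scores a short clip for temporal coherence by penalizing
# large frame-to-frame changes.
def D_temporal(clip):
    diffs = np.diff(clip, axis=0)
    return sigmoid(1.0 - np.abs(diffs).mean())

def generator_adv_loss(fake_clip, lambda_t=1.0):
    """Non-saturating GAN loss for the generator, combining a
    per-frame spatial term with a clip-level temporal term."""
    spatial = -np.mean([np.log(D_spatial(f) + 1e-8) for f in fake_clip])
    temporal = -np.log(D_temporal(fake_clip) + 1e-8)
    return spatial + lambda_t * temporal

# Toy "generated" clip: 8 frames of 64x64 RGB noise.
clip = np.random.rand(8, 64, 64, 3)
loss = generator_adv_loss(clip)
print(loss)
```

In a real training loop both discriminators would be neural networks updated adversarially against the generator; the weighting `lambda_t` trades off per-frame realism against temporal smoothness.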