基于全变异损失的人脸超分辨SRGAN

2020 7th NAFOSTED Conference on Information and Computer Science (NICS) Pub Date : 2020-11-26 DOI:10.1109/NICS51282.2020.9335836

Hai Nguyen-Truong, Khoa Nguyen, San Cao

{"title":"基于全变异损失的人脸超分辨SRGAN","authors":"Hai Nguyen-Truong, Khoa Nguyen, San Cao","doi":"10.1109/NICS51282.2020.9335836","DOIUrl":null,"url":null,"abstract":"Facial image super-resolution is a crucial preprocessing for facial image analysis, face recognition, and image-based 3D face reconstruction. Convolutional neural networks were earlier used to produce high-resolution images that train quicker and shown excellent performance by learning mapping relation using pairs of low-resolution and high-resolution images. However, in some cases, they are incapable of recovering finer details and often generate blurry images. In this paper, we evaluate a method of applying Generative adversarial networks in generating realistic super-resolution images from low-resolution ones by using three typical losses for super-resolution: Content Loss, Adversarial Loss, Perceptual Loss, and proposed to use Total Variation Loss. We try different pre-trained famous Convolutional neural networks models (VGG19, FaceNet, and EfficientNet) in Perceptual Loss to have a general view with different backbones. Our network gains 32.67 of Peak signal-to-noise ratio (PSNR) and 0.89 of Structural similarity index (SSIM) in 100 random samples from the Flickr-Faces-HQ dataset.","PeriodicalId":308944,"journal":{"name":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"456 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"SRGAN with Total Variation Loss in Face Super-Resolution\",\"authors\":\"Hai Nguyen-Truong, Khoa Nguyen, San Cao\",\"doi\":\"10.1109/NICS51282.2020.9335836\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Facial image super-resolution is a crucial preprocessing for facial image analysis, face recognition, and image-based 3D face reconstruction. Convolutional neural networks were earlier used to produce high-resolution images that train quicker and shown excellent performance by learning mapping relation using pairs of low-resolution and high-resolution images. However, in some cases, they are incapable of recovering finer details and often generate blurry images. In this paper, we evaluate a method of applying Generative adversarial networks in generating realistic super-resolution images from low-resolution ones by using three typical losses for super-resolution: Content Loss, Adversarial Loss, Perceptual Loss, and proposed to use Total Variation Loss. We try different pre-trained famous Convolutional neural networks models (VGG19, FaceNet, and EfficientNet) in Perceptual Loss to have a general view with different backbones. Our network gains 32.67 of Peak signal-to-noise ratio (PSNR) and 0.89 of Structural similarity index (SSIM) in 100 random samples from the Flickr-Faces-HQ dataset.\",\"PeriodicalId\":308944,\"journal\":{\"name\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"volume\":\"456 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NICS51282.2020.9335836\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS51282.2020.9335836","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

人脸图像超分辨率是人脸图像分析、人脸识别和基于图像的三维人脸重建的关键预处理技术。卷积神经网络早期用于生成高分辨率图像，通过使用低分辨率和高分辨率图像对学习映射关系，训练速度更快，表现出出色的性能。然而，在某些情况下，他们不能恢复更精细的细节，经常产生模糊的图像。在本文中，我们通过使用三种典型的超分辨率损失:内容损失、对抗损失、感知损失，评估了一种应用生成对抗网络从低分辨率图像生成逼真超分辨率图像的方法，并提出了使用总变化损失的方法。我们尝试了不同的预训练著名的卷积神经网络模型(VGG19, FaceNet和EfficientNet)，以获得不同主干的总体视图。我们的网络在来自Flickr-Faces-HQ数据集的100个随机样本中获得了32.67的峰值信噪比(PSNR)和0.89的结构相似指数(SSIM)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

SRGAN with Total Variation Loss in Face Super-Resolution

Facial image super-resolution is a crucial preprocessing for facial image analysis, face recognition, and image-based 3D face reconstruction. Convolutional neural networks were earlier used to produce high-resolution images that train quicker and shown excellent performance by learning mapping relation using pairs of low-resolution and high-resolution images. However, in some cases, they are incapable of recovering finer details and often generate blurry images. In this paper, we evaluate a method of applying Generative adversarial networks in generating realistic super-resolution images from low-resolution ones by using three typical losses for super-resolution: Content Loss, Adversarial Loss, Perceptual Loss, and proposed to use Total Variation Loss. We try different pre-trained famous Convolutional neural networks models (VGG19, FaceNet, and EfficientNet) in Perceptual Loss to have a general view with different backbones. Our network gains 32.67 of Peak signal-to-noise ratio (PSNR) and 0.89 of Structural similarity index (SSIM) in 100 random samples from the Flickr-Faces-HQ dataset.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 7th NAFOSTED Conference on Information and Computer Science (NICS)

自引率

0.00%

发文量