{"title":"基于全变异损失的人脸超分辨SRGAN","authors":"Hai Nguyen-Truong, Khoa Nguyen, San Cao","doi":"10.1109/NICS51282.2020.9335836","DOIUrl":null,"url":null,"abstract":"Facial image super-resolution is a crucial preprocessing for facial image analysis, face recognition, and image-based 3D face reconstruction. Convolutional neural networks were earlier used to produce high-resolution images that train quicker and shown excellent performance by learning mapping relation using pairs of low-resolution and high-resolution images. However, in some cases, they are incapable of recovering finer details and often generate blurry images. In this paper, we evaluate a method of applying Generative adversarial networks in generating realistic super-resolution images from low-resolution ones by using three typical losses for super-resolution: Content Loss, Adversarial Loss, Perceptual Loss, and proposed to use Total Variation Loss. We try different pre-trained famous Convolutional neural networks models (VGG19, FaceNet, and EfficientNet) in Perceptual Loss to have a general view with different backbones. Our network gains 32.67 of Peak signal-to-noise ratio (PSNR) and 0.89 of Structural similarity index (SSIM) in 100 random samples from the Flickr-Faces-HQ dataset.","PeriodicalId":308944,"journal":{"name":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"456 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"SRGAN with Total Variation Loss in Face Super-Resolution\",\"authors\":\"Hai Nguyen-Truong, Khoa Nguyen, San Cao\",\"doi\":\"10.1109/NICS51282.2020.9335836\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Facial image super-resolution is a crucial preprocessing for facial image analysis, face recognition, and image-based 3D face reconstruction. Convolutional neural networks were earlier used to produce high-resolution images that train quicker and shown excellent performance by learning mapping relation using pairs of low-resolution and high-resolution images. However, in some cases, they are incapable of recovering finer details and often generate blurry images. In this paper, we evaluate a method of applying Generative adversarial networks in generating realistic super-resolution images from low-resolution ones by using three typical losses for super-resolution: Content Loss, Adversarial Loss, Perceptual Loss, and proposed to use Total Variation Loss. We try different pre-trained famous Convolutional neural networks models (VGG19, FaceNet, and EfficientNet) in Perceptual Loss to have a general view with different backbones. Our network gains 32.67 of Peak signal-to-noise ratio (PSNR) and 0.89 of Structural similarity index (SSIM) in 100 random samples from the Flickr-Faces-HQ dataset.\",\"PeriodicalId\":308944,\"journal\":{\"name\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"volume\":\"456 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NICS51282.2020.9335836\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS51282.2020.9335836","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
SRGAN with Total Variation Loss in Face Super-Resolution
Facial image super-resolution is a crucial preprocessing for facial image analysis, face recognition, and image-based 3D face reconstruction. Convolutional neural networks were earlier used to produce high-resolution images that train quicker and shown excellent performance by learning mapping relation using pairs of low-resolution and high-resolution images. However, in some cases, they are incapable of recovering finer details and often generate blurry images. In this paper, we evaluate a method of applying Generative adversarial networks in generating realistic super-resolution images from low-resolution ones by using three typical losses for super-resolution: Content Loss, Adversarial Loss, Perceptual Loss, and proposed to use Total Variation Loss. We try different pre-trained famous Convolutional neural networks models (VGG19, FaceNet, and EfficientNet) in Perceptual Loss to have a general view with different backbones. Our network gains 32.67 of Peak signal-to-noise ratio (PSNR) and 0.89 of Structural similarity index (SSIM) in 100 random samples from the Flickr-Faces-HQ dataset.