Real-time deblurring network for face AR applications

Proceedings of the 19th International Conference on Content-based Multimedia Indexing Pub Date : 2022-09-14 DOI:10.1145/3549555.3549577

Juhwan Lee, Jonghan Lee, S. Yoo

{"title":"Real-time deblurring network for face AR applications","authors":"Juhwan Lee, Jonghan Lee, S. Yoo","doi":"10.1145/3549555.3549577","DOIUrl":null,"url":null,"abstract":"Deblurring is a problem that has been studied for a long time. Extant works have primarily focused on deblurring real-world images. However, face images are different from real-world images. Because face images have fewer textures and weaker edges than real-world images, the deblurring of real-world images focuses on restoring the overall texture of the image; however, restoring the particular face structure (e.g., eyes, nose, and ears) is essential for face images. Recently, a convolutional neural network(CNN)-based deblurring network has been proposed. There are various types of CNN-based deblurring networks. Recently, multiscale architecture has been widely used; however, these types of networks need large amounts of resources. Further, because of the multitude of parameters, it requires a significant amount of time for inference. In this study, we developed a end-to-end network for face image deblurring, wherein novel CNN-based feature attention (FA) blocks are adopted, and a low inference time is achieved. Moreover, discrete Fourier transform (DFT) is employed for high-quality deblurring. FA blocks combine channel attention layer and pixel attention layer for feature extraction. The spectrum obtained using DFT is used as a loss function by comparing the ground truth image with the deblurring image. Experimental results show that the ours network is comparable to other deblurring networks in terms of performance as indicated by the PSNR, SSIM. Moreover we also demonstrated performance improvement by measuring the mean Intersection over Union (mIoU) of the deblurred image using a face-segmentation network.","PeriodicalId":191591,"journal":{"name":"Proceedings of the 19th International Conference on Content-based Multimedia Indexing","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 19th International Conference on Content-based Multimedia Indexing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3549555.3549577","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Deblurring is a problem that has been studied for a long time. Extant works have primarily focused on deblurring real-world images. However, face images are different from real-world images. Because face images have fewer textures and weaker edges than real-world images, the deblurring of real-world images focuses on restoring the overall texture of the image; however, restoring the particular face structure (e.g., eyes, nose, and ears) is essential for face images. Recently, a convolutional neural network(CNN)-based deblurring network has been proposed. There are various types of CNN-based deblurring networks. Recently, multiscale architecture has been widely used; however, these types of networks need large amounts of resources. Further, because of the multitude of parameters, it requires a significant amount of time for inference. In this study, we developed a end-to-end network for face image deblurring, wherein novel CNN-based feature attention (FA) blocks are adopted, and a low inference time is achieved. Moreover, discrete Fourier transform (DFT) is employed for high-quality deblurring. FA blocks combine channel attention layer and pixel attention layer for feature extraction. The spectrum obtained using DFT is used as a loss function by comparing the ground truth image with the deblurring image. Experimental results show that the ours network is comparable to other deblurring networks in terms of performance as indicated by the PSNR, SSIM. Moreover we also demonstrated performance improvement by measuring the mean Intersection over Union (mIoU) of the deblurred image using a face-segmentation network.

查看原文本刊更多论文

面部增强现实应用的实时去模糊网络

去模糊是一个研究了很长时间的问题。现存的作品主要集中在消除现实世界的图像模糊。然而，人脸图像不同于现实世界的图像。由于人脸图像的纹理比真实图像少，边缘较弱，因此真实图像的去模糊主要是恢复图像的整体纹理;然而，恢复特定的面部结构(例如，眼睛，鼻子和耳朵)对于人脸图像至关重要。最近，人们提出了一种基于卷积神经网络(CNN)的去模糊网络。基于cnn的去模糊网络有多种类型。近年来，多尺度体系结构得到了广泛的应用;然而，这些类型的网络需要大量的资源。此外，由于参数众多，它需要大量的时间进行推理。在这项研究中，我们开发了一个端到端的人脸图像去模糊网络，其中采用了新颖的基于cnn的特征注意(FA)块，实现了较低的推理时间。此外，采用离散傅里叶变换(DFT)实现高质量的去模糊。FA块结合通道注意层和像素注意层进行特征提取。利用DFT获得的频谱作为损失函数，将真地图像与去模糊图像进行比较。实验结果表明，我们的网络在性能上与其他去模糊网络相当，如PSNR, SSIM。此外，我们还通过使用人脸分割网络测量去模糊图像的平均交联(mIoU)来展示性能改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 19th International Conference on Content-based Multimedia Indexing

自引率

0.00%

发文量