LBPNet: Inserting Local Binary Patterns into Neural Networks to Enhance Manipulation Invariance of Fake Face Detection

2021 International Conference on Digital Society and Intelligent Systems (DSInS) Pub Date : 2021-12-03 DOI:10.1109/dsins54396.2021.9670608

Sida Chen

{"title":"LBPNet: Inserting Local Binary Patterns into Neural Networks to Enhance Manipulation Invariance of Fake Face Detection","authors":"Sida Chen","doi":"10.1109/dsins54396.2021.9670608","DOIUrl":null,"url":null,"abstract":"Fake face detection is an essential yet under-explored area, because faces generated by modern Generative Adversarial Networks (GANs) are virtually indiscernible by human observers. A recent work, GramNet[1], proposed using global texture information as heuristic because the features that previous approaches rely on are largely lost after image corruption and are not generalizable to different GANs. However, our theoretical reasoning and empirical studies both lead to the realization that their GLCM descriptor, as a global texture descriptor, is still not robust enough in all cases of image manipulation, because it doesn't take enough pixels into account, making it distortion-prone. Statistical analyses show that LBP is more generalizable to different GANs, and also reaches high consistency in outputs after image manipulation. Motivated by this finding, we implemented a Convolutional Neural Network with a ResNet backbone that uses LBP to enhance its global texture perception, effectively describing the texture at various semantic levels in an image with improved robustness. We conducted experiments with our model on images generated from StyleGAN and StyleGAN2, as well as images manipulated by different filters, and showed that our model reaches better and more consistent performance during image manipulation or in cross-domain settings, especially when images are subjected to Gaussian noise, in which we reached a performance increase from 82% to 90%. We open-sourced our code at https://github.com/Josh-Cena/lbpnet.","PeriodicalId":243724,"journal":{"name":"2021 International Conference on Digital Society and Intelligent Systems (DSInS)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Digital Society and Intelligent Systems (DSInS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/dsins54396.2021.9670608","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Fake face detection is an essential yet under-explored area, because faces generated by modern Generative Adversarial Networks (GANs) are virtually indiscernible by human observers. A recent work, GramNet[1], proposed using global texture information as heuristic because the features that previous approaches rely on are largely lost after image corruption and are not generalizable to different GANs. However, our theoretical reasoning and empirical studies both lead to the realization that their GLCM descriptor, as a global texture descriptor, is still not robust enough in all cases of image manipulation, because it doesn't take enough pixels into account, making it distortion-prone. Statistical analyses show that LBP is more generalizable to different GANs, and also reaches high consistency in outputs after image manipulation. Motivated by this finding, we implemented a Convolutional Neural Network with a ResNet backbone that uses LBP to enhance its global texture perception, effectively describing the texture at various semantic levels in an image with improved robustness. We conducted experiments with our model on images generated from StyleGAN and StyleGAN2, as well as images manipulated by different filters, and showed that our model reaches better and more consistent performance during image manipulation or in cross-domain settings, especially when images are subjected to Gaussian noise, in which we reached a performance increase from 82% to 90%. We open-sourced our code at https://github.com/Josh-Cena/lbpnet.

查看原文本刊更多论文

LBPNet:在神经网络中插入局部二值模式以增强假人脸检测的操作不变性

假人脸检测是一个重要但尚未开发的领域，因为由现代生成对抗网络(gan)生成的人脸几乎无法被人类观察者识别。最近的一项工作，GramNet[1]，提出使用全局纹理信息作为启发式，因为以前的方法所依赖的特征在图像损坏后很大程度上丢失了，并且不能推广到不同的gan。然而，我们的理论推理和实证研究都表明，他们的GLCM描述符作为一个全局纹理描述符，在所有的图像处理情况下仍然不够鲁棒，因为它没有考虑到足够的像素，使其容易失真。统计分析表明，LBP在不同的gan中具有更强的泛化性，并且经过图像处理后的输出具有较高的一致性。基于这一发现，我们实现了一个带有ResNet主干的卷积神经网络，该网络使用LBP来增强其全局纹理感知，有效地描述了图像中不同语义层次的纹理，并提高了鲁棒性。我们用我们的模型对StyleGAN和StyleGAN2生成的图像以及不同滤波器处理的图像进行了实验，结果表明，我们的模型在图像处理或跨域设置中达到了更好和更一致的性能，特别是当图像受到高斯噪声时，我们的性能从82%提高到90%。我们在https://github.com/Josh-Cena/lbpnet上开源了我们的代码。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 International Conference on Digital Society and Intelligent Systems (DSInS)

自引率

0.00%

发文量