Efficient-Frequency: a hybrid visual forensic framework for facial forgery detection

2020 IEEE Symposium Series on Computational Intelligence (SSCI) Pub Date : 2020-12-01 DOI:10.1109/SSCI47803.2020.9308305

Chau Xuan Truong Du, Le Hoang Duong, T. T. Huynh, Minh Tam Pham, Nguyen Quoc Viet Hung, Jun Jo

{"title":"Efficient-Frequency: a hybrid visual forensic framework for facial forgery detection","authors":"Chau Xuan Truong Du, Le Hoang Duong, T. T. Huynh, Minh Tam Pham, Nguyen Quoc Viet Hung, Jun Jo","doi":"10.1109/SSCI47803.2020.9308305","DOIUrl":null,"url":null,"abstract":"The recent years have witnessed the significant development of visual forgery techniques and their malicious applications such as spreading of fake news and rumours, defamation or blackmailing of politicians and celebrities, manipulation of election result in political warfare. The manipulated contents have reached to such sophisticated level that human cannot tell apart whether a given content is real or fake. To deal with this serious threat, a rich body of visual forensic techniques has been proposed for detecting forged video and images. However, existing techniques either rely solely on engineered features or require a complex deep learning model to extract the underlying patterns. In this paper, we propose a novel end-to-end visual forensic framework that can incorporate different modalities to efficiently classify real and forged contents. The model employs both the original content and its frequency domain analysis to fully exploit the richness of the image latent patterns. They are forwarded into two separated EfficientNet, a light yet efficient neural network architecture specialized for image classification, for pattern extraction. Then, we design a late-fusion mechanism to fuse the learnt features in original and frequency domain based on the importance of the underlying information. Our experimental results show that our proposed technique outperforms other state-of-the-art forensic approaches in many datasets and being robust to various visual forgery techniques.","PeriodicalId":413489,"journal":{"name":"2020 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Symposium Series on Computational Intelligence (SSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSCI47803.2020.9308305","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

The recent years have witnessed the significant development of visual forgery techniques and their malicious applications such as spreading of fake news and rumours, defamation or blackmailing of politicians and celebrities, manipulation of election result in political warfare. The manipulated contents have reached to such sophisticated level that human cannot tell apart whether a given content is real or fake. To deal with this serious threat, a rich body of visual forensic techniques has been proposed for detecting forged video and images. However, existing techniques either rely solely on engineered features or require a complex deep learning model to extract the underlying patterns. In this paper, we propose a novel end-to-end visual forensic framework that can incorporate different modalities to efficiently classify real and forged contents. The model employs both the original content and its frequency domain analysis to fully exploit the richness of the image latent patterns. They are forwarded into two separated EfficientNet, a light yet efficient neural network architecture specialized for image classification, for pattern extraction. Then, we design a late-fusion mechanism to fuse the learnt features in original and frequency domain based on the importance of the underlying information. Our experimental results show that our proposed technique outperforms other state-of-the-art forensic approaches in many datasets and being robust to various visual forgery techniques.

查看原文本刊更多论文

高效-频率:用于面部伪造检测的混合视觉取证框架

近年来，视觉伪造技术及其恶意应用取得了重大发展，例如传播假新闻和谣言，诽谤或勒索政治家和名人，在政治战争中操纵选举结果。被操纵的内容已经达到了如此复杂的程度，以至于人类无法区分给定的内容是真是假。为了应对这种严重的威胁，人们提出了丰富的视觉取证技术来检测伪造的视频和图像。然而，现有的技术要么完全依赖于工程特征，要么需要一个复杂的深度学习模型来提取潜在的模式。在本文中，我们提出了一种新颖的端到端视觉取证框架，它可以结合不同的模式来有效地分类真实和伪造的内容。该模型将原始内容和其频域分析相结合，充分利用了图像潜在模式的丰富性。它们被转发到两个独立的effentnet中，这是一个轻量级但高效的神经网络架构，专门用于图像分类和模式提取。然后，我们设计了一种基于基础信息重要性的后期融合机制，将学习到的特征在原始域和频域进行融合。我们的实验结果表明，我们提出的技术在许多数据集中优于其他最先进的取证方法，并且对各种视觉伪造技术具有鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE Symposium Series on Computational Intelligence (SSCI)

自引率

0.00%

发文量