Efficient-Frequency: a hybrid visual forensic framework for facial forgery detection

Chau Xuan Truong Du, Le Hoang Duong, T. T. Huynh, Minh Tam Pham, Nguyen Quoc Viet Hung, Jun Jo
{"title":"Efficient-Frequency: a hybrid visual forensic framework for facial forgery detection","authors":"Chau Xuan Truong Du, Le Hoang Duong, T. T. Huynh, Minh Tam Pham, Nguyen Quoc Viet Hung, Jun Jo","doi":"10.1109/SSCI47803.2020.9308305","DOIUrl":null,"url":null,"abstract":"The recent years have witnessed the significant development of visual forgery techniques and their malicious applications such as spreading of fake news and rumours, defamation or blackmailing of politicians and celebrities, manipulation of election result in political warfare. The manipulated contents have reached to such sophisticated level that human cannot tell apart whether a given content is real or fake. To deal with this serious threat, a rich body of visual forensic techniques has been proposed for detecting forged video and images. However, existing techniques either rely solely on engineered features or require a complex deep learning model to extract the underlying patterns. In this paper, we propose a novel end-to-end visual forensic framework that can incorporate different modalities to efficiently classify real and forged contents. The model employs both the original content and its frequency domain analysis to fully exploit the richness of the image latent patterns. They are forwarded into two separated EfficientNet, a light yet efficient neural network architecture specialized for image classification, for pattern extraction. Then, we design a late-fusion mechanism to fuse the learnt features in original and frequency domain based on the importance of the underlying information. Our experimental results show that our proposed technique outperforms other state-of-the-art forensic approaches in many datasets and being robust to various visual forgery techniques.","PeriodicalId":413489,"journal":{"name":"2020 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Symposium Series on Computational Intelligence (SSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSCI47803.2020.9308305","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

The recent years have witnessed the significant development of visual forgery techniques and their malicious applications such as spreading of fake news and rumours, defamation or blackmailing of politicians and celebrities, manipulation of election result in political warfare. The manipulated contents have reached to such sophisticated level that human cannot tell apart whether a given content is real or fake. To deal with this serious threat, a rich body of visual forensic techniques has been proposed for detecting forged video and images. However, existing techniques either rely solely on engineered features or require a complex deep learning model to extract the underlying patterns. In this paper, we propose a novel end-to-end visual forensic framework that can incorporate different modalities to efficiently classify real and forged contents. The model employs both the original content and its frequency domain analysis to fully exploit the richness of the image latent patterns. They are forwarded into two separated EfficientNet, a light yet efficient neural network architecture specialized for image classification, for pattern extraction. Then, we design a late-fusion mechanism to fuse the learnt features in original and frequency domain based on the importance of the underlying information. Our experimental results show that our proposed technique outperforms other state-of-the-art forensic approaches in many datasets and being robust to various visual forgery techniques.
高效-频率:用于面部伪造检测的混合视觉取证框架
近年来,视觉伪造技术及其恶意应用取得了重大发展,例如传播假新闻和谣言,诽谤或勒索政治家和名人,在政治战争中操纵选举结果。被操纵的内容已经达到了如此复杂的程度,以至于人类无法区分给定的内容是真是假。为了应对这种严重的威胁,人们提出了丰富的视觉取证技术来检测伪造的视频和图像。然而,现有的技术要么完全依赖于工程特征,要么需要一个复杂的深度学习模型来提取潜在的模式。在本文中,我们提出了一种新颖的端到端视觉取证框架,它可以结合不同的模式来有效地分类真实和伪造的内容。该模型将原始内容和其频域分析相结合,充分利用了图像潜在模式的丰富性。它们被转发到两个独立的effentnet中,这是一个轻量级但高效的神经网络架构,专门用于图像分类和模式提取。然后,我们设计了一种基于基础信息重要性的后期融合机制,将学习到的特征在原始域和频域进行融合。我们的实验结果表明,我们提出的技术在许多数据集中优于其他最先进的取证方法,并且对各种视觉伪造技术具有鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信