Eff-YNet: A Dual Task Network for DeepFake Detection and Segmentation

2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM) Pub Date : 2021-01-04 DOI:10.1109/IMCOM51814.2021.9377373

Eric Tjon, M. Moh, Teng-Sheng Moh

{"title":"Eff-YNet: A Dual Task Network for DeepFake Detection and Segmentation","authors":"Eric Tjon, M. Moh, Teng-Sheng Moh","doi":"10.1109/IMCOM51814.2021.9377373","DOIUrl":null,"url":null,"abstract":"Advances in generative models and manipulation techniques have given rise to digitally altered videos known as deepfakes. These videos are difficult to identify for both humans and machines. Modern detection methods exploit various weaknesses in deepfake videos, such as visual artifacts and inconsistent posing. In this paper, we describe a novel architecture called Eff-YNet designed to detect visual differences between altered and unaltered areas. The architecture combines an EfficientNet encoder and a U-Net with a classification branch into a model capable of both classifying and segmenting deepfake videos. The task of segmentation helps train the classifier and also produces useful segmentation masks. We also implement ResNet 3D to detect spatiotemporal inconsistencies. To test these models, we run experiments against the Deepfake Detection Challenge dataset and show improvements over baseline classification models. Furthermore, we find that an ensemble of these two approaches improves performance over a single approach alone.","PeriodicalId":275121,"journal":{"name":"2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCOM51814.2021.9377373","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Advances in generative models and manipulation techniques have given rise to digitally altered videos known as deepfakes. These videos are difficult to identify for both humans and machines. Modern detection methods exploit various weaknesses in deepfake videos, such as visual artifacts and inconsistent posing. In this paper, we describe a novel architecture called Eff-YNet designed to detect visual differences between altered and unaltered areas. The architecture combines an EfficientNet encoder and a U-Net with a classification branch into a model capable of both classifying and segmenting deepfake videos. The task of segmentation helps train the classifier and also produces useful segmentation masks. We also implement ResNet 3D to detect spatiotemporal inconsistencies. To test these models, we run experiments against the Deepfake Detection Challenge dataset and show improvements over baseline classification models. Furthermore, we find that an ensemble of these two approaches improves performance over a single approach alone.

查看原文本刊更多论文

efff - ynet:一种用于深度伪造检测和分割的双任务网络

生成模型和操作技术的进步导致了被称为深度伪造的数字修改视频的出现。这些视频对人类和机器来说都很难识别。现代检测方法利用了深度假视频的各种弱点，比如视觉伪影和不一致的姿势。在本文中，我们描述了一种名为Eff-YNet的新架构，旨在检测改变和未改变区域之间的视觉差异。该架构将一个高效率编码器和一个U-Net与一个分类分支结合在一起，形成一个能够对深度假视频进行分类和分割的模型。分割任务有助于训练分类器，也产生有用的分割掩码。我们还实现了ResNet 3D来检测时空不一致性。为了测试这些模型，我们对Deepfake Detection Challenge数据集进行了实验，并展示了对基线分类模型的改进。此外，我们发现这两种方法的集成比单独使用一种方法提高了性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 15th International Conference on Ubiquitous Information Management and Communication (IMCOM)

自引率

0.00%

发文量