Deep fake detection using an optimal deep learning model with multi head attention-based feature extraction scheme

The Visual Computer Pub Date : 2024-07-17 DOI:10.1007/s00371-024-03567-0

R. Raja Sekar, T. Dhiliphan Rajkumar, Koteswara Rao Anne

{"title":"Deep fake detection using an optimal deep learning model with multi head attention-based feature extraction scheme","authors":"R. Raja Sekar, T. Dhiliphan Rajkumar, Koteswara Rao Anne","doi":"10.1007/s00371-024-03567-0","DOIUrl":null,"url":null,"abstract":"<p>Face forgery, or deep fake, is a frequently used method to produce fake face images, network pornography, blackmail, and other illegal activities. Researchers developed several detection approaches based on the changing traces presented by deep forgery to limit the damage caused by deep fake methods. They obtain limited performance when evaluating cross-datum scenarios. This paper proposes an optimal deep learning approach with an attention-based feature learning scheme to perform DFD more accurately. The proposed system mainly comprises ‘5’ phases: face detection, preprocessing, texture feature extraction, spatial feature extraction, and classification. The face regions are initially detected from the collected data using the Viola–Jones (VJ) algorithm. Then, preprocessing is carried out, which resizes and normalizes the detected face regions to improve their quality for detection purposes. Next, texture features are learned using the Butterfly Optimized Gabor Filter to get information about the local features of objects in an image. Then, the spatial features are extracted using Residual Network-50 with Multi Head Attention (RN50MHA) to represent the data globally. Finally, classification is done using the Optimal Long Short-Term Memory (OLSTM), which classifies the data as fake or real, in which optimization of network is done using Enhanced Archimedes Optimization Algorithm. The proposed system is evaluated on four benchmark datasets such as Face Forensics + + (FF + +), Deepfake Detection Challenge, Celebrity Deepfake (CDF), and Wild Deepfake. The experimental results show that DFD using OLSTM and RN50MHA achieves a higher inter and intra-dataset detection rate than existing state-of-the-art methods.</p>","PeriodicalId":501186,"journal":{"name":"The Visual Computer","volume":"76 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Visual Computer","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00371-024-03567-0","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Face forgery, or deep fake, is a frequently used method to produce fake face images, network pornography, blackmail, and other illegal activities. Researchers developed several detection approaches based on the changing traces presented by deep forgery to limit the damage caused by deep fake methods. They obtain limited performance when evaluating cross-datum scenarios. This paper proposes an optimal deep learning approach with an attention-based feature learning scheme to perform DFD more accurately. The proposed system mainly comprises ‘5’ phases: face detection, preprocessing, texture feature extraction, spatial feature extraction, and classification. The face regions are initially detected from the collected data using the Viola–Jones (VJ) algorithm. Then, preprocessing is carried out, which resizes and normalizes the detected face regions to improve their quality for detection purposes. Next, texture features are learned using the Butterfly Optimized Gabor Filter to get information about the local features of objects in an image. Then, the spatial features are extracted using Residual Network-50 with Multi Head Attention (RN50MHA) to represent the data globally. Finally, classification is done using the Optimal Long Short-Term Memory (OLSTM), which classifies the data as fake or real, in which optimization of network is done using Enhanced Archimedes Optimization Algorithm. The proposed system is evaluated on four benchmark datasets such as Face Forensics + + (FF + +), Deepfake Detection Challenge, Celebrity Deepfake (CDF), and Wild Deepfake. The experimental results show that DFD using OLSTM and RN50MHA achieves a higher inter and intra-dataset detection rate than existing state-of-the-art methods.

Abstract Image

查看原文本刊更多论文

使用基于多头注意力特征提取方案的最佳深度学习模型进行深度假货检测

人脸伪造或深度伪造是一种常用的方法，用于制作虚假人脸图像、网络色情、勒索和其他非法活动。研究人员开发了几种基于深度伪造所呈现的变化痕迹的检测方法，以限制深度伪造方法所造成的破坏。这些方法在评估跨数据场景时性能有限。本文提出了一种基于注意力特征学习方案的最优深度学习方法，以更准确地执行 DFD。所提出的系统主要包括 "5 "个阶段：人脸检测、预处理、纹理特征提取、空间特征提取和分类。首先使用 Viola-Jones (VJ) 算法从收集到的数据中检测出人脸区域。然后进行预处理，调整检测到的人脸区域的大小并使其正常化，以提高检测质量。接着，使用蝴蝶优化 Gabor 滤波器学习纹理特征，以获取图像中物体的局部特征信息。然后，使用多头注意力残差网络-50（RN50MHA）提取空间特征，以表示全局数据。最后，使用最佳长短期记忆（OLSTM）进行分类，将数据分为假数据和真数据，并使用增强型阿基米德优化算法对网络进行优化。所提出的系统在四个基准数据集上进行了评估，如人脸取证 + +（FF + +）、Deepfake Detection Challenge、Celebrity Deepfake（CDF）和 Wild Deepfake。实验结果表明，与现有的最先进方法相比，使用 OLSTM 和 RN50MHA 的 DFD 在数据集之间和数据集内部实现了更高的检测率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The Visual Computer

自引率

0.00%

发文量