求助PDF
{"title":"A Comparative Analysis of Deepfake Detection Methods Using Overlapping Multiple Dynamic Images","authors":"Enkhtaivan Purevsuren, Junya Sato, Takuya Akashi","doi":"10.1002/tee.24258","DOIUrl":null,"url":null,"abstract":"<p>Deepfake technology, which uses artificial intelligence to create realistic fake images, audio, and videos, has raised significant concerns due to its potential for misuse and manipulation. The emergence of deepfake technology poses a significant threat to the integrity of digital content, necessitating robust detection mechanisms. This paper proposes a novel method for deepfake detection by combining Overlapping Multiple Dynamic Images (OMDI) and Inversed Overlapping Multiple Dynamic Images (I-OMDI). Both representations capture temporal inconsistencies and subtle visual artifacts in fake videos by effectively utilizing spatial–temporal information. Our approach employs EfficientNetB7 as the backbone for feature extraction, enabling the model to distinguish between real and fake videos with high accuracy. By combining OMDI and I-OMDI with a weighted average strategy, we amplify the strengths of each method. Specifically, we assign equal weights of 0.5 to OMDI and I-OMDI based on their individual contributions to performance metrics. This balance yields substantial performance improvements across multiple datasets. When evaluated on the Celeb-DF v2 and DFDC datasets, our proposed model achieves state-of-the-art results, with an AUC score of 0.9952 on Celeb-DF v2 and 0.9947 on DFDC. These results underscore the robustness of the combined OMDI and I-OMDI methods in identifying deepfake videos. Furthermore, our model demonstrates superior performance compared to existing methods, including those by Tran <i>et al.</i> and Heo <i>et al.</i>, underscoring its effectiveness in practical deepfake detection applications. © 2025 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.</p>","PeriodicalId":13435,"journal":{"name":"IEEJ Transactions on Electrical and Electronic Engineering","volume":"20 6","pages":"886-898"},"PeriodicalIF":1.0000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEJ Transactions on Electrical and Electronic Engineering","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/tee.24258","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
引用
批量引用
Abstract
Deepfake technology, which uses artificial intelligence to create realistic fake images, audio, and videos, has raised significant concerns due to its potential for misuse and manipulation. The emergence of deepfake technology poses a significant threat to the integrity of digital content, necessitating robust detection mechanisms. This paper proposes a novel method for deepfake detection by combining Overlapping Multiple Dynamic Images (OMDI) and Inversed Overlapping Multiple Dynamic Images (I-OMDI). Both representations capture temporal inconsistencies and subtle visual artifacts in fake videos by effectively utilizing spatial–temporal information. Our approach employs EfficientNetB7 as the backbone for feature extraction, enabling the model to distinguish between real and fake videos with high accuracy. By combining OMDI and I-OMDI with a weighted average strategy, we amplify the strengths of each method. Specifically, we assign equal weights of 0.5 to OMDI and I-OMDI based on their individual contributions to performance metrics. This balance yields substantial performance improvements across multiple datasets. When evaluated on the Celeb-DF v2 and DFDC datasets, our proposed model achieves state-of-the-art results, with an AUC score of 0.9952 on Celeb-DF v2 and 0.9947 on DFDC. These results underscore the robustness of the combined OMDI and I-OMDI methods in identifying deepfake videos. Furthermore, our model demonstrates superior performance compared to existing methods, including those by Tran et al. and Heo et al. , underscoring its effectiveness in practical deepfake detection applications. © 2025 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.
基于多幅动态图像重叠的深度造假检测方法比较分析
Deepfake技术利用人工智能来制作逼真的假图像、音频和视频,由于可能被滥用和操纵,该技术引起了极大的担忧。深度伪造技术的出现对数字内容的完整性构成了重大威胁,需要强大的检测机制。本文提出了一种将重叠多动态图像(OMDI)和逆重叠多动态图像(I-OMDI)相结合的深度伪造检测新方法。这两种表示都通过有效利用时空信息来捕捉假视频中的时间不一致性和微妙的视觉伪影。我们的方法采用了effentnetb7作为特征提取的主干,使模型能够以高精度区分真假视频。通过将OMDI和I-OMDI与加权平均策略相结合,我们放大了每种方法的优势。具体来说,我们根据OMDI和I-OMDI各自对性能指标的贡献为它们分配了相同的权重0.5。这种平衡在多个数据集上产生了实质性的性能改进。当对Celeb-DF v2和DFDC数据集进行评估时,我们提出的模型达到了最先进的结果,Celeb-DF v2和DFDC的AUC得分分别为0.9952和0.9947。这些结果强调了结合OMDI和I-OMDI方法在识别深度假视频方面的鲁棒性。此外,与现有方法(包括Tran等人和Heo等人的方法)相比,我们的模型表现出卓越的性能,强调了其在实际深度伪造检测应用中的有效性。©2025日本电气工程师协会和Wiley期刊有限责任公司。
本文章由计算机程序翻译,如有差异,请以英文原文为准。