RLGC: Reconstruction Learning Fusing Gradient and Content Features for Efficient Deepfake Detection
Authors: Kaiwen Xu; Xiyuan Hu; Xiaokang Zhou; Xiaolong Xu; Lianyong Qi; Chen Chen
Journal: IEEE Transactions on Consumer Electronics, vol. 70, no. 3, pp. 6084-6094
Publication date: 2024-07-29
Publication type: Journal Article
DOI: 10.1109/TCE.2024.3435032
URL: https://ieeexplore.ieee.org/document/10612835/
Impact factor: 4.3; JCR: Q1 (Engineering, Electrical & Electronic)
Citations: 0
Abstract
Current deepfake detection methods, which rely on noise features, localized textures, or frequency statistics, may perform well in specific domains or against specific forgery methods. However, their generalization performance is often unsatisfactory because they neglect to mine intrinsic facial features. To address this problem, we re-evaluate the fusion of image gradient features in neural networks and delve deeper into the intrinsic structure of input images. Consequently, we propose a reconstruction-classification network that first learns face content and gradient separately from a reconstruction perspective and then detects forged faces by fusing the two. This paper introduces three components: 1) a dual-branch feature extraction module to excite distributional inconsistencies between real and forged faces; 2) a content-gradient feature fusion module to investigate the relationship between face content and image gradient; and 3) a reconstruction-disparity-based bi-directional attention module that guides the model in efficiently classifying the fused features. Extensive experiments on large-scale benchmark datasets demonstrate that our method significantly improves performance, especially generalization ability, compared to state-of-the-art methods.
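The abstract does not say how the image gradient used by the gradient branch is computed. Purely as an illustration, the sketch below assumes a Sobel operator with replicated borders, which is one common way to obtain a gradient map from a grayscale face crop. The function name `gradient_magnitude` and the `Image` alias are hypothetical and are not taken from the paper.

```python
import math
from typing import List

Image = List[List[float]]  # hypothetical alias: rows of grayscale pixel values

# Standard 3x3 Sobel kernels (assumed here; the paper's operator is not stated).
SOBEL_X = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
SOBEL_Y = ((-1, -2, -1), (0, 0, 0), (1, 2, 1))


def gradient_magnitude(gray: Image) -> Image:
    """Return the per-pixel Sobel gradient magnitude of a grayscale image."""
    h, w = len(gray), len(gray[0])

    def pixel(r: int, c: int) -> float:
        # Clamp coordinates so that border pixels are replicated.
        return gray[min(max(r, 0), h - 1)][min(max(c, 0), w - 1)]

    out = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            gx = gy = 0.0
            for i in range(3):
                for j in range(3):
                    v = pixel(r + i - 1, c + j - 1)
                    gx += SOBEL_X[i][j] * v  # horizontal intensity change
                    gy += SOBEL_Y[i][j] * v  # vertical intensity change
            out[r][c] = math.hypot(gx, gy)
    return out
```

In a dual-branch design of the kind the abstract describes, a map like this could serve as the input to the gradient branch while the original image feeds the content branch. How RLGC actually builds and fuses the two is specified in the paper itself, not here.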
About the journal:
The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.