{"title":"Centroid-based Contrastive Consistency Learning for transferable deepfake detection","authors":"Ruiqi Zha, Zhichao Lian, Qianmu Li","doi":"10.1016/j.neucom.2025.130009","DOIUrl":null,"url":null,"abstract":"<div><div>Previous research efforts in deepfake detection mainly concentrated on identifying and differentiating artifacts discernible to humans. Those methods have left a bias in learned models, as they tend to concentrate on the disparities between forged and natural regions from the perspective of a single sample while overlooking consistency within categories from the perspective of the entire sample set, which remains crucial across various real-world applications. Therefore, inspired by contrastive learning, we tackle the deepfake detection problem by learning the invariant representations of both categories. Our proposed method, termed Centroid-based Contrastive Consistency Learning (C3L) method, integrates constraints on representations at both the data preprocessing and feature extraction stages. Specifically, during data preprocessing, we consider both temporal relationships within videos and the latent relationships within synthesis data. We introduce a novel Positive Enhancement Module (PEM) designed to characterize natural and forged samples in a facial semantically irrelevant way, thereby guiding a task-oriented positive pair contrasting strategy. In addition, at the feature extraction stage, we introduce the Margin Feature Simulation Module (MFSM), which leverages the centroid of the natural category to simulate marginal features for both categories. Subsequently, we employ the Supervised Contrastive Margin Loss (SCML), utilizing simulated features to emphasize differences at decision boundaries and optimize the learning process. The effectiveness and robustness of the proposed method have been demonstrated through extensive experiments.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"637 ","pages":"Article 130009"},"PeriodicalIF":5.5000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225006812","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Previous research efforts in deepfake detection mainly concentrated on identifying and differentiating artifacts discernible to humans. Those methods have left a bias in learned models, as they tend to concentrate on the disparities between forged and natural regions from the perspective of a single sample while overlooking consistency within categories from the perspective of the entire sample set, which remains crucial across various real-world applications. Therefore, inspired by contrastive learning, we tackle the deepfake detection problem by learning the invariant representations of both categories. Our proposed method, termed Centroid-based Contrastive Consistency Learning (C3L) method, integrates constraints on representations at both the data preprocessing and feature extraction stages. Specifically, during data preprocessing, we consider both temporal relationships within videos and the latent relationships within synthesis data. We introduce a novel Positive Enhancement Module (PEM) designed to characterize natural and forged samples in a facial semantically irrelevant way, thereby guiding a task-oriented positive pair contrasting strategy. In addition, at the feature extraction stage, we introduce the Margin Feature Simulation Module (MFSM), which leverages the centroid of the natural category to simulate marginal features for both categories. Subsequently, we employ the Supervised Contrastive Margin Loss (SCML), utilizing simulated features to emphasize differences at decision boundaries and optimize the learning process. The effectiveness and robustness of the proposed method have been demonstrated through extensive experiments.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.