{"title":"基于一致表示学习的稳定扩散模型对人工智能生成图像的暴露","authors":"Van-Nhan Tran;Piljoo Choi;Hoanh-Su Le;Suk-Hwan Lee;Ki-Ryong Kwon","doi":"10.1109/OJCS.2025.3575507","DOIUrl":null,"url":null,"abstract":"Diffusion-based generative models have significantly advanced the field of image synthesis, presenting additional challenges regarding the integrity and authenticity of digital images. Consequently, the identification of AI-generated images has become a critical problem in image forensics. However, there is a lack of literature addressing the detection of images generated by diffusion models. In this article, our focus is on developing a model capable of detecting images generated through both GAN techniques and diffusion models. We propose DiffCoR, a novel detection method for identifying AI-generated images. It consists of two main modules: Stable Diffusion Processing (SDP) and Image Representation Learning (IRL). The SDP module uses a pre-trained Stable Diffusion model to reconstruct input images via reverse diffusion and captures subtle manipulations through reconstruction discrepancies. The IRL module applies self-supervised learning with Latent Consistency Loss (LCL) to extract robust, invariant features, ensuring consistent latent representations across augmented views. We also incorporate frequency domain analysis using Discrete Fourier Transform (DFT) to enhance manipulation detection. Additionally, we introduce ForensicsImage, a publicly available dataset of over 400,000 real and AI-generated images from LSUN-Bedroom, CelebA-HQ, CelebDFv2, and various diffusion models. Experiments on ForensicsImage and GenImage show that DiffCoR achieves state-of-the-art performance, with strong cross-dataset generalization, making it suitable for real-world use in digital forensics, content verification, and social media moderation.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"6 ","pages":"1353-1365"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11018794","citationCount":"0","resultStr":"{\"title\":\"DiffCoR: Exposing AI-Generated Image by Using Stable Diffusion Model Based on Consistent Representation Learning\",\"authors\":\"Van-Nhan Tran;Piljoo Choi;Hoanh-Su Le;Suk-Hwan Lee;Ki-Ryong Kwon\",\"doi\":\"10.1109/OJCS.2025.3575507\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Diffusion-based generative models have significantly advanced the field of image synthesis, presenting additional challenges regarding the integrity and authenticity of digital images. Consequently, the identification of AI-generated images has become a critical problem in image forensics. However, there is a lack of literature addressing the detection of images generated by diffusion models. In this article, our focus is on developing a model capable of detecting images generated through both GAN techniques and diffusion models. We propose DiffCoR, a novel detection method for identifying AI-generated images. It consists of two main modules: Stable Diffusion Processing (SDP) and Image Representation Learning (IRL). The SDP module uses a pre-trained Stable Diffusion model to reconstruct input images via reverse diffusion and captures subtle manipulations through reconstruction discrepancies. The IRL module applies self-supervised learning with Latent Consistency Loss (LCL) to extract robust, invariant features, ensuring consistent latent representations across augmented views. We also incorporate frequency domain analysis using Discrete Fourier Transform (DFT) to enhance manipulation detection. Additionally, we introduce ForensicsImage, a publicly available dataset of over 400,000 real and AI-generated images from LSUN-Bedroom, CelebA-HQ, CelebDFv2, and various diffusion models. Experiments on ForensicsImage and GenImage show that DiffCoR achieves state-of-the-art performance, with strong cross-dataset generalization, making it suitable for real-world use in digital forensics, content verification, and social media moderation.\",\"PeriodicalId\":13205,\"journal\":{\"name\":\"IEEE Open Journal of the Computer Society\",\"volume\":\"6 \",\"pages\":\"1353-1365\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-03-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11018794\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Open Journal of the Computer Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11018794/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of the Computer Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11018794/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
DiffCoR: Exposing AI-Generated Image by Using Stable Diffusion Model Based on Consistent Representation Learning
Diffusion-based generative models have significantly advanced the field of image synthesis, presenting additional challenges regarding the integrity and authenticity of digital images. Consequently, the identification of AI-generated images has become a critical problem in image forensics. However, there is a lack of literature addressing the detection of images generated by diffusion models. In this article, our focus is on developing a model capable of detecting images generated through both GAN techniques and diffusion models. We propose DiffCoR, a novel detection method for identifying AI-generated images. It consists of two main modules: Stable Diffusion Processing (SDP) and Image Representation Learning (IRL). The SDP module uses a pre-trained Stable Diffusion model to reconstruct input images via reverse diffusion and captures subtle manipulations through reconstruction discrepancies. The IRL module applies self-supervised learning with Latent Consistency Loss (LCL) to extract robust, invariant features, ensuring consistent latent representations across augmented views. We also incorporate frequency domain analysis using Discrete Fourier Transform (DFT) to enhance manipulation detection. Additionally, we introduce ForensicsImage, a publicly available dataset of over 400,000 real and AI-generated images from LSUN-Bedroom, CelebA-HQ, CelebDFv2, and various diffusion models. Experiments on ForensicsImage and GenImage show that DiffCoR achieves state-of-the-art performance, with strong cross-dataset generalization, making it suitable for real-world use in digital forensics, content verification, and social media moderation.