{"title":"优化频率协同策略驱动人工智能图像检测","authors":"Jun Li;Wentao Jiang;Liyan Shen;Yawei Ren","doi":"10.1109/JIOT.2025.3531053","DOIUrl":null,"url":null,"abstract":"Artificial intelligence (AI) image generation powered by large language model (large language modelss (LLMs)) enables highly realistic image synthesis and manipulation, posing significant security risks to Internet of Things (IoT) systems, particularly in identity authentication and data integrity. Although multidomain synthetic image detection has advanced, how spatial and frequency domain features affect decision making is still an open question, causing models to emphasize less critical areas and fall into local optimality. Through multidomain empirical analysis, we reveal the common contrastive differences in image textures and further demonstrate that frequency analysis helps capture the spectral differences in images. Building on this, we propose the collaborative spatial and frequency detector (CSFD). First, the image is decomposed into strong and weak texture regions in the spatial domain. Second, it aggregates different components in the frequency domain, using weighted channel attention to enhance spatial reasoning. Finally, the texture regions are combined to discriminate synthetic images. Experimental results demonstrate that incorporating channel attention based on frequency information improves the detection of synthetic images with spectral defects. On a comprehensive AI-generated image detection benchmark, the proposed method improves accuracy by 2.61% over current methods. Our code is available at <uri>https://github.com/JackPotProject/Frequency-Collaborative</uri>.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 11","pages":"16192-16203"},"PeriodicalIF":8.9000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimized Frequency Collaborative Strategy Drives AI Image Detection\",\"authors\":\"Jun Li;Wentao Jiang;Liyan Shen;Yawei Ren\",\"doi\":\"10.1109/JIOT.2025.3531053\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Artificial intelligence (AI) image generation powered by large language model (large language modelss (LLMs)) enables highly realistic image synthesis and manipulation, posing significant security risks to Internet of Things (IoT) systems, particularly in identity authentication and data integrity. Although multidomain synthetic image detection has advanced, how spatial and frequency domain features affect decision making is still an open question, causing models to emphasize less critical areas and fall into local optimality. Through multidomain empirical analysis, we reveal the common contrastive differences in image textures and further demonstrate that frequency analysis helps capture the spectral differences in images. Building on this, we propose the collaborative spatial and frequency detector (CSFD). First, the image is decomposed into strong and weak texture regions in the spatial domain. Second, it aggregates different components in the frequency domain, using weighted channel attention to enhance spatial reasoning. Finally, the texture regions are combined to discriminate synthetic images. Experimental results demonstrate that incorporating channel attention based on frequency information improves the detection of synthetic images with spectral defects. On a comprehensive AI-generated image detection benchmark, the proposed method improves accuracy by 2.61% over current methods. Our code is available at <uri>https://github.com/JackPotProject/Frequency-Collaborative</uri>.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 11\",\"pages\":\"16192-16203\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10848481/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10848481/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Optimized Frequency Collaborative Strategy Drives AI Image Detection
Artificial intelligence (AI) image generation powered by large language model (large language modelss (LLMs)) enables highly realistic image synthesis and manipulation, posing significant security risks to Internet of Things (IoT) systems, particularly in identity authentication and data integrity. Although multidomain synthetic image detection has advanced, how spatial and frequency domain features affect decision making is still an open question, causing models to emphasize less critical areas and fall into local optimality. Through multidomain empirical analysis, we reveal the common contrastive differences in image textures and further demonstrate that frequency analysis helps capture the spectral differences in images. Building on this, we propose the collaborative spatial and frequency detector (CSFD). First, the image is decomposed into strong and weak texture regions in the spatial domain. Second, it aggregates different components in the frequency domain, using weighted channel attention to enhance spatial reasoning. Finally, the texture regions are combined to discriminate synthetic images. Experimental results demonstrate that incorporating channel attention based on frequency information improves the detection of synthetic images with spectral defects. On a comprehensive AI-generated image detection benchmark, the proposed method improves accuracy by 2.61% over current methods. Our code is available at https://github.com/JackPotProject/Frequency-Collaborative.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.