{"title":"基于双域融合的伪装目标检测","authors":"Chenxing Shen, Zhisheng Cui, Leheng Zhang, Miaohui Zhang","doi":"10.1016/j.dsp.2025.105436","DOIUrl":null,"url":null,"abstract":"<div><div>In camouflaged object detection (COD), the goal of accurately and completely segmenting foreground objects from the background on a per-pixel basis is pursued by various network architectures. Despite their specialized module designs tailored for camouflaged object detection, most of these approaches operate primarily within the RGB domain to locate camouflaged objects. Given the intrinsic characteristics of camouflaged objects, solely segmenting them within the RGB domain often encounters significant interference and challenges. To address this issue and leverage the advantages of detecting camouflaged objects from the frequency domain, this paper introduces a Dual-Domain Fusion Network (DDFNet). By integrating frequency domain features with RGB domain features, DDFNet exploits the complementary strengths of both domains, achieving precise localization and segmentation of camouflaged objects. DDFNet is composed of three main modules. The MHFM performs group fusion of backbone features to enhance the representation of these features. The GFGM, based on a dual-phase architecture, extracts the camouflaged target hidden in the RGB domain by operating in the frequency domain. It achieves this by leveraging the detailed information contained in high-frequency features and the spatial information in low-frequency features, thereby obtaining a frequency-domain feature representation of the camouflaged target, which supplements the RGB domain features. Finally, through GHIM, DDFNet performs complementary fusion of the frequency-domain features and the RGB-domain features, utilizing their respective advantages to achieve precise localization of the camouflaged target. Compared to other state-of-the-art methods in camouflaged object detection, DDFNet demonstrates superior performance. Additionally, we introduce polyp segmentation as a downstream task for the proposed network to showcase the ability of DDFNet in solving real-world problems.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"167 ","pages":"Article 105436"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Camouflaged object detection via dual domain fusion\",\"authors\":\"Chenxing Shen, Zhisheng Cui, Leheng Zhang, Miaohui Zhang\",\"doi\":\"10.1016/j.dsp.2025.105436\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In camouflaged object detection (COD), the goal of accurately and completely segmenting foreground objects from the background on a per-pixel basis is pursued by various network architectures. Despite their specialized module designs tailored for camouflaged object detection, most of these approaches operate primarily within the RGB domain to locate camouflaged objects. Given the intrinsic characteristics of camouflaged objects, solely segmenting them within the RGB domain often encounters significant interference and challenges. To address this issue and leverage the advantages of detecting camouflaged objects from the frequency domain, this paper introduces a Dual-Domain Fusion Network (DDFNet). By integrating frequency domain features with RGB domain features, DDFNet exploits the complementary strengths of both domains, achieving precise localization and segmentation of camouflaged objects. DDFNet is composed of three main modules. The MHFM performs group fusion of backbone features to enhance the representation of these features. The GFGM, based on a dual-phase architecture, extracts the camouflaged target hidden in the RGB domain by operating in the frequency domain. It achieves this by leveraging the detailed information contained in high-frequency features and the spatial information in low-frequency features, thereby obtaining a frequency-domain feature representation of the camouflaged target, which supplements the RGB domain features. Finally, through GHIM, DDFNet performs complementary fusion of the frequency-domain features and the RGB-domain features, utilizing their respective advantages to achieve precise localization of the camouflaged target. Compared to other state-of-the-art methods in camouflaged object detection, DDFNet demonstrates superior performance. Additionally, we introduce polyp segmentation as a downstream task for the proposed network to showcase the ability of DDFNet in solving real-world problems.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"167 \",\"pages\":\"Article 105436\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425004580\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425004580","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Camouflaged object detection via dual domain fusion
In camouflaged object detection (COD), the goal of accurately and completely segmenting foreground objects from the background on a per-pixel basis is pursued by various network architectures. Despite their specialized module designs tailored for camouflaged object detection, most of these approaches operate primarily within the RGB domain to locate camouflaged objects. Given the intrinsic characteristics of camouflaged objects, solely segmenting them within the RGB domain often encounters significant interference and challenges. To address this issue and leverage the advantages of detecting camouflaged objects from the frequency domain, this paper introduces a Dual-Domain Fusion Network (DDFNet). By integrating frequency domain features with RGB domain features, DDFNet exploits the complementary strengths of both domains, achieving precise localization and segmentation of camouflaged objects. DDFNet is composed of three main modules. The MHFM performs group fusion of backbone features to enhance the representation of these features. The GFGM, based on a dual-phase architecture, extracts the camouflaged target hidden in the RGB domain by operating in the frequency domain. It achieves this by leveraging the detailed information contained in high-frequency features and the spatial information in low-frequency features, thereby obtaining a frequency-domain feature representation of the camouflaged target, which supplements the RGB domain features. Finally, through GHIM, DDFNet performs complementary fusion of the frequency-domain features and the RGB-domain features, utilizing their respective advantages to achieve precise localization of the camouflaged target. Compared to other state-of-the-art methods in camouflaged object detection, DDFNet demonstrates superior performance. Additionally, we introduce polyp segmentation as a downstream task for the proposed network to showcase the ability of DDFNet in solving real-world problems.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,