基于双域融合的伪装目标检测

IF 2.9 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing Pub Date : 2025-06-26 DOI:10.1016/j.dsp.2025.105436

Chenxing Shen, Zhisheng Cui, Leheng Zhang, Miaohui Zhang

{"title":"基于双域融合的伪装目标检测","authors":"Chenxing Shen, Zhisheng Cui, Leheng Zhang, Miaohui Zhang","doi":"10.1016/j.dsp.2025.105436","DOIUrl":null,"url":null,"abstract":"<div><div>In camouflaged object detection (COD), the goal of accurately and completely segmenting foreground objects from the background on a per-pixel basis is pursued by various network architectures. Despite their specialized module designs tailored for camouflaged object detection, most of these approaches operate primarily within the RGB domain to locate camouflaged objects. Given the intrinsic characteristics of camouflaged objects, solely segmenting them within the RGB domain often encounters significant interference and challenges. To address this issue and leverage the advantages of detecting camouflaged objects from the frequency domain, this paper introduces a Dual-Domain Fusion Network (DDFNet). By integrating frequency domain features with RGB domain features, DDFNet exploits the complementary strengths of both domains, achieving precise localization and segmentation of camouflaged objects. DDFNet is composed of three main modules. The MHFM performs group fusion of backbone features to enhance the representation of these features. The GFGM, based on a dual-phase architecture, extracts the camouflaged target hidden in the RGB domain by operating in the frequency domain. It achieves this by leveraging the detailed information contained in high-frequency features and the spatial information in low-frequency features, thereby obtaining a frequency-domain feature representation of the camouflaged target, which supplements the RGB domain features. Finally, through GHIM, DDFNet performs complementary fusion of the frequency-domain features and the RGB-domain features, utilizing their respective advantages to achieve precise localization of the camouflaged target. Compared to other state-of-the-art methods in camouflaged object detection, DDFNet demonstrates superior performance. Additionally, we introduce polyp segmentation as a downstream task for the proposed network to showcase the ability of DDFNet in solving real-world problems.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"167 ","pages":"Article 105436"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Camouflaged object detection via dual domain fusion\",\"authors\":\"Chenxing Shen, Zhisheng Cui, Leheng Zhang, Miaohui Zhang\",\"doi\":\"10.1016/j.dsp.2025.105436\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In camouflaged object detection (COD), the goal of accurately and completely segmenting foreground objects from the background on a per-pixel basis is pursued by various network architectures. Despite their specialized module designs tailored for camouflaged object detection, most of these approaches operate primarily within the RGB domain to locate camouflaged objects. Given the intrinsic characteristics of camouflaged objects, solely segmenting them within the RGB domain often encounters significant interference and challenges. To address this issue and leverage the advantages of detecting camouflaged objects from the frequency domain, this paper introduces a Dual-Domain Fusion Network (DDFNet). By integrating frequency domain features with RGB domain features, DDFNet exploits the complementary strengths of both domains, achieving precise localization and segmentation of camouflaged objects. DDFNet is composed of three main modules. The MHFM performs group fusion of backbone features to enhance the representation of these features. The GFGM, based on a dual-phase architecture, extracts the camouflaged target hidden in the RGB domain by operating in the frequency domain. It achieves this by leveraging the detailed information contained in high-frequency features and the spatial information in low-frequency features, thereby obtaining a frequency-domain feature representation of the camouflaged target, which supplements the RGB domain features. Finally, through GHIM, DDFNet performs complementary fusion of the frequency-domain features and the RGB-domain features, utilizing their respective advantages to achieve precise localization of the camouflaged target. Compared to other state-of-the-art methods in camouflaged object detection, DDFNet demonstrates superior performance. Additionally, we introduce polyp segmentation as a downstream task for the proposed network to showcase the ability of DDFNet in solving real-world problems.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"167 \",\"pages\":\"Article 105436\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425004580\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425004580","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

在伪装目标检测（COD）中，各种网络架构都追求在逐像素的基础上准确、完整地分割前景目标和背景目标。尽管他们专门为伪装对象检测设计了专门的模块，但大多数这些方法主要在RGB域内操作，以定位伪装对象。考虑到伪装对象的内在特征，在RGB域内单独分割它们往往会遇到重大的干扰和挑战。为了解决这一问题，并利用从频域检测伪装目标的优势，本文引入了双域融合网络（DDFNet）。DDFNet通过将频域特征与RGB域特征相结合，利用两个域的互补优势，实现对伪装目标的精确定位和分割。DDFNet由三个主要模块组成。MHFM对骨干特征进行分组融合，增强骨干特征的表示能力。GFGM基于双相位结构，通过在频域操作提取隐藏在RGB域中的伪装目标。利用高频特征中包含的详细信息和低频特征中的空间信息，得到伪装目标的频域特征表示，补充了RGB域特征。最后，DDFNet通过GHIM对频域特征和rgb域特征进行互补融合，利用各自优势实现对伪装目标的精确定位。与其他最先进的伪装目标检测方法相比，DDFNet显示出优越的性能。此外，我们引入息肉分割作为所提出网络的下游任务，以展示DDFNet在解决现实问题方面的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Camouflaged object detection via dual domain fusion

In camouflaged object detection (COD), the goal of accurately and completely segmenting foreground objects from the background on a per-pixel basis is pursued by various network architectures. Despite their specialized module designs tailored for camouflaged object detection, most of these approaches operate primarily within the RGB domain to locate camouflaged objects. Given the intrinsic characteristics of camouflaged objects, solely segmenting them within the RGB domain often encounters significant interference and challenges. To address this issue and leverage the advantages of detecting camouflaged objects from the frequency domain, this paper introduces a Dual-Domain Fusion Network (DDFNet). By integrating frequency domain features with RGB domain features, DDFNet exploits the complementary strengths of both domains, achieving precise localization and segmentation of camouflaged objects. DDFNet is composed of three main modules. The MHFM performs group fusion of backbone features to enhance the representation of these features. The GFGM, based on a dual-phase architecture, extracts the camouflaged target hidden in the RGB domain by operating in the frequency domain. It achieves this by leveraging the detailed information contained in high-frequency features and the spatial information in low-frequency features, thereby obtaining a frequency-domain feature representation of the camouflaged target, which supplements the RGB domain features. Finally, through GHIM, DDFNet performs complementary fusion of the frequency-domain features and the RGB-domain features, utilizing their respective advantages to achieve precise localization of the camouflaged target. Compared to other state-of-the-art methods in camouflaged object detection, DDFNet demonstrates superior performance. Additionally, we introduce polyp segmentation as a downstream task for the proposed network to showcase the ability of DDFNet in solving real-world problems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Digital Signal Processing 工程技术-工程：电子与电气

CiteScore

5.30

自引率

17.20%

发文量

435

审稿时长

66 days

期刊介绍： Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal. The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as: • big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,