Camouflaged Object Detection with Feature Grafting and Distractor Aware

2023 IEEE International Conference on Multimedia and Expo (ICME) Pub Date : 2023-07-01 DOI:10.1109/ICME55011.2023.00419

Yuxuan Song, Xinyue Li, Lin Qi

{"title":"Camouflaged Object Detection with Feature Grafting and Distractor Aware","authors":"Yuxuan Song, Xinyue Li, Lin Qi","doi":"10.1109/ICME55011.2023.00419","DOIUrl":null,"url":null,"abstract":"The task of Camouflaged Object Detection (COD) aims to accurately segment camouflaged objects that integrated into the environment, which is more challenging than ordinary detection as the texture between the target and background is visually indistinguishable. In this paper, we proposed a novel Feature Grafting and Distractor Aware network (FDNet) to handle the COD task. Specifically, we use CNN and Transformer to encode multi-scale images in parallel. In order to better explore the advantages of the two encoders, we design a cross-attention-based Feature Grafting Module to graft features extracted from Transformer branch into CNN branch, after which the features are aggregated in the Feature Fusion Module. A Distractor Aware Module is designed to explicitly model the two possible distractor in the COD task to refine the coarse camouflage map. We also proposed the largest artificial camouflaged object dataset which contains 2000 images with annotations, named ACOD2K. We conducted extensive experiments on four widely used benchmark datasets and the ACOD2K dataset. The results show that our method significantly outperforms other state-of-the-art methods. The code and the ACOD2K will be available at https://github.com/syxvision/FDNet.","PeriodicalId":321830,"journal":{"name":"2023 IEEE International Conference on Multimedia and Expo (ICME)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Multimedia and Expo (ICME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICME55011.2023.00419","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The task of Camouflaged Object Detection (COD) aims to accurately segment camouflaged objects that integrated into the environment, which is more challenging than ordinary detection as the texture between the target and background is visually indistinguishable. In this paper, we proposed a novel Feature Grafting and Distractor Aware network (FDNet) to handle the COD task. Specifically, we use CNN and Transformer to encode multi-scale images in parallel. In order to better explore the advantages of the two encoders, we design a cross-attention-based Feature Grafting Module to graft features extracted from Transformer branch into CNN branch, after which the features are aggregated in the Feature Fusion Module. A Distractor Aware Module is designed to explicitly model the two possible distractor in the COD task to refine the coarse camouflage map. We also proposed the largest artificial camouflaged object dataset which contains 2000 images with annotations, named ACOD2K. We conducted extensive experiments on four widely used benchmark datasets and the ACOD2K dataset. The results show that our method significantly outperforms other state-of-the-art methods. The code and the ACOD2K will be available at https://github.com/syxvision/FDNet.

查看原文本刊更多论文

基于特征嫁接和干扰感知的伪装目标检测

伪装目标检测(COD)的任务是准确分割融入环境的伪装目标，这比普通检测更具挑战性，因为目标和背景之间的纹理在视觉上难以区分。本文提出了一种新的特征嫁接和干扰感知网络(FDNet)来处理COD任务。具体来说，我们使用CNN和Transformer对多尺度图像进行并行编码。为了更好地挖掘两种编码器的优势，我们设计了一个基于交叉关注的特征嫁接模块，将Transformer分支提取的特征嫁接到CNN分支中，然后在特征融合模块中进行特征聚合。设计了干扰物感知模块，对COD任务中可能存在的两种干扰物进行显式建模，以细化粗伪装图。我们还提出了最大的人工伪装对象数据集，包含2000张带有注释的图像，命名为ACOD2K。我们在四个广泛使用的基准数据集和ACOD2K数据集上进行了广泛的实验。结果表明，我们的方法明显优于其他最先进的方法。代码和ACOD2K将在https://github.com/syxvision/FDNet上提供。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE International Conference on Multimedia and Expo (ICME)

自引率

0.00%

发文量