Mscnet: Mask stepwise calibration network for camouflaged object detection

The Journal of Supercomputing Pub Date : 2024-07-27 DOI:10.1007/s11227-024-06376-3

Haishun Du, Minghao Zhang, Wenzhe Zhang, Kangyi Qiao

{"title":"Mscnet: Mask stepwise calibration network for camouflaged object detection","authors":"Haishun Du, Minghao Zhang, Wenzhe Zhang, Kangyi Qiao","doi":"10.1007/s11227-024-06376-3","DOIUrl":null,"url":null,"abstract":"<p>Camouflaged object detection (COD) aims to accurately segment camouflaged objects blending into the environment and is a challenging task. Most existing deep learning-based COD methods do not explicitly enhance the region information of camouflaged objects, nor do they use the region information for mask calibration. To solve this issue, we propose a novel mask stepwise calibration network (MSCNet) for camouflaged object detection, which achieves high-precision detection of camouflaged objects. Specifically, MSCNet consists of a region information enhancement encoder and a mask stepwise calibration decoder. In the encoder, we first utilize a PVT backbone network to extract different levels of features from camouflaged images. Then, we design a region information enhancement module to enhance the region information of camouflaged objects while suppressing the interference of background information by mining, embedding, and aggregating the region information in different levels of features. In the decoder, we first design a coarse mask generation module to generate coarse prediction masks of camouflaged objects by directly cross-fusing different levels of features extracted by the backbone. In addition, we also design a mask calibration module to calibrate coarse prediction masks of camouflaged objects using the region information of different levels of camouflaged objects as a guide. Extensive experimental results on four benchmark datasets show that our method effectively identifies camouflaged objects and surpasses most state-of-the-art COD methods.</p>","PeriodicalId":501596,"journal":{"name":"The Journal of Supercomputing","volume":"70 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Supercomputing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11227-024-06376-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Camouflaged object detection (COD) aims to accurately segment camouflaged objects blending into the environment and is a challenging task. Most existing deep learning-based COD methods do not explicitly enhance the region information of camouflaged objects, nor do they use the region information for mask calibration. To solve this issue, we propose a novel mask stepwise calibration network (MSCNet) for camouflaged object detection, which achieves high-precision detection of camouflaged objects. Specifically, MSCNet consists of a region information enhancement encoder and a mask stepwise calibration decoder. In the encoder, we first utilize a PVT backbone network to extract different levels of features from camouflaged images. Then, we design a region information enhancement module to enhance the region information of camouflaged objects while suppressing the interference of background information by mining, embedding, and aggregating the region information in different levels of features. In the decoder, we first design a coarse mask generation module to generate coarse prediction masks of camouflaged objects by directly cross-fusing different levels of features extracted by the backbone. In addition, we also design a mask calibration module to calibrate coarse prediction masks of camouflaged objects using the region information of different levels of camouflaged objects as a guide. Extensive experimental results on four benchmark datasets show that our method effectively identifies camouflaged objects and surpasses most state-of-the-art COD methods.

Abstract Image

查看原文本刊更多论文

Mscnet：用于伪装物体检测的掩码逐步校准网络

伪装物体检测（COD）旨在准确分割融入环境的伪装物体，是一项具有挑战性的任务。现有的基于深度学习的伪装物体检测方法大多没有明确增强伪装物体的区域信息，也没有使用区域信息进行掩膜校准。为了解决这个问题，我们提出了一种用于伪装物体检测的新型掩膜逐步校准网络（MSCNet），它可以实现伪装物体的高精度检测。具体来说，MSCNet 由区域信息增强编码器和掩膜逐步校准解码器组成。在编码器中，我们首先利用 PVT 骨干网络从伪装图像中提取不同层次的特征。然后，我们设计了一个区域信息增强模块，通过挖掘、嵌入和聚合不同层次特征中的区域信息，增强伪装物体的区域信息，同时抑制背景信息的干扰。在解码器中，我们首先设计了粗掩码生成模块，通过直接交叉融合骨干提取的不同层次的特征，生成伪装物体的粗预测掩码。此外，我们还设计了一个掩码校准模块，以不同层次伪装物体的区域信息为指导，校准伪装物体的粗预测掩码。在四个基准数据集上的大量实验结果表明，我们的方法能有效识别伪装物体，并超越了大多数最先进的 COD 方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The Journal of Supercomputing

自引率

0.00%

发文量