A novel hybrid model for video salient object detection
Jinping Cai, Sheng Lin
2020 International Conference on Computer Engineering and Intelligent Control (ICCEIC), November 2020
DOI: 10.1109/ICCEIC51584.2020.00059
Abstract
At present, few video salient object detection models can simulate attention behavior in dynamic scenes. Moreover, owing to the scarcity of video salient object detection datasets and interference from camera motion, existing models fail to capture the overall shape and precise boundaries of targets. Hence, we propose a new hybrid model, called NHM, which connects an attention feedback network with a pyramid dilated convolution module to obtain rich spatial saliency information, and then uses a saliency-shift-aware ConvLSTM module to learn temporal saliency information. Instead of feeding the attention feedback network's output directly into the pyramid dilated convolution module, we extract feature maps at different scales from five decoder blocks and pass them to the pyramid dilated convolution module, making better use of multi-scale features. Furthermore, a new hybrid loss function incorporating a boundary-enhanced loss is proposed to obtain fine boundaries. Experimental results show that the proposed model performs on par with or better than state-of-the-art models.
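The abstract does not give the exact form of the hybrid loss, but the idea of combining a region-level loss with a boundary-enhanced term can be sketched as below. This is a minimal NumPy illustration under stated assumptions: we assume the region term is per-pixel binary cross-entropy, approximate ground-truth boundaries with the gradient magnitude of the mask, and re-apply cross-entropy on boundary pixels with a weighting factor `lam`. None of these choices (nor the names `hybrid_loss`, `boundary_map`, `lam`) come from the paper; they are hypothetical stand-ins for the boundary-enhanced loss it describes.

```python
import numpy as np

def bce_loss(pred, gt, eps=1e-7):
    """Mean binary cross-entropy between a predicted saliency map and a binary mask."""
    pred = np.clip(pred, eps, 1 - eps)
    return -np.mean(gt * np.log(pred) + (1 - gt) * np.log(1 - pred))

def boundary_map(mask):
    """Approximate object boundaries as pixels with non-zero mask gradient (assumed form)."""
    gy, gx = np.gradient(mask.astype(float))
    return (np.hypot(gx, gy) > 0).astype(float)

def hybrid_loss(pred, gt, lam=1.0):
    """Region BCE plus a boundary-enhanced BCE term restricted to boundary pixels.

    `lam` weights the boundary term; both the restriction and the weight are
    assumptions, not the paper's published formulation.
    """
    region = bce_loss(pred, gt)
    b = boundary_map(gt)
    boundary = bce_loss(pred[b > 0], gt[b > 0]) if b.sum() > 0 else 0.0
    return region + lam * boundary
```

In this sketch, errors near object contours are penalized twice (once in the region term, once in the boundary term), which is one common way a boundary-aware loss encourages sharper edges than plain cross-entropy alone.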