增强视频对象分割的边界

Proceedings of the 2nd International Conference on Vision, Image and Signal Processing Pub Date : 2018-08-27 DOI:10.1145/3271553.3271581

Qi Zhang, Xiaoqiang Lu, Yuan Yuan

{"title":"增强视频对象分割的边界","authors":"Qi Zhang, Xiaoqiang Lu, Yuan Yuan","doi":"10.1145/3271553.3271581","DOIUrl":null,"url":null,"abstract":"Video object segmentation aims to separate objects from background in successive video sequence accurately. It is a challenging task as the huge variance in object regions and similarity between object and background. Among previous methods, inner region of an object can be easily separated from background while the region around object boundary is often classified improperly. To address this problem, a novel video object segmentation method is proposed to enhance the object boundary by integrating video supervoxel into Convolutional Neural Network (CNN) model. Supervoxel is exploited in our method for its ability of preserving spatial details. The proposed method can be divided into four steps: 1) convolutional feature of video is extracted with CNN model; 2) supervoxel feature is constructed through averaging the convolutional features within each supervoxel to preserve spatial details of video; 3) the supervoxel feature and original convolutional feature are fused to construct video representation; 4) a softmax classifier is trained based on video representation to classify each pixel in video. The proposed method is evaluated both on DAVIS and Youtube-Objects datasets. Experimental results show that by considering supervoxel with spatial details, the proposed method can achieve impressive performance for video object segmentation through enhancing object boundary.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing Boundary for Video Object Segmentation\",\"authors\":\"Qi Zhang, Xiaoqiang Lu, Yuan Yuan\",\"doi\":\"10.1145/3271553.3271581\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Video object segmentation aims to separate objects from background in successive video sequence accurately. It is a challenging task as the huge variance in object regions and similarity between object and background. Among previous methods, inner region of an object can be easily separated from background while the region around object boundary is often classified improperly. To address this problem, a novel video object segmentation method is proposed to enhance the object boundary by integrating video supervoxel into Convolutional Neural Network (CNN) model. Supervoxel is exploited in our method for its ability of preserving spatial details. The proposed method can be divided into four steps: 1) convolutional feature of video is extracted with CNN model; 2) supervoxel feature is constructed through averaging the convolutional features within each supervoxel to preserve spatial details of video; 3) the supervoxel feature and original convolutional feature are fused to construct video representation; 4) a softmax classifier is trained based on video representation to classify each pixel in video. The proposed method is evaluated both on DAVIS and Youtube-Objects datasets. Experimental results show that by considering supervoxel with spatial details, the proposed method can achieve impressive performance for video object segmentation through enhancing object boundary.\",\"PeriodicalId\":414782,\"journal\":{\"name\":\"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3271553.3271581\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3271553.3271581","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

视频目标分割的目的是将连续视频序列中的目标与背景准确分离。由于目标区域的巨大差异和目标与背景的相似性，这是一项具有挑战性的任务。在以往的方法中，目标的内部区域很容易与背景分离，而目标边界附近的区域往往分类不正确。为了解决这一问题，提出了一种新的视频目标分割方法，通过将视频超体素与卷积神经网络(CNN)模型相结合来增强目标边界。我们的方法利用了超体素，因为它具有保留空间细节的能力。该方法分为四个步骤:1)利用CNN模型提取视频的卷积特征;2)对每个超体素内的卷积特征进行平均，构建超体素特征，以保持视频的空间细节;3)融合超体素特征和原始卷积特征构建视频表示;4)基于视频表示训练softmax分类器，对视频中的每个像素进行分类。在DAVIS和Youtube-Objects数据集上对该方法进行了评估。实验结果表明，该方法考虑了具有空间细节的超体素，通过增强目标边界，可以取得较好的分割效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Enhancing Boundary for Video Object Segmentation

Video object segmentation aims to separate objects from background in successive video sequence accurately. It is a challenging task as the huge variance in object regions and similarity between object and background. Among previous methods, inner region of an object can be easily separated from background while the region around object boundary is often classified improperly. To address this problem, a novel video object segmentation method is proposed to enhance the object boundary by integrating video supervoxel into Convolutional Neural Network (CNN) model. Supervoxel is exploited in our method for its ability of preserving spatial details. The proposed method can be divided into four steps: 1) convolutional feature of video is extracted with CNN model; 2) supervoxel feature is constructed through averaging the convolutional features within each supervoxel to preserve spatial details of video; 3) the supervoxel feature and original convolutional feature are fused to construct video representation; 4) a softmax classifier is trained based on video representation to classify each pixel in video. The proposed method is evaluated both on DAVIS and Youtube-Objects datasets. Experimental results show that by considering supervoxel with spatial details, the proposed method can achieve impressive performance for video object segmentation through enhancing object boundary.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2nd International Conference on Vision, Image and Signal Processing

自引率

0.00%

发文量