Multi-layer background sprite model for 2D-to-3D video conversion

2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) Pub Date : 2017-12-01 DOI:10.1109/APSIPA.2017.8282033

W. Lie, Chih-Hao Hu, Yi-Kai Chen, J. Chiang

{"title":"Multi-layer background sprite model for 2D-to-3D video conversion","authors":"W. Lie, Chih-Hao Hu, Yi-Kai Chen, J. Chiang","doi":"10.1109/APSIPA.2017.8282033","DOIUrl":null,"url":null,"abstract":"This paper presents a technique for semi-automatic 2D-to-3D stereo video conversion, which was known to provide user intervention in segmenting foregrounds and assigning corresponding depth information for key frames and then get the depth maps for other non-key frames via automatic depth propagation. Our algorithm escapes from traditional depth propagation paradigm based on motion estimation and compensation. For foregrounds in non- key frames, object kernels standing for the most confident parts are identified first and then used as the seeds for graph-cut segmentation. Since the graph-cut segmentation for foregrounds is performed independently for each non-key frame, the results will be free of the limitation by objects' motion activity. For backgrounds, all video frames, after foregrounds being removed, are integrated into a common multi-layer background sprite model (ML-BSM) based on image registration algorithm. Users can then draw background depth profiles for the ML-BSM in a video-based manner (not frame-based), thus reducing the human efforts required significantly. Our ML-BSM algorithm is an extension of our prior work, BSM [8], aiming to solve the cases when the foreground and the background have a large depth variation or the camera has a substantial panning/rotating motion. Experiments show that the adoption of multi-layers BSM architecture and iterative foreground refinement based on BSM validation can improve the depth image quality significantly.","PeriodicalId":142091,"journal":{"name":"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIPA.2017.8282033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

This paper presents a technique for semi-automatic 2D-to-3D stereo video conversion, which was known to provide user intervention in segmenting foregrounds and assigning corresponding depth information for key frames and then get the depth maps for other non-key frames via automatic depth propagation. Our algorithm escapes from traditional depth propagation paradigm based on motion estimation and compensation. For foregrounds in non- key frames, object kernels standing for the most confident parts are identified first and then used as the seeds for graph-cut segmentation. Since the graph-cut segmentation for foregrounds is performed independently for each non-key frame, the results will be free of the limitation by objects' motion activity. For backgrounds, all video frames, after foregrounds being removed, are integrated into a common multi-layer background sprite model (ML-BSM) based on image registration algorithm. Users can then draw background depth profiles for the ML-BSM in a video-based manner (not frame-based), thus reducing the human efforts required significantly. Our ML-BSM algorithm is an extension of our prior work, BSM [8], aiming to solve the cases when the foreground and the background have a large depth variation or the camera has a substantial panning/rotating motion. Experiments show that the adoption of multi-layers BSM architecture and iterative foreground refinement based on BSM validation can improve the depth image quality significantly.

查看原文本刊更多论文

用于2d到3d视频转换的多层背景精灵模型

本文提出了一种半自动二维到三维立体视频转换技术，该技术为用户提供干预，对关键帧分割前景并分配相应的深度信息，然后通过自动深度传播获得其他非关键帧的深度图。该算法摆脱了传统的基于运动估计和补偿的深度传播模式。对于非关键帧的前景，首先识别代表最可靠部分的对象核，然后将其用作图切分割的种子。由于前景的图切分割是对每个非关键帧独立执行的，因此结果将不受物体运动活动的限制。对于背景，去除前景后的所有视频帧，基于图像配准算法集成到一个通用的多层背景精灵模型(ML-BSM)中。然后，用户可以以基于视频的方式(而不是基于帧的方式)为ML-BSM绘制背景深度配置文件，从而大大减少了所需的人力。我们的ML-BSM算法是我们之前工作的扩展，BSM[8]，旨在解决前景和背景有很大的深度变化或相机有很大的平移/旋转运动的情况。实验表明，采用多层BSM架构和基于BSM验证的迭代前景细化可以显著提高深度图像质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

自引率

0.00%

发文量