用于2d到3d视频转换的多层背景精灵模型

W. Lie, Chih-Hao Hu, Yi-Kai Chen, J. Chiang
{"title":"用于2d到3d视频转换的多层背景精灵模型","authors":"W. Lie, Chih-Hao Hu, Yi-Kai Chen, J. Chiang","doi":"10.1109/APSIPA.2017.8282033","DOIUrl":null,"url":null,"abstract":"This paper presents a technique for semi-automatic 2D-to-3D stereo video conversion, which was known to provide user intervention in segmenting foregrounds and assigning corresponding depth information for key frames and then get the depth maps for other non-key frames via automatic depth propagation. Our algorithm escapes from traditional depth propagation paradigm based on motion estimation and compensation. For foregrounds in non- key frames, object kernels standing for the most confident parts are identified first and then used as the seeds for graph-cut segmentation. Since the graph-cut segmentation for foregrounds is performed independently for each non-key frame, the results will be free of the limitation by objects' motion activity. For backgrounds, all video frames, after foregrounds being removed, are integrated into a common multi-layer background sprite model (ML-BSM) based on image registration algorithm. Users can then draw background depth profiles for the ML-BSM in a video-based manner (not frame-based), thus reducing the human efforts required significantly. Our ML-BSM algorithm is an extension of our prior work, BSM [8], aiming to solve the cases when the foreground and the background have a large depth variation or the camera has a substantial panning/rotating motion. Experiments show that the adoption of multi-layers BSM architecture and iterative foreground refinement based on BSM validation can improve the depth image quality significantly.","PeriodicalId":142091,"journal":{"name":"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Multi-layer background sprite model for 2D-to-3D video conversion\",\"authors\":\"W. Lie, Chih-Hao Hu, Yi-Kai Chen, J. Chiang\",\"doi\":\"10.1109/APSIPA.2017.8282033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a technique for semi-automatic 2D-to-3D stereo video conversion, which was known to provide user intervention in segmenting foregrounds and assigning corresponding depth information for key frames and then get the depth maps for other non-key frames via automatic depth propagation. Our algorithm escapes from traditional depth propagation paradigm based on motion estimation and compensation. For foregrounds in non- key frames, object kernels standing for the most confident parts are identified first and then used as the seeds for graph-cut segmentation. Since the graph-cut segmentation for foregrounds is performed independently for each non-key frame, the results will be free of the limitation by objects' motion activity. For backgrounds, all video frames, after foregrounds being removed, are integrated into a common multi-layer background sprite model (ML-BSM) based on image registration algorithm. Users can then draw background depth profiles for the ML-BSM in a video-based manner (not frame-based), thus reducing the human efforts required significantly. Our ML-BSM algorithm is an extension of our prior work, BSM [8], aiming to solve the cases when the foreground and the background have a large depth variation or the camera has a substantial panning/rotating motion. Experiments show that the adoption of multi-layers BSM architecture and iterative foreground refinement based on BSM validation can improve the depth image quality significantly.\",\"PeriodicalId\":142091,\"journal\":{\"name\":\"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APSIPA.2017.8282033\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIPA.2017.8282033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

本文提出了一种半自动二维到三维立体视频转换技术,该技术为用户提供干预,对关键帧分割前景并分配相应的深度信息,然后通过自动深度传播获得其他非关键帧的深度图。该算法摆脱了传统的基于运动估计和补偿的深度传播模式。对于非关键帧的前景,首先识别代表最可靠部分的对象核,然后将其用作图切分割的种子。由于前景的图切分割是对每个非关键帧独立执行的,因此结果将不受物体运动活动的限制。对于背景,去除前景后的所有视频帧,基于图像配准算法集成到一个通用的多层背景精灵模型(ML-BSM)中。然后,用户可以以基于视频的方式(而不是基于帧的方式)为ML-BSM绘制背景深度配置文件,从而大大减少了所需的人力。我们的ML-BSM算法是我们之前工作的扩展,BSM[8],旨在解决前景和背景有很大的深度变化或相机有很大的平移/旋转运动的情况。实验表明,采用多层BSM架构和基于BSM验证的迭代前景细化可以显著提高深度图像质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-layer background sprite model for 2D-to-3D video conversion
This paper presents a technique for semi-automatic 2D-to-3D stereo video conversion, which was known to provide user intervention in segmenting foregrounds and assigning corresponding depth information for key frames and then get the depth maps for other non-key frames via automatic depth propagation. Our algorithm escapes from traditional depth propagation paradigm based on motion estimation and compensation. For foregrounds in non- key frames, object kernels standing for the most confident parts are identified first and then used as the seeds for graph-cut segmentation. Since the graph-cut segmentation for foregrounds is performed independently for each non-key frame, the results will be free of the limitation by objects' motion activity. For backgrounds, all video frames, after foregrounds being removed, are integrated into a common multi-layer background sprite model (ML-BSM) based on image registration algorithm. Users can then draw background depth profiles for the ML-BSM in a video-based manner (not frame-based), thus reducing the human efforts required significantly. Our ML-BSM algorithm is an extension of our prior work, BSM [8], aiming to solve the cases when the foreground and the background have a large depth variation or the camera has a substantial panning/rotating motion. Experiments show that the adoption of multi-layers BSM architecture and iterative foreground refinement based on BSM validation can improve the depth image quality significantly.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信