2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON) Latest Publications

Depth-based image completion for view synthesis
J. Gautier, O. Le Meur, C. Guillemot
{"title":"Depth-based image completion for view synthesis","authors":"J. Gautier, O. Le Meur, C. Guillemot","doi":"10.1109/3DTV.2011.5877193","DOIUrl":"https://doi.org/10.1109/3DTV.2011.5877193","url":null,"abstract":"This paper describes a depth-based inpainting algorithm which efficiently handles disocclusion occurring on virtual viewpoint rendering. A single reference view and a set of depth maps are used in the proposed approach. The method not only deals with small disocclusion filling related to small camera baseline, but also manages to fill in larger disocclusions in distant synthesized views. This relies on a coherent tensor-based color and geometry structure propagation. The depth is used to drive the filling order, while enforcing the structure diffusion from similar candidate-patches. By acting on patch prioritization, selection and combination, the completion of distant synthesized views allows a consistent and realistic rendering of virtual viewpoints.","PeriodicalId":158764,"journal":{"name":"2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130710389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 99
Spatio-temporal consistent depth maps from multi-view video
Marcus Mueller, Frederik Zilly, C. Riechert, P. Kauff
{"title":"Spatio-temporal consistent depth maps from multi-view video","authors":"Marcus Mueller, Frederik Zilly, C. Riechert, P. Kauff","doi":"10.1109/3DTV.2011.5877221","DOIUrl":"https://doi.org/10.1109/3DTV.2011.5877221","url":null,"abstract":"The demand for high quality depth maps from stereo and multi-camera videos increases constantly. The main application for these depth maps is rendering new perspectives of the captured scene by means of Depth Image Based Rendering (DIBR). Accurate depth maps are the linchpin of DIBR. On the basis of a four-camera set-up, we show that combining hybrid recursive matching with motion estimation, cross-bilateral post-processing and mutual depth map fusion produces spatio-temporal consistent depth maps appropriate for artifact-free view synthesis.","PeriodicalId":158764,"journal":{"name":"2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126553789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 13
A novel depth map quality metric and its usage in depth map coding
D. D. De Silva, W. Fernando, S. Worrall, A. Kondoz
{"title":"A novel depth map quality metric and its usage in depth map coding","authors":"D. D. De Silva, W. Fernando, S. Worrall, A. Kondoz","doi":"10.1109/3DTV.2011.5877203","DOIUrl":"https://doi.org/10.1109/3DTV.2011.5877203","url":null,"abstract":"While the depth maps of 3D video are represented as luminance images, they are used to aid rendering of novel views and are not viewed by an end user. Therefore, metrics that measure the quality of images intended for end-user viewing do not necessarily reflect the quality of depth maps in terms of their ability to render views. This paper investigates the relationship between the quality of the rendered views and different quality measures of the depth map. A novel depth map quality metric is proposed based on a distortion model that approximates rendering errors due to pixel errors in the depth map. The proposed depth map quality metric correlates very well with the quality of the rendered views, as compared to the PSNR and SSIM of the depth map. The application of the proposed depth map quality metric is further illustrated by incorporating the metric at the encoding mode selection stage of a video encoder. 
Experimental results suggest that with the proposed encoding mode selection scheme bit rate savings of up to 30% can be achieved compared to traditional encoding mode selection scheme based on sum of squared errors.","PeriodicalId":158764,"journal":{"name":"2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117241255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 11
A graph-based approach for latency modeling and optimization in multiview video encoding
P. Carballeira, J. Cabrera, Antonio Ortega, F. Jaureguizar, N. García
{"title":"A graph-based approach for latency modeling and optimization in multiview video encoding","authors":"P. Carballeira, J. Cabrera, Antonio Ortega, F. Jaureguizar, N. García","doi":"10.1109/3DTV.2011.5877204","DOIUrl":"https://doi.org/10.1109/3DTV.2011.5877204","url":null,"abstract":"We present a novel framework for encoding latency analysis of arbitrary multiview video coding prediction structures. This framework avoids the need to consider a specific encoder architecture for encoding latency analysis by assuming an unlimited processing capacity on the multiview encoder. Under this assumption, only the influence of the prediction structure and the processing times have to be considered, and the encoding latency is solved systematically by means of a graph model. The results obtained with this model are valid for a multiview encoder with sufficient processing capacity and serve as a lower bound otherwise. Furthermore, with the objective of low latency encoder design with low penalty on rate-distortion performance, the graph model allows us to identify the prediction relationships that add higher encoding latency to the encoder. 
Experimental results for JMVM prediction structures illustrate how low latency prediction structures with a low rate-distortion penalty can be derived in a systematic manner using the new model.","PeriodicalId":158764,"journal":{"name":"2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127778411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 3
Audio personalization using head related transfer function in 3DTV
Yongqing Tang, Yong Fang, Qinghua Huang
{"title":"Audio personalization using head related transfer function in 3DTV","authors":"Yongqing Tang, Yong Fang, Qinghua Huang","doi":"10.1109/3DTV.2011.5877191","DOIUrl":"https://doi.org/10.1109/3DTV.2011.5877191","url":null,"abstract":"In 3DTV, head-related transfer functions (HRTFs) can enhance the listener's sense of immersion because they contain spatial information about the sound source. Audio can be customized by using a personalized HRTF. Consequently, listening distortions arise if the HRTFs do not match the anthropometric parameters of the individual listener. In this paper, a personalization method is proposed to customize individual HRTFs based on non-negative matrix factorization (NMF) and support vector regression (SVR). The anthropometric parameters are selected, and the high-dimensional HRTFs are decomposed into a low-dimensional matrix using NMF. A nonlinear regression model between the selected anthropometric parameters and the low-dimensional matrix is derived by SVR. Experimental results demonstrate that a personalized HRTF performs better than using the same HRTF for different listeners in 3DTV audio.","PeriodicalId":158764,"journal":{"name":"2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"132 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133616079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 4
Microscopic and macroscopic 3D imaging and display by integral imaging
M. Martínez-Corral, H. Navarro, G. Saavedra, B. Javidi
{"title":"Microscopic and macroscopic 3D imaging and display by integral imaging","authors":"M. Martínez-Corral, H. Navarro, G. Saavedra, B. Javidi","doi":"10.1109/3DTV.2011.5877150","DOIUrl":"https://doi.org/10.1109/3DTV.2011.5877150","url":null,"abstract":"Integral imaging is a rising 3D imaging technique that can be considered the incoherent version of holography. In integral imaging the multiperspective information of 3D scenes is stored in a 2D picture. This picture, composed of a set of elemental images, is obtained through a 2D array of microlenses. The elemental-images set can be used for many purposes. One is the display of 3D color scenes to audiences of more than one person. Another is 3D display, with full parallax, on personal monitors, such as the screen of a smartphone, a tablet, or the monitor used by a surgeon in an endoscopic operation. Other important types of applications are connected with the topographic reconstruction, slice by slice, of the 3D scene. This is especially important in the case of microscopy applications. In this talk, we review the important capacities of integral imaging and the results obtained by our group, the 3D Imaging & Display Lab, in the field of integral imaging.","PeriodicalId":158764,"journal":{"name":"2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126603237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 2
Reduced complexity multi-view video coding scheme for 2D camera arrays
Aykut Avci, J. De Cock, R. Beernaert, J. De Smet, Lawrence Bogaert, Y. Meuret, H. Thienpont, P. Lambert, H. De Smet
{"title":"Reduced complexity multi-view video coding scheme for 2D camera arrays","authors":"Aykut Avci, J. De Cock, R. Beernaert, J. De Smet, Lawrence Bogaert, Y. Meuret, H. Thienpont, P. Lambert, H. De Smet","doi":"10.1109/3DTV.2011.5877178","DOIUrl":"https://doi.org/10.1109/3DTV.2011.5877178","url":null,"abstract":"As the number of views comprised in multi-view videos increases, some challenging problems emerge. Besides the bandwidth problems caused by the huge data flow, the calculation power needed by the multi-view encoder is an even higher burden than that of a single view encoder. In this paper, a complexity efficient way to encode a single-time instant of 5×3 view frames is presented. Some of the P frames known from the traditional encoding schemes have been replaced by a new type of frame called the D frame, in which the disparity vector of a block in a view can be derived from the other views due to the strong geometrical correspondence existing between adjacent views. Experimental results show that 20.2% complexity gain is achieved without compromising quality and bit-rate by wisely selecting threshold values at different QPs.","PeriodicalId":158764,"journal":{"name":"2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"09 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127228877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
Exploiting depth information for fast motion and disparity estimation in Multi-view Video Coding
B. Micallef, C. J. Debono, R. Farrugia
{"title":"Exploiting depth information for fast motion and disparity estimation in Multi-view Video Coding","authors":"B. Micallef, C. J. Debono, R. Farrugia","doi":"10.1109/3DTV.2011.5877170","DOIUrl":"https://doi.org/10.1109/3DTV.2011.5877170","url":null,"abstract":"Multi-view Video Coding (MVC) employs both motion and disparity estimation within the encoding process. These provide a significant increase in coding efficiency at the expense of a substantial increase in computational requirements. This paper presents a fast motion and disparity estimation technique that utilizes the multi-view geometry together with the depth information and the corresponding encoded motion vectors from the reference view, to produce more reliable motion and disparity vector predictors for the current view. This allows for a smaller search area which reduces the computational cost of the multi-view encoding system. Experimental results confirm that the proposed techniques can provide a speed-up gain of up to 4.2 times, with a negligible loss in the rate-distortion performance for both the color and the depth MVC.","PeriodicalId":158764,"journal":{"name":"2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115451732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 11
A novel 2D-to-3D scheme by visual attention and occlusion analysis
Jiahong Zhang, You Yang, Qionghai Dai
{"title":"A novel 2D-to-3D scheme by visual attention and occlusion analysis","authors":"Jiahong Zhang, You Yang, Qionghai Dai","doi":"10.1109/3DTV.2011.5877189","DOIUrl":"https://doi.org/10.1109/3DTV.2011.5877189","url":null,"abstract":"We propose a novel 2D to 3D scheme by considering perceived 3D experience caused by occlusion and visual attention. In this scheme, an initial depth model, the saliency map of visual attention and occlusion analysis are integrated in depth calculation. Meanwhile, characteristics of the human visual system are also considered as weight factors. Then, depth normalization and refining are implemented. The experimental results show that the final depth map presents proper depth order, details of objects and interesting region of 3D perception, and that the perceived 3D experiences for the converted images and videos are satisfactory.","PeriodicalId":158764,"journal":{"name":"2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114574939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 12
Interactive quality assessment for asymmetric coding of 3D video
N. Ozbek, Gizem Ertan, Oktay Karakuş
{"title":"Interactive quality assessment for asymmetric coding of 3D video","authors":"N. Ozbek, Gizem Ertan, Oktay Karakuş","doi":"10.1109/3DTV.2011.5877199","DOIUrl":"https://doi.org/10.1109/3DTV.2011.5877199","url":null,"abstract":"In stereoscopic 3D video, it is known that humans can perceive high quality 3D video provided that one of the views is of high quality. Hence, in stereo video encoding the best overall rate vs. perceived-distortion performance may be achieved by asymmetric coding, that is, by reducing the spatial or quantization resolution of the auxiliary view while keeping the reference view in full resolution. In this paper, we propose an interactive quality assessment method to evaluate perceptual quality for asymmetric coding of 3D video. Subjective Evaluation of Stereo VIdeo Quality (SESVIQ) is proposed as a stereo extension of the SAMVIQ (Subjective Assessment Methodology for VIdeo Quality) methodology of EBU. We conducted subjective experiments to compare different algorithms such as symmetric over asymmetric rate allocation or spatial over SNR (signal-to-noise ratio) scaling of the auxiliary view. Test results show that the interactive test methodology gives more reliable results than conventional sequential test methodology, especially for low bitrate scenarios.","PeriodicalId":158764,"journal":{"name":"2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122563978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 6