Fast repurposing of high-resolution stereo video content for mobile use

Ali Karaoglu, Bong-Ho Lee, A. Boev, W. Cheong, A. Gotchev
Real-Time Image and Video Processing, published 2012-05-04
DOI: 10.1117/12.924508 (https://doi.org/10.1117/12.924508)
Citations: 2

Abstract

3D video content is captured and created mainly in high resolution, targeting large cinema or home TV screens. For 3D mobile devices equipped with small auto-stereoscopic displays, such content has to be properly repurposed, preferably in real time. Repurposing requires not only spatial resizing but also properly maintaining the output stereo disparity, so that the result delivers realistic, pleasant and harmless 3D perception.

In this paper, we propose an approach that adapts the disparity range of the source video to the comfort disparity zone of the target display. To achieve this, we adapt the scale and the aspect ratio of the source video. We aim to maximize the disparity range of the retargeted content within the comfort zone and to minimize the letterboxing of the cropped content.

The proposed algorithm consists of five stages. First, we analyse the display profile, which characterises what 3D content can be comfortably observed on the target display. Then, we perform a fast disparity analysis of the input stereoscopic content. Instead of returning a dense disparity map, this analysis returns an estimate of the disparity statistics (min, max, mean and variance) per frame. Additionally, we detect scene cuts, where sharp transitions in disparity occur. Based on the estimated input and the desired output disparity ranges, we derive the optimal cropping parameters and the scale of the cropping window, which yield the targeted disparity range and minimize the area of cropped and letterboxed content. Once the rescaling and cropping parameters are known, we resample the content using spline-based, perceptually optimized resampling (anti-aliasing) kernels, which also have a very efficient computational structure. Perceptual optimization is achieved by adjusting the cut-off frequency of the anti-aliasing filter according to the throughput of the target display.
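The disparity-analysis and scale-selection stages described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: all function names are hypothetical, the scene-cut rule (a jump in mean disparity above a threshold) is an assumed simplification, and the scale computation assumes that pixel disparity scales linearly with uniform resizing and that the comfort zone contains zero disparity.

```python
from statistics import fmean, pvariance


def disparity_stats(samples):
    """Return (min, max, mean, variance) for one frame's disparity samples.

    `samples` is assumed to be a sparse list of per-feature disparities in
    pixels, standing in for the fast (non-dense) analysis in the abstract.
    """
    return min(samples), max(samples), fmean(samples), pvariance(samples)


def detect_scene_cuts(frame_means, jump_threshold):
    """Return indices of frames whose mean disparity jumps sharply.

    Assumed rule: a cut is flagged when the mean disparity differs from the
    previous frame's mean by more than `jump_threshold` pixels.
    """
    return [
        i
        for i in range(1, len(frame_means))
        if abs(frame_means[i] - frame_means[i - 1]) > jump_threshold
    ]


def max_comfortable_scale(d_min, d_max, comfort_min, comfort_max):
    """Largest uniform scale s keeping [s*d_min, s*d_max] inside the comfort zone.

    Assumes comfort_min <= 0 <= comfort_max and that disparity scales
    linearly with image size. Returns infinity if disparity imposes no limit.
    """
    limits = []
    if d_max > 0:
        limits.append(comfort_max / d_max)  # positive-disparity bound
    if d_min < 0:
        limits.append(comfort_min / d_min)  # negative-disparity bound
    return min(limits) if limits else float("inf")


def crop_window(src_w, src_h, dst_w, dst_h, scale):
    """Return (crop_w, crop_h, letterbox_fraction) for a given scale.

    The crop size is in source pixels. The letterbox fraction is the share of
    the target display left unused when the scaled source is smaller than
    the display.
    """
    crop_w = min(src_w, dst_w / scale)
    crop_h = min(src_h, dst_h / scale)
    used_area = (crop_w * scale) * (crop_h * scale)
    return crop_w, crop_h, 1.0 - used_area / (dst_w * dst_h)


if __name__ == "__main__":
    # Hypothetical numbers: a 1920x1080 source and an 800x480 mobile display.
    d_min, d_max, _, _ = disparity_stats([-40, -10, 0, 25, 60])
    scale = max_comfortable_scale(d_min, d_max, comfort_min=-4, comfort_max=6)
    print("scale:", scale)
    print("crop:", crop_window(1920, 1080, 800, 480, scale))
    print("cuts:", detect_scene_cuts([1.0, 1.2, 6.0, 6.1], jump_threshold=2.0))
```

In this simplified model a larger scale fills more of the display but widens the output disparity range, so the comfort zone caps the scale and any remaining display area is letterboxed. The abstract's joint optimization of cropping and scale is not reproduced here.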