Flexible OpenCL accelerated disparity estimation for video communication applications

C. Weigel, N. Treutner
Published in: 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)
Publication date: 2011-05-16
DOI: 10.1109/3DTV.2011.5877207 (https://doi.org/10.1109/3DTV.2011.5877207)
Citations: 2

Abstract

With broadband connections now widespread in private households, video chat over the Internet is no longer limited to business meetings. However, the usual camera configuration makes direct eye contact between the conversational partners impossible. This effect can be compensated for with virtual view synthesis methods based on disparity maps: the virtual camera is positioned “behind” the communication window and thus re-establishes eye contact. Obtaining a good disparity map is still a challenging problem and, for video communication, the estimation must run at interactive frame rates. In this paper we present optimized algorithms for disparity estimation that run in near real time. Recent developments in consumer hardware allow complex eye gaze correction algorithms to be implemented with relatively inexpensive off-the-shelf components. We employ the newly introduced OpenCL framework and present an implementation of several optimized algorithms on a graphics processing unit (GPU). Our implementation supports different methods for cost estimation and aggregation, which can be combined flexibly. We present a method for efficiently implementing a dynamic programming approach on the GPU. Our contribution makes it possible to change algorithm parameters interactively and get instant visual feedback, which is crucial in algorithm development and parameter tuning. We also show first results of virtual views that re-establish eye contact.
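
To make the idea of freely combinable cost-estimation and aggregation stages more concrete, the following is a minimal CPU-only sketch in Python. It is not the authors' OpenCL implementation, and it does not include their dynamic programming step. All names (`abs_diff`, `sq_diff`, `box_aggregate`, `estimate_disparity`) are hypothetical, and the final winner-take-all selection is an assumption chosen for brevity. It assumes rectified grayscale images, so that a pixel at column x in the left image appears at column x - d in the right image.

```python
from typing import Callable, List

Image = List[List[float]]  # row-major grayscale image (hypothetical type alias)


def abs_diff(a: float, b: float) -> float:
    """Per-pixel matching cost: absolute intensity difference."""
    return abs(a - b)


def sq_diff(a: float, b: float) -> float:
    """Alternative per-pixel matching cost: squared intensity difference."""
    return (a - b) ** 2


def cost_volume(left: Image, right: Image, max_disp: int,
                cost: Callable[[float, float], float]) -> List[Image]:
    """volume[d][y][x] = cost(left[y][x], right[y][x - d]).

    Candidates that would read outside the right image get infinite cost.
    """
    h, w = len(left), len(left[0])
    return [
        [[cost(left[y][x], right[y][x - d]) if x - d >= 0 else float("inf")
          for x in range(w)] for y in range(h)]
        for d in range(max_disp + 1)
    ]


def box_aggregate(costs: Image, radius: int) -> Image:
    """Average the costs over a (2*radius+1)^2 window, clamped at the borders."""
    h, w = len(costs), len(costs[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            total, n = 0.0, 0
            for yy in range(max(0, y - radius), min(h, y + radius + 1)):
                for xx in range(max(0, x - radius), min(w, x + radius + 1)):
                    total += costs[yy][xx]
                    n += 1
            out[y][x] = total / n
    return out


def estimate_disparity(left: Image, right: Image, max_disp: int,
                       cost: Callable[[float, float], float] = abs_diff,
                       aggregate: Callable[[Image, int], Image] = box_aggregate,
                       radius: int = 1) -> List[List[int]]:
    """Combine a cost function and an aggregation step, then pick the
    lowest-cost disparity per pixel (winner-take-all)."""
    volume = [aggregate(s, radius)
              for s in cost_volume(left, right, max_disp, cost)]
    h, w = len(left), len(left[0])
    return [[min(range(max_disp + 1), key=lambda d: volume[d][y][x])
             for x in range(w)] for y in range(h)]
```

Because the cost function and the aggregation step are passed as parameters, they can be swapped independently, for example `estimate_disparity(left, right, 4, cost=sq_diff, radius=0)`. In a GPU version each stage would typically map to its own kernel, but that mapping is not shown here.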