利用多视角RGB相机阵列实现实时三维可视化。

IF 1.6 4区 计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS
Jianwei Ke, Alex J Watras, Jae-Jun Kim, Hewei Liu, Hongrui Jiang, Yu Hen Hu
{"title":"利用多视角RGB相机阵列实现实时三维可视化。","authors":"Jianwei Ke,&nbsp;Alex J Watras,&nbsp;Jae-Jun Kim,&nbsp;Hewei Liu,&nbsp;Hongrui Jiang,&nbsp;Yu Hen Hu","doi":"10.1007/s11265-021-01729-0","DOIUrl":null,"url":null,"abstract":"<p><p>A real-time 3D visualization (RT3DV) system using a multiview RGB camera array is presented. RT3DV can process multiple synchronized video streams to produce a stereo video of a dynamic scene from a chosen view angle. Its design objective is to facilitate 3D visualization at the video frame rate with good viewing quality. To facilitate 3D vision, RT3DV estimates and updates a surface mesh model formed directly from a set of sparse key points. The 3D coordinates of these key points are estimated from matching 2D key points across multiview video streams with the aid of epipolar geometry and trifocal tensor. To capture the scene dynamics, 2D key points in individual video streams are tracked between successive frames. We implemented a proof of concept RT3DV system tasked to process five synchronous video streams acquired by an RGB camera array. It achieves a processing speed of 44 milliseconds per frame and a peak signal to noise ratio (PSNR) of 15.9 dB from a viewpoint coinciding with a reference view. As a comparison, an image-based MVS algorithm utilizing a dense point cloud model and frame by frame feature detection and matching will require 7 seconds to render a frame and yield a reference view PSNR of 16.3 dB.</p>","PeriodicalId":50050,"journal":{"name":"Journal of Signal Processing Systems for Signal Image and Video Technology","volume":null,"pages":null},"PeriodicalIF":1.6000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9159639/pdf/nihms-1776819.pdf","citationCount":"2","resultStr":"{\"title\":\"Towards real-time 3D visualization with multiview RGB camera array.\",\"authors\":\"Jianwei Ke,&nbsp;Alex J Watras,&nbsp;Jae-Jun Kim,&nbsp;Hewei Liu,&nbsp;Hongrui Jiang,&nbsp;Yu Hen Hu\",\"doi\":\"10.1007/s11265-021-01729-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>A real-time 3D visualization (RT3DV) system using a multiview RGB camera array is presented. RT3DV can process multiple synchronized video streams to produce a stereo video of a dynamic scene from a chosen view angle. Its design objective is to facilitate 3D visualization at the video frame rate with good viewing quality. To facilitate 3D vision, RT3DV estimates and updates a surface mesh model formed directly from a set of sparse key points. The 3D coordinates of these key points are estimated from matching 2D key points across multiview video streams with the aid of epipolar geometry and trifocal tensor. To capture the scene dynamics, 2D key points in individual video streams are tracked between successive frames. We implemented a proof of concept RT3DV system tasked to process five synchronous video streams acquired by an RGB camera array. It achieves a processing speed of 44 milliseconds per frame and a peak signal to noise ratio (PSNR) of 15.9 dB from a viewpoint coinciding with a reference view. As a comparison, an image-based MVS algorithm utilizing a dense point cloud model and frame by frame feature detection and matching will require 7 seconds to render a frame and yield a reference view PSNR of 16.3 dB.</p>\",\"PeriodicalId\":50050,\"journal\":{\"name\":\"Journal of Signal Processing Systems for Signal Image and Video Technology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2022-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9159639/pdf/nihms-1776819.pdf\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Signal Processing Systems for Signal Image and Video Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s11265-021-01729-0\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Signal Processing Systems for Signal Image and Video Technology","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11265-021-01729-0","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 2

摘要

提出了一种基于多视点RGB相机阵列的实时三维可视化系统。RT3DV可以处理多个同步视频流,从选定的视角生成动态场景的立体视频。其设计目标是在视频帧率和良好的观看质量下实现3D可视化。为了方便3D视觉,RT3DV估计和更新由一组稀疏关键点直接形成的表面网格模型。借助极极几何和三焦张量,通过匹配多视频流中的二维关键点来估计这些关键点的三维坐标。为了捕捉场景动态,在连续的帧之间跟踪单个视频流中的2D关键点。我们实现了一个概念验证RT3DV系统,任务是处理由RGB相机阵列获取的五个同步视频流。从与参考视图一致的视点来看,它实现了每帧44毫秒的处理速度和15.9 dB的峰值信噪比(PSNR)。相比之下,使用密集点云模型和逐帧特征检测和匹配的基于图像的MVS算法将需要7秒来渲染一帧,并产生16.3 dB的参考视图PSNR。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Towards real-time 3D visualization with multiview RGB camera array.

Towards real-time 3D visualization with multiview RGB camera array.

Towards real-time 3D visualization with multiview RGB camera array.

Towards real-time 3D visualization with multiview RGB camera array.

A real-time 3D visualization (RT3DV) system using a multiview RGB camera array is presented. RT3DV can process multiple synchronized video streams to produce a stereo video of a dynamic scene from a chosen view angle. Its design objective is to facilitate 3D visualization at the video frame rate with good viewing quality. To facilitate 3D vision, RT3DV estimates and updates a surface mesh model formed directly from a set of sparse key points. The 3D coordinates of these key points are estimated from matching 2D key points across multiview video streams with the aid of epipolar geometry and trifocal tensor. To capture the scene dynamics, 2D key points in individual video streams are tracked between successive frames. We implemented a proof of concept RT3DV system tasked to process five synchronous video streams acquired by an RGB camera array. It achieves a processing speed of 44 milliseconds per frame and a peak signal to noise ratio (PSNR) of 15.9 dB from a viewpoint coinciding with a reference view. As a comparison, an image-based MVS algorithm utilizing a dense point cloud model and frame by frame feature detection and matching will require 7 seconds to render a frame and yield a reference view PSNR of 16.3 dB.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.00
自引率
0.00%
发文量
106
审稿时长
4-8 weeks
期刊介绍: The Journal of Signal Processing Systems for Signal, Image, and Video Technology publishes research papers on the design and implementation of signal processing systems, with or without VLSI circuits. The journal is published in twelve issues and is distributed to engineers, researchers, and educators in the general field of signal processing systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信