照片逼真的流媒体自由视点视频

ACM SIGGRAPH 2023 Posters Pub Date : 2023-07-23 DOI:10.1145/3588028.3603666

Shaohui Jiao, Yuzhong Chen, Zhaoliang Liu, Danying Wang, Wen-Hui Zhou, Li Zhang, Yue Wang

{"title":"照片逼真的流媒体自由视点视频","authors":"Shaohui Jiao, Yuzhong Chen, Zhaoliang Liu, Danying Wang, Wen-Hui Zhou, Li Zhang, Yue Wang","doi":"10.1145/3588028.3603666","DOIUrl":null,"url":null,"abstract":"We present a novel free-viewpoint video(FVV) framework for capturing, processing and compressing the volumetric content for immersive VR/AR experience. Compared to previous FVV capture systems, we propose an easy-to-use multi-camera array consisting of mobile phones with time synchronization. In order to generate photo-realistic FVV results with sparse multi-camera input, we improve the novel view synthesis method by introducing visual hull guided neural representation, called VH-NeRF. Our VH-NeRF combines the advantages of both explicit models by traditional 3D reconstruction and the notable implicit representation of Neural Radiance Field. Each dynamic entity’s VH-NeRF is learned and supervised by the visual hull reconstructed data, and can be further edited for complex and large-scale dynamic scenes. Moreover, our FVV solution can do both effective compression and transmission on multi-perspective videos, as well as real-time rendering on consumer-grade hardware. To the best of our knowledge, our work is the first solution for photo-realistic FVV captured by sparse multi-camera array, and allow real-time live streaming of large-scale dynamic scenes for immersive VR and AR applications on mobile devices.","PeriodicalId":113397,"journal":{"name":"ACM SIGGRAPH 2023 Posters","volume":"85 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Photo-Realistic Streamable Free-Viewpoint Video\",\"authors\":\"Shaohui Jiao, Yuzhong Chen, Zhaoliang Liu, Danying Wang, Wen-Hui Zhou, Li Zhang, Yue Wang\",\"doi\":\"10.1145/3588028.3603666\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a novel free-viewpoint video(FVV) framework for capturing, processing and compressing the volumetric content for immersive VR/AR experience. Compared to previous FVV capture systems, we propose an easy-to-use multi-camera array consisting of mobile phones with time synchronization. In order to generate photo-realistic FVV results with sparse multi-camera input, we improve the novel view synthesis method by introducing visual hull guided neural representation, called VH-NeRF. Our VH-NeRF combines the advantages of both explicit models by traditional 3D reconstruction and the notable implicit representation of Neural Radiance Field. Each dynamic entity’s VH-NeRF is learned and supervised by the visual hull reconstructed data, and can be further edited for complex and large-scale dynamic scenes. Moreover, our FVV solution can do both effective compression and transmission on multi-perspective videos, as well as real-time rendering on consumer-grade hardware. To the best of our knowledge, our work is the first solution for photo-realistic FVV captured by sparse multi-camera array, and allow real-time live streaming of large-scale dynamic scenes for immersive VR and AR applications on mobile devices.\",\"PeriodicalId\":113397,\"journal\":{\"name\":\"ACM SIGGRAPH 2023 Posters\",\"volume\":\"85 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM SIGGRAPH 2023 Posters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3588028.3603666\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM SIGGRAPH 2023 Posters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3588028.3603666","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

我们提出了一种新的自由视点视频(FVV)框架，用于捕获，处理和压缩身临其境的VR/AR体验的体积内容。与以往的FVV捕获系统相比，我们提出了一种易于使用的多相机阵列，由手机组成，具有时间同步。为了在稀疏多摄像机输入下生成逼真的FVV结果，我们改进了一种新的视图合成方法，引入视觉船体引导神经表示(VH-NeRF)。我们的VH-NeRF结合了传统3D重建的显式模型和显著的隐式神经辐射场表示的优点。每个动态实体的VH-NeRF由可视化船体重构数据学习和监督，并可针对复杂和大规模的动态场景进行进一步编辑。此外，我们的FVV解决方案既可以对多视角视频进行有效的压缩和传输，也可以在消费级硬件上进行实时渲染。据我们所知，我们的工作是第一个通过稀疏多相机阵列捕获的逼真FVV的解决方案，并允许在移动设备上为沉浸式VR和AR应用实时直播大规模动态场景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Photo-Realistic Streamable Free-Viewpoint Video

We present a novel free-viewpoint video(FVV) framework for capturing, processing and compressing the volumetric content for immersive VR/AR experience. Compared to previous FVV capture systems, we propose an easy-to-use multi-camera array consisting of mobile phones with time synchronization. In order to generate photo-realistic FVV results with sparse multi-camera input, we improve the novel view synthesis method by introducing visual hull guided neural representation, called VH-NeRF. Our VH-NeRF combines the advantages of both explicit models by traditional 3D reconstruction and the notable implicit representation of Neural Radiance Field. Each dynamic entity’s VH-NeRF is learned and supervised by the visual hull reconstructed data, and can be further edited for complex and large-scale dynamic scenes. Moreover, our FVV solution can do both effective compression and transmission on multi-perspective videos, as well as real-time rendering on consumer-grade hardware. To the best of our knowledge, our work is the first solution for photo-realistic FVV captured by sparse multi-camera array, and allow real-time live streaming of large-scale dynamic scenes for immersive VR and AR applications on mobile devices.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ACM SIGGRAPH 2023 Posters

自引率

0.00%

发文量