Smitha Lingadahalli Ravi, F. Henry, L. Morin, Matthieu Gendrin
{"title":"沉浸式视频传输中基于图像渲染的时间一致性研究","authors":"Smitha Lingadahalli Ravi, F. Henry, L. Morin, Matthieu Gendrin","doi":"10.1109/EUVIP53989.2022.9922680","DOIUrl":null,"url":null,"abstract":"Image-based rendering methods synthesize novel views given input images captured from multiple viewpoints to display free viewpoint immersive video. Despite significant progress with the recent learning-based approaches, there are still some drawbacks. In particular, these approaches operate at the still image level and do not maintain consistency among consecutive time instants, leading to temporal noise. To address this, we propose an intra-only framework to identify regions of input images leading to temporally inconsistent synthesized views. Our method synthesizes better and more stable novel views, even in the most general use case of immersive video transmission. We conclude that the network seems to identify and correct spatial features at the still image level that produce artifacts in the temporal dimension.","PeriodicalId":120249,"journal":{"name":"2022 10th European Workshop on Visual Information Processing (EUVIP)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring Temporal Consistency in Image-Based Rendering for Immersive Video Transmission\",\"authors\":\"Smitha Lingadahalli Ravi, F. Henry, L. Morin, Matthieu Gendrin\",\"doi\":\"10.1109/EUVIP53989.2022.9922680\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Image-based rendering methods synthesize novel views given input images captured from multiple viewpoints to display free viewpoint immersive video. Despite significant progress with the recent learning-based approaches, there are still some drawbacks. 
In particular, these approaches operate at the still image level and do not maintain consistency among consecutive time instants, leading to temporal noise. To address this, we propose an intra-only framework to identify regions of input images leading to temporally inconsistent synthesized views. Our method synthesizes better and more stable novel views, even in the most general use case of immersive video transmission. We conclude that the network seems to identify and correct spatial features at the still image level that produce artifacts in the temporal dimension.\",\"PeriodicalId\":120249,\"journal\":{\"name\":\"2022 10th European Workshop on Visual Information Processing (EUVIP)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 10th European Workshop on Visual Information Processing (EUVIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EUVIP53989.2022.9922680\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 10th European Workshop on Visual Information Processing (EUVIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EUVIP53989.2022.9922680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploring Temporal Consistency in Image-Based Rendering for Immersive Video Transmission
Image-based rendering methods synthesize novel views from input images captured at multiple viewpoints, enabling free-viewpoint immersive video. Despite significant progress with recent learning-based approaches, some drawbacks remain. In particular, these approaches operate at the still-image level and do not maintain consistency across consecutive time instants, leading to temporal noise. To address this, we propose an intra-only framework that identifies the regions of input images responsible for temporally inconsistent synthesized views. Our method synthesizes better and more stable novel views, even in the most general use case of immersive video transmission. We conclude that the network appears to identify and correct spatial features at the still-image level that produce artifacts in the temporal dimension.