Geometry-guided compact compression for light field image using graph convolutional networks

Yu Liu, Linhui Wei, Heming Zhao, Jingming Shan, Yumei Wang
{"title":"Geometry-guided compact compression for light field image using graph convolutional networks","authors":"Yu Liu, Linhui Wei, Heming Zhao, Jingming Shan, Yumei Wang","doi":"10.1145/3534088.3534345","DOIUrl":null,"url":null,"abstract":"Light field records the information of the light in space and contributes to regenerate the content effectively, which makes immersive media more promising. In this paper, we propose a geometry-guided compact compression scheme (GCC) for light field image. We regard that the geometry of GCC includes the structure in a single sub-aperture image (SAI) and the relationship among SAIs, which can be used to fully explore the compact representation for light field image. The light field image is grouped into key SAIs and non-key SAIs. The key SAIs are obtained by down-sampling in the angular domain and arranged into the pseudo-sequence that needs to be compressed. We consider the superpixel-based segmentation algorithm to detect the contours and obtain the sketch map for the non-key SAIs. Meanwhile, the graph model is used to establish the relationships among the SAIs by the vertices and edges. On the decoder side, the light field image is reconstructed by the graph convolutional networks, and the sketch map optimizes the details of the recovered images to some extent. Experimental results show the benefit of GCC in terms of rate-distortion performances compare with several state-of-the-art methods for the real-world and synthetic light field datasets. Besides, the proposed GCC is able to generalize over datasets not seen during training.","PeriodicalId":150454,"journal":{"name":"Proceedings of the 32nd Workshop on Network and Operating Systems Support for Digital Audio and Video","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 32nd Workshop on Network and Operating Systems Support for Digital Audio and Video","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3534088.3534345","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Light field records the information of the light in space and contributes to regenerate the content effectively, which makes immersive media more promising. In this paper, we propose a geometry-guided compact compression scheme (GCC) for light field image. We regard that the geometry of GCC includes the structure in a single sub-aperture image (SAI) and the relationship among SAIs, which can be used to fully explore the compact representation for light field image. The light field image is grouped into key SAIs and non-key SAIs. The key SAIs are obtained by down-sampling in the angular domain and arranged into the pseudo-sequence that needs to be compressed. We consider the superpixel-based segmentation algorithm to detect the contours and obtain the sketch map for the non-key SAIs. Meanwhile, the graph model is used to establish the relationships among the SAIs by the vertices and edges. On the decoder side, the light field image is reconstructed by the graph convolutional networks, and the sketch map optimizes the details of the recovered images to some extent. Experimental results show the benefit of GCC in terms of rate-distortion performances compare with several state-of-the-art methods for the real-world and synthetic light field datasets. Besides, the proposed GCC is able to generalize over datasets not seen during training.
基于图形卷积网络的光场图像几何导向压缩
光场记录光在空间中的信息,有助于内容的有效再生,使沉浸式媒体更具前景。本文提出了一种光场图像的几何导向压缩方案(GCC)。我们认为,GCC的几何结构包括单子孔径图像(SAI)的结构和SAI之间的关系,可以用来充分探索光场图像的紧凑表示。光场图像分为关键sar和非关键sar。在角域下采样得到关键的sai,并将其排列到需要压缩的伪序列中。我们考虑了基于超像素的分割算法来检测非关键sai的轮廓并获得草图图。同时,利用图模型通过顶点和边来建立sai之间的关系。在解码器端,利用图卷积网络重构光场图像,草图映射在一定程度上优化了复原图像的细节。实验结果表明,在真实世界和合成光场数据集上,与几种最先进的方法相比,GCC在率失真性能方面具有优势。此外,所提出的GCC能够在训练期间未见的数据集上进行泛化。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信