Yu Liu, Linhui Wei, Heming Zhao, Jingming Shan, Yumei Wang
{"title":"基于图形卷积网络的光场图像几何导向压缩","authors":"Yu Liu, Linhui Wei, Heming Zhao, Jingming Shan, Yumei Wang","doi":"10.1145/3534088.3534345","DOIUrl":null,"url":null,"abstract":"Light field records the information of the light in space and contributes to regenerate the content effectively, which makes immersive media more promising. In this paper, we propose a geometry-guided compact compression scheme (GCC) for light field image. We regard that the geometry of GCC includes the structure in a single sub-aperture image (SAI) and the relationship among SAIs, which can be used to fully explore the compact representation for light field image. The light field image is grouped into key SAIs and non-key SAIs. The key SAIs are obtained by down-sampling in the angular domain and arranged into the pseudo-sequence that needs to be compressed. We consider the superpixel-based segmentation algorithm to detect the contours and obtain the sketch map for the non-key SAIs. Meanwhile, the graph model is used to establish the relationships among the SAIs by the vertices and edges. On the decoder side, the light field image is reconstructed by the graph convolutional networks, and the sketch map optimizes the details of the recovered images to some extent. Experimental results show the benefit of GCC in terms of rate-distortion performances compare with several state-of-the-art methods for the real-world and synthetic light field datasets. Besides, the proposed GCC is able to generalize over datasets not seen during training.","PeriodicalId":150454,"journal":{"name":"Proceedings of the 32nd Workshop on Network and Operating Systems Support for Digital Audio and Video","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Geometry-guided compact compression for light field image using graph convolutional networks\",\"authors\":\"Yu Liu, Linhui Wei, Heming Zhao, Jingming Shan, Yumei Wang\",\"doi\":\"10.1145/3534088.3534345\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Light field records the information of the light in space and contributes to regenerate the content effectively, which makes immersive media more promising. In this paper, we propose a geometry-guided compact compression scheme (GCC) for light field image. We regard that the geometry of GCC includes the structure in a single sub-aperture image (SAI) and the relationship among SAIs, which can be used to fully explore the compact representation for light field image. The light field image is grouped into key SAIs and non-key SAIs. The key SAIs are obtained by down-sampling in the angular domain and arranged into the pseudo-sequence that needs to be compressed. We consider the superpixel-based segmentation algorithm to detect the contours and obtain the sketch map for the non-key SAIs. Meanwhile, the graph model is used to establish the relationships among the SAIs by the vertices and edges. On the decoder side, the light field image is reconstructed by the graph convolutional networks, and the sketch map optimizes the details of the recovered images to some extent. Experimental results show the benefit of GCC in terms of rate-distortion performances compare with several state-of-the-art methods for the real-world and synthetic light field datasets. Besides, the proposed GCC is able to generalize over datasets not seen during training.\",\"PeriodicalId\":150454,\"journal\":{\"name\":\"Proceedings of the 32nd Workshop on Network and Operating Systems Support for Digital Audio and Video\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 32nd Workshop on Network and Operating Systems Support for Digital Audio and Video\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3534088.3534345\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 32nd Workshop on Network and Operating Systems Support for Digital Audio and Video","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3534088.3534345","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Geometry-guided compact compression for light field image using graph convolutional networks
Light field records the information of the light in space and contributes to regenerate the content effectively, which makes immersive media more promising. In this paper, we propose a geometry-guided compact compression scheme (GCC) for light field image. We regard that the geometry of GCC includes the structure in a single sub-aperture image (SAI) and the relationship among SAIs, which can be used to fully explore the compact representation for light field image. The light field image is grouped into key SAIs and non-key SAIs. The key SAIs are obtained by down-sampling in the angular domain and arranged into the pseudo-sequence that needs to be compressed. We consider the superpixel-based segmentation algorithm to detect the contours and obtain the sketch map for the non-key SAIs. Meanwhile, the graph model is used to establish the relationships among the SAIs by the vertices and edges. On the decoder side, the light field image is reconstructed by the graph convolutional networks, and the sketch map optimizes the details of the recovered images to some extent. Experimental results show the benefit of GCC in terms of rate-distortion performances compare with several state-of-the-art methods for the real-world and synthetic light field datasets. Besides, the proposed GCC is able to generalize over datasets not seen during training.