面向无损全光图像压缩的子孔径图像稀疏建模与预测编码

2016 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON) Pub Date : 2016-07-04 DOI:10.1109/3DTV.2016.7548953

Petri Helin, P. Astola, B. Rao, I. Tabus

{"title":"面向无损全光图像压缩的子孔径图像稀疏建模与预测编码","authors":"Petri Helin, P. Astola, B. Rao, I. Tabus","doi":"10.1109/3DTV.2016.7548953","DOIUrl":null,"url":null,"abstract":"This paper studies the lossless compression of rectified light-field images captured by plenoptic cameras, exploiting the high similarity existing between the subaperture images, or views, composing the light-field image. The encoding is predictive, where one sparse predictor is designed for every region of a view, using as regressors the pixels from the already transmitted views. As a first step, consistent segmentations for all subaperture images are constructed, defining the regions as connected components in the quantized depth map of the central view, and then propagating them to all side views. The sparse predictors are able to take into account the small horizontal and vertical disparities between regions in corresponding close-by views and perform optimal least squares interpolation accounting implicitly for fractional disparities. The optimal structure of the sparse predictor is selected for each region based on an implementable description length. The encoding of the views is done sequentially starting from the central view and the scheme produces results better than standard lossless compression methods utilized directly on the full lightfield image or applied to the views in a similar sequential order as our method.","PeriodicalId":378956,"journal":{"name":"2016 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"32","resultStr":"{\"title\":\"Sparse modelling and predictive coding of subaperture images for lossless plenoptic image compression\",\"authors\":\"Petri Helin, P. Astola, B. Rao, I. Tabus\",\"doi\":\"10.1109/3DTV.2016.7548953\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper studies the lossless compression of rectified light-field images captured by plenoptic cameras, exploiting the high similarity existing between the subaperture images, or views, composing the light-field image. The encoding is predictive, where one sparse predictor is designed for every region of a view, using as regressors the pixels from the already transmitted views. As a first step, consistent segmentations for all subaperture images are constructed, defining the regions as connected components in the quantized depth map of the central view, and then propagating them to all side views. The sparse predictors are able to take into account the small horizontal and vertical disparities between regions in corresponding close-by views and perform optimal least squares interpolation accounting implicitly for fractional disparities. The optimal structure of the sparse predictor is selected for each region based on an implementable description length. The encoding of the views is done sequentially starting from the central view and the scheme produces results better than standard lossless compression methods utilized directly on the full lightfield image or applied to the views in a similar sequential order as our method.\",\"PeriodicalId\":378956,\"journal\":{\"name\":\"2016 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"32\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/3DTV.2016.7548953\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/3DTV.2016.7548953","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 32

摘要

本文研究了全光相机采集的整流光场图像的无损压缩，利用子孔径图像或视图之间存在的高度相似性，构成光场图像。编码是预测性的，其中为视图的每个区域设计了一个稀疏预测器，使用来自已传输视图的像素作为回归量。首先，构建所有子孔径图像的一致分割，将区域定义为中心视图量化深度图中的连接分量，然后将其传播到所有侧视图。稀疏预测器能够考虑到相应近距离视图中区域之间的小水平和垂直差异，并隐式地执行最优最小二乘插值。基于可实现的描述长度为每个区域选择稀疏预测器的最优结构。视图的编码是从中心视图开始顺序完成的，该方案产生的结果优于直接对全光场图像使用的标准无损压缩方法，或者以与我们的方法类似的顺序顺序应用于视图。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Sparse modelling and predictive coding of subaperture images for lossless plenoptic image compression

This paper studies the lossless compression of rectified light-field images captured by plenoptic cameras, exploiting the high similarity existing between the subaperture images, or views, composing the light-field image. The encoding is predictive, where one sparse predictor is designed for every region of a view, using as regressors the pixels from the already transmitted views. As a first step, consistent segmentations for all subaperture images are constructed, defining the regions as connected components in the quantized depth map of the central view, and then propagating them to all side views. The sparse predictors are able to take into account the small horizontal and vertical disparities between regions in corresponding close-by views and perform optimal least squares interpolation accounting implicitly for fractional disparities. The optimal structure of the sparse predictor is selected for each region based on an implementable description length. The encoding of the views is done sequentially starting from the central view and the scheme produces results better than standard lossless compression methods utilized directly on the full lightfield image or applied to the views in a similar sequential order as our method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON)

自引率

0.00%

发文量