{"title":"Intra-coding of 360-degree images on the sphere","authors":"Navid Mahmoudian Bidgoli, Thomas Maugey, A. Roumy","doi":"10.1109/PCS48520.2019.8954538","DOIUrl":"https://doi.org/10.1109/PCS48520.2019.8954538","url":null,"abstract":"Omni-directional images are characterized by their high resolution (usually 8K) and therefore require high compression efficiency. Existing methods project the spherical content onto one or multiple planes and process the mapped content with classical 2D video coding algorithms. However, this projection induces sub-optimality. Indeed, after projection, the statistical properties of the pixels are modified, the connectivity between neighboring pixels on the sphere might be lost, and finally, the sampling is not uniform. Therefore, we propose to process uniformly distributed pixels directly on the sphere to achieve high compression efficiency. In particular, a scanning order and a prediction scheme are proposed to exploit, directly on the sphere, the statistical dependencies between the pixels. A Graph Fourier Transform is also applied to exploit local dependencies while taking into account the 3D geometry. Experimental results demonstrate that the proposed method provides up to 5.6% bitrate reduction and on average around 2% bitrate reduction over state-of-the-art methods.","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126539540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Patrick Garus, Joël Jung, Thomas Maugey, C. Guillemot
{"title":"Bypassing Depth Maps Transmission For Immersive Video Coding","authors":"Patrick Garus, Joël Jung, Thomas Maugey, C. Guillemot","doi":"10.1109/PCS48520.2019.8954543","DOIUrl":"https://doi.org/10.1109/PCS48520.2019.8954543","url":null,"abstract":"This paper addresses several downsides of the system under development in MPEG-I for coding and transmission of immersive media. We present a solution, which enables Depth-Image-Based Rendering for immersive video applications, while lifting the requirement of transmitting depth information. Instead, we estimate the depth information on the client-side from the transmitted views. The approach leads to an impressive rate saving (37.3% in average). Preserving perceptual quality in terms of MS-SSIM of synthesized views, it yields to 24.6% rate reduction for the same quality of reconstructed views after residue transmission under the MPEG-I common test conditions. Simultaneously, the required pixel rate, i.e. the number of pixels processed per second by the decoder, is reduced by 50% for any test sequence. To the author’s knowledge, this is the first time that such an approach is under consideration in the context of immersive video coding.","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"154 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131371602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Robust Circular Two-Dimensional Barcode and Decoding Method","authors":"Fuwang Yi, Guangtao Zhai, Zehao Zhu","doi":"10.1109/PCS48520.2019.8954533","DOIUrl":"https://doi.org/10.1109/PCS48520.2019.8954533","url":null,"abstract":"With the popularity of two-dimensional (2D) bar-codes, many image correction algorithms for two-dimensional barcodes have been proposed. However, limited by the form of matrix barcodes, the effectiveness of these correction algorithms is limited. Besides, circular 2D barcodes such as ShotCode have better ability to resist image distortion. But, due to the small information capacity, the research on circular 2D barcodes in recent years is limited. Therefore, we propose a robust circular 2D barcode in this paper. By adopting colour-coding and a new decoding method, it can not only achieve the basic information capacity, but also effectively enhance the ability to resist image distortion. In addition, we propose two indicators (maximum distortion ratio and maximum support opening angle) to measure the image distortion of two-dimensional barcodes on geometric bodies. Experiments verify the superiority of the new circular two-dimensional barcode and its decoding method.","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134472269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Scalable Image Compression via Hierarchical Feature Decorrelation","authors":"Zongyu Guo, Zhizheng Zhang, Zhibo Chen","doi":"10.1109/PCS48520.2019.8954536","DOIUrl":"https://doi.org/10.1109/PCS48520.2019.8954536","url":null,"abstract":"Scalable image compression allows reconstructing complete images through partially decoding. It plays an important role for image transmission and storage. In this paper, we study the problem of feature decorrelation for Deep Neural Network (DNN) based image codec. Inspired by self-attention mechanism [1], we design a transformer-based decorrelation unit (DU) and adopt it in our scalable image compression framework to reduce the redundancy of feature representations at different levels. Experimental results demonstrate that proposed framework outperforms the state-of-the-art DNN-based scalable image codec and conventional scalable image codecs in terms of MS-SSIM. We also conduct ablation experiments which explicitly verify the effectiveness of decorrelation unit in our scheme.","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"238 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133795604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Weijia Zhu, Jizheng Xu, Li Zhang, Kai Zhang, Hongbin Liu, Yue Wang
{"title":"Compound Palette Mode for Screen Content Coding","authors":"Weijia Zhu, Jizheng Xu, Li Zhang, Kai Zhang, Hongbin Liu, Yue Wang","doi":"10.1109/PCS48520.2019.8954517","DOIUrl":"https://doi.org/10.1109/PCS48520.2019.8954517","url":null,"abstract":"The Joint Video Exploration Team (JVET) has been developing an emerging standard Versatile Video Coding (VVC), which includes screen contents as one of its requirements. Intra block copy (IBC) and palette coding are the two powerful coding tools for screen content coding. In this paper, a compound palette mode is proposed to exploit the advantages of both IBC and palette coding, which allows samples to be reconstructed by either IBC predictions or palette entries. The proposed method is evaluated with VVC reference software VTM4 on typical sequences containing \"text and graphics with motion\". Experimental results report significant coding gain that the proposed scheme can achieve 7.80% and 1.03% BD-rate savings under AI conditions on average when compared with VTM4 and the existing palette scheme.","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"126 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124457501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chunlei Cai, Li Chen, Xiaoyun Zhang, Guo Lu, Zhiyong Gao
{"title":"A Novel Deep Progressive Image Compression Framework","authors":"Chunlei Cai, Li Chen, Xiaoyun Zhang, Guo Lu, Zhiyong Gao","doi":"10.1109/PCS48520.2019.8954500","DOIUrl":"https://doi.org/10.1109/PCS48520.2019.8954500","url":null,"abstract":"In Internet applications, compressing the image without perceptually distinguishable distortions and loading the images without notable delays in the client end can significantly improve the user experience. Compressing the image at high bit rates can maintain the high quality of the decoded image but in cost of long transmitting and decoding time, resulting in bad user experience. The progressive coding scheme can resolve the conflict between the high quality requirement and the large loading delay. This paper proposes a novel efficient progressive image coding framework based on deep convolutional neural networks. The proposed framework is composed of a uniform encoder network and two progressive decoder networks. The encoder network decomposes the input image into two scales of representations, that can be transmitted and reconstructed progressively into a basic quality preview image and a high-quality image by two individual decoder networks respectively. All the networks are jointly learned when achieving the rate distortion optimization of both scales. Experiments results show that the proposed method has much better coding performance than the commercial codecs WebP and JPEG, which are commonly used in Internet applications. Meanwhile, the proposed codec consumes much less time to load the image compared with WebP.","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"74 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122151163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Implicit Transform Selection based on Cross Color Component Prediction for Future Video Coding","authors":"Shimpei Nemoto, S. Iwamura, A. Ichigaya","doi":"10.1109/PCS48520.2019.8954526","DOIUrl":"https://doi.org/10.1109/PCS48520.2019.8954526","url":null,"abstract":"In the study of video coding technologies, Discrete Cosine Transform type II (DCT-II) has been employed for energy compaction. To improve coding efficiency, Multiple Transform Selection (MTS) scheme is recently proposed where Discrete Sine Transform type VII (DST-VII) and DCT-VIII are newly introduced with explicit signaling. MTS is not applied for chroma components due to the limitation of computation complexity at an encoder side. This paper proposes implicit transform selection to apply DST-VII for the chroma blocks based on the intra prediction mode applied to the chroma block. The experimental results show 0.45% and 0.48% BD-rate gain in all intra configuration for Cb and Cr components compared to the conventional method with negligible impact of encoding and decoding complexity.","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114957327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
W. Hamidouche, P. Philippe, C.-E. Mohamed, Ahmed Kammoun, D. Ménard, O. Déforges
{"title":"Hardware-friendly DST-VII/DCT-VIII approximations for the Versatile Video Coding Standard","authors":"W. Hamidouche, P. Philippe, C.-E. Mohamed, Ahmed Kammoun, D. Ménard, O. Déforges","doi":"10.1109/PCS48520.2019.8954535","DOIUrl":"https://doi.org/10.1109/PCS48520.2019.8954535","url":null,"abstract":"Versatile Video Coding (VVC) is the next generation video coding standard expected by the end of 2020. The new concept of Multiple-Transform Selection (MTS) has been introduced in VVC. MTS enables the VVC encoder to select the transform that minimizes the rate-distortion cost among a set of pre-defined trigonometric transforms including the well known Discrete Cosine Transform (DCT)-II, DCT-VIII and Discrete Sine Transform (DST)-VII. Unlike the DCT-II that has fast computing algorithms, the DST-VII and DCT-VIII rely on more complex matrix multiplication.This paper tackles the problem of DST-VII and DCT-VIII approximations based on the DCT-II and an adjustment stage. This latter consists in a multiplication by a band-matrix with low number of non-zero coefficients per row. The approximation problem is first modeled as a constrained integer optimization problem minimizing both error and orthogonality. The genetic algorithm is then used to solve the optimization problem and find the adjustment band-matrix that minimizes a trade-off between error and orthogonality. The proposed solution enables to preserve the coding gain achieved by the MTS and considerably reduces the complexity in terms of required number of multiplications by coefficient. Moreover, the proposed approach is hardwarefriendly and will provide a lightweight shared hardware module for DST-II, DST-VII and DCT-VIII transforms.","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124111476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Huaifei Xing, Zhichao Zhou, Jialiang Wang, Huifeng Shen, Dongliang He, Fu Li
{"title":"Predicting Rate Control Target Through A Learning Based Content Adaptive Model","authors":"Huaifei Xing, Zhichao Zhou, Jialiang Wang, Huifeng Shen, Dongliang He, Fu Li","doi":"10.1109/PCS48520.2019.8954541","DOIUrl":"https://doi.org/10.1109/PCS48520.2019.8954541","url":null,"abstract":"Rate Control (RC) plays an important role in video encoding. Traditional solutions are using fixed rate or fixed quantization parameters as the unified rate-control targets for all videos in one given video application. However, unified ratecontrol targets tend to have some bad encoding cases because of applying wrong rate for the video content. In this paper, we propose one content-adaptive rate control solution. We employ one neural-network based model which can end-to-end learn the optimal rate-control target appropriate to the content characteristics. The experimental results show that the proposed model can predict the optimal rate-factor value with the accuracy up to 77.637%. With this model, the proposed video-encoding method can significantly decrease the encoding quality fluctuation.","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130003963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}