A contourlet transform based algorithm for real-time video encoding

Real-Time Image and Video Processing. Pub Date: 2012-05-01. DOI: 10.1117/12.924327

Stamos Katsigiannis, Georgios Papaioannou, D. Maroulis


Citations: 11

Abstract

In recent years, real-time video communication over the internet has been widely used for applications such as video conferencing. Streaming live video over heterogeneous IP networks, including wireless networks, requires video coding algorithms that support several levels of quality so that they can adapt to the end-to-end network bandwidth and to transmitter and receiver resources. In this work, a scalable video coding and compression algorithm based on the Contourlet Transform is proposed. The algorithm provides multiple levels of detail without re-encoding the video frames, simply by dropping the encoded information that refers to resolutions higher than needed. Compression is achieved by means of lossy and lossless methods, as well as variable bit rate encoding schemes. Furthermore, because of the transform used, the algorithm does not suffer from the blocking artifacts that occur with many widely adopted compression algorithms. Another advantageous characteristic of the algorithm is the suppression of noise induced by the low-quality sensors usually found in web cameras, which results from the manipulation of the transform coefficients at the compression stage. The proposed algorithm is designed to introduce minimal coding delay, thus achieving real-time performance. Performance is enhanced by exploiting the computational capabilities of modern GPUs, providing satisfactory encoding and decoding times at relatively low cost. These characteristics make the method suitable for applications like video conferencing that demand real-time performance along with the highest visual quality possible for each user. In the presented performance and quality evaluation, experimental results show that the proposed algorithm achieves better or comparable visual quality relative to the other compression and encoding methods tested, while maintaining a satisfactory compression ratio. Especially at low bitrates, it provides images that are friendlier to the human eye than those of algorithms using block-based coding, like the MPEG family, as it introduces fuzziness and blurring instead of artificial block artifacts.
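The resolution scalability described in the abstract can be illustrated with a minimal sketch. This is not the authors' contourlet codec: it assumes a plain multiscale pyramid (2x2 block averaging and nearest-neighbour upsampling) as a stand-in for the Contourlet Transform, and the names `encode`, `decode`, `downsample`, `upsample`, `levels`, `threshold`, and `keep_details` are all hypothetical. It only shows how a receiver could obtain a lower-resolution frame by dropping the finer detail layers instead of asking for a re-encode, and how zeroing small coefficients acts as a crude lossy and noise-suppressing step.

```python
import numpy as np


def downsample(img):
    """Halve each dimension by averaging 2x2 blocks (assumed stand-in filter)."""
    h, w = img.shape
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))


def upsample(img):
    """Double each dimension by nearest-neighbour repetition."""
    return np.repeat(np.repeat(img, 2, axis=0), 2, axis=1)


def encode(frame, levels=3, threshold=0.0):
    """Split a frame into [coarse, detail_coarsest, ..., detail_finest].

    Assumes both frame dimensions are divisible by 2**levels.
    Detail coefficients with magnitude below `threshold` are zeroed,
    which is a simple lossy step that also discards low-amplitude noise.
    """
    details = []
    current = frame.astype(np.float64)
    for _ in range(levels):
        coarse = downsample(current)
        detail = current - upsample(coarse)
        detail[np.abs(detail) < threshold] = 0.0
        details.append(detail)
        current = coarse
    # Store the coarsest layer first so finer layers can be dropped from the end.
    return [current] + details[::-1]


def decode(layers, keep_details=None):
    """Rebuild a frame from the coarse layer plus the first `keep_details` detail layers.

    Dropping trailing detail layers yields a lower-resolution frame
    without touching the encoder.
    """
    coarse, details = layers[0], layers[1:]
    if keep_details is None:
        keep_details = len(details)
    out = coarse
    for detail in details[:keep_details]:
        out = upsample(out) + detail
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frame = rng.random((16, 16))
    layers = encode(frame, levels=3)
    full = decode(layers)                     # all layers: full resolution
    preview = decode(layers, keep_details=1)  # finer layers dropped
    print(full.shape, preview.shape)
```

In this sketch the layers are ordered from coarse to fine so that a sender or relay could truncate the list to match a receiver's bandwidth. A real contourlet scheme would additionally apply a directional filter bank to each detail layer, followed by quantization and entropy coding, none of which is modelled here.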

