基于可伸缩矩阵分解的低复杂度HEVC变换体系结构

IF 4.3 2区计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Transactions on Consumer Electronics Pub Date : 2024-10-21 DOI:10.1109/TCE.2024.3483949

Subiman Chatterjee;Dinesh Bhardwaj;Binod Prasad;Kishor Sarawadekar

{"title":"基于可伸缩矩阵分解的低复杂度HEVC变换体系结构","authors":"Subiman Chatterjee;Dinesh Bhardwaj;Binod Prasad;Kishor Sarawadekar","doi":"10.1109/TCE.2024.3483949","DOIUrl":null,"url":null,"abstract":"High Efficiency Video Coding (HEVC) is widely used in ultra-high definition (UHD) video applications for its high compression ability, and Discrete Cosine Transform (DCT) is an indispensable module in it. With increasing video resolution, DCT sizes and hardware complexity increase continuously, which poses challenges to the system designers, especially when real-time operations are required. Therefore, the low-complexity approximation of the DCT is paramount in applications demanding real-time computation. To meet this requirement, this article proposes a new DCT architecture for HEVC based on the matrix decomposition method. Each coefficient of the resultant matrices is approximated in such a way that all the multiplications can be realized by shift and add operations. The method reduces the data path width and number of cascaded adders as the matrices use small coefficients compared to the integer DCT of HEVC. The proposed approach designs all transform sizes and maintains the scalability features of DCT used in HEVC. At the cost of a minor drop in PSNR, the proposed method requires 74% less area-delay product in comparison to the HEVC reference algorithm. It is capable of processing at least 30 frames/s of UHD video when implemented on the FPGA of a 28 nm technology node.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"70 4","pages":"6691-6699"},"PeriodicalIF":4.3000,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Scalable Matrix Decomposition-Based Less-Complex HEVC Transform Architecture\",\"authors\":\"Subiman Chatterjee;Dinesh Bhardwaj;Binod Prasad;Kishor Sarawadekar\",\"doi\":\"10.1109/TCE.2024.3483949\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"High Efficiency Video Coding (HEVC) is widely used in ultra-high definition (UHD) video applications for its high compression ability, and Discrete Cosine Transform (DCT) is an indispensable module in it. With increasing video resolution, DCT sizes and hardware complexity increase continuously, which poses challenges to the system designers, especially when real-time operations are required. Therefore, the low-complexity approximation of the DCT is paramount in applications demanding real-time computation. To meet this requirement, this article proposes a new DCT architecture for HEVC based on the matrix decomposition method. Each coefficient of the resultant matrices is approximated in such a way that all the multiplications can be realized by shift and add operations. The method reduces the data path width and number of cascaded adders as the matrices use small coefficients compared to the integer DCT of HEVC. The proposed approach designs all transform sizes and maintains the scalability features of DCT used in HEVC. At the cost of a minor drop in PSNR, the proposed method requires 74% less area-delay product in comparison to the HEVC reference algorithm. It is capable of processing at least 30 frames/s of UHD video when implemented on the FPGA of a 28 nm technology node.\",\"PeriodicalId\":13208,\"journal\":{\"name\":\"IEEE Transactions on Consumer Electronics\",\"volume\":\"70 4\",\"pages\":\"6691-6699\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Consumer Electronics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10726565/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10726565/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

高效视频编码（High Efficiency Video Coding， HEVC）因其高压缩能力在超高清视频应用中得到了广泛的应用，而离散余弦变换（Discrete Cosine Transform， DCT）是其中不可或缺的一个模块。随着视频分辨率的提高，DCT的尺寸和硬件复杂度不断增加，这给系统设计人员带来了挑战，特别是在需要实时操作的情况下。因此，在需要实时计算的应用中，DCT的低复杂度近似是至关重要的。为了满足这一要求，本文提出了一种基于矩阵分解方法的HEVC DCT结构。所得到的矩阵的每个系数都以这样一种方式近似，即所有的乘法都可以通过移位和加法运算来实现。与HEVC的整数DCT相比，该方法减少了数据路径宽度和级联加器的数量，因为矩阵使用的系数较小。该方法设计了所有的变换大小，并保持了HEVC中使用的DCT的可扩展性特征。以PSNR小幅下降为代价，与HEVC参考算法相比，该方法所需的面积延迟积减少了74%。当在28nm技术节点的FPGA上实现时，它能够处理至少30帧/秒的超高清视频。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Scalable Matrix Decomposition-Based Less-Complex HEVC Transform Architecture

High Efficiency Video Coding (HEVC) is widely used in ultra-high definition (UHD) video applications for its high compression ability, and Discrete Cosine Transform (DCT) is an indispensable module in it. With increasing video resolution, DCT sizes and hardware complexity increase continuously, which poses challenges to the system designers, especially when real-time operations are required. Therefore, the low-complexity approximation of the DCT is paramount in applications demanding real-time computation. To meet this requirement, this article proposes a new DCT architecture for HEVC based on the matrix decomposition method. Each coefficient of the resultant matrices is approximated in such a way that all the multiplications can be realized by shift and add operations. The method reduces the data path width and number of cascaded adders as the matrices use small coefficients compared to the integer DCT of HEVC. The proposed approach designs all transform sizes and maintains the scalability features of DCT used in HEVC. At the cost of a minor drop in PSNR, the proposed method requires 74% less area-delay product in comparison to the HEVC reference algorithm. It is capable of processing at least 30 frames/s of UHD video when implemented on the FPGA of a 28 nm technology node.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Consumer Electronics 工程技术-电信学

CiteScore

7.70

自引率

9.30%

发文量

审稿时长

3.3 months

期刊介绍： The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.