Scalable Matrix Decomposition-Based Less-Complex HEVC Transform Architecture

Impact factor: 4.3 | Zone 2 (Computer Science) | JCR Q1 (Engineering, Electrical & Electronic)
Subiman Chatterjee;Dinesh Bhardwaj;Binod Prasad;Kishor Sarawadekar
DOI: 10.1109/TCE.2024.3483949
Journal: IEEE Transactions on Consumer Electronics, vol. 70, no. 4, pp. 6691-6699
Published: 2024-10-21
Open access: No
Source: https://ieeexplore.ieee.org/document/10726565/ (indexed via Semantic Scholar)
Citations: 0

Abstract

High Efficiency Video Coding (HEVC) is widely used in ultra-high-definition (UHD) video applications for its high compression ability, and the Discrete Cosine Transform (DCT) is an indispensable module in it. As video resolution increases, DCT sizes and hardware complexity grow, which poses challenges for system designers, especially when real-time operation is required. A low-complexity approximation of the DCT is therefore paramount in applications demanding real-time computation. To meet this requirement, this article proposes a new DCT architecture for HEVC based on the matrix decomposition method. Each coefficient of the resultant matrices is approximated so that all multiplications can be realized by shift and add operations. Because the matrices use small coefficients compared to the integer DCT of HEVC, the method reduces the data path width and the number of cascaded adders. The proposed approach covers all transform sizes and maintains the scalability features of the DCT used in HEVC. At the cost of a minor drop in PSNR, the proposed method requires 74% less area-delay product than the HEVC reference algorithm. It is capable of processing at least 30 frames/s of UHD video when implemented on an FPGA at a 28 nm technology node.
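
The abstract's central idea, replacing every constant multiplication with shifts and adds, can be illustrated with a small sketch. The example below is not the paper's decomposition (the abstract does not list its approximated coefficients). It assumes the standard HEVC 4-point integer DCT coefficients (64, 83, 36) and omits the scaling and rounding stages of the real codec. The function names are hypothetical and chosen for illustration only.

```python
# Illustrative sketch only: multiplierless 4-point transform using the
# standard HEVC integer DCT coefficients (64, 83, 36). The paper's own
# approximated matrices are NOT reproduced here.

HEVC_DCT4 = [
    [64,  64,  64,  64],
    [83,  36, -36, -83],
    [64, -64, -64,  64],
    [36, -83,  83, -36],
]

def mul64(x):
    # 64 = 2^6, so a single left shift suffices
    return x << 6

def mul83(x):
    # 83 = 64 + 16 + 2 + 1: three shifts and three additions
    return (x << 6) + (x << 4) + (x << 1) + x

def mul36(x):
    # 36 = 32 + 4: two shifts and one addition
    return (x << 5) + (x << 2)

def dct4_shift_add(x):
    """4-point forward transform via an even/odd butterfly and shift-add products."""
    e0, e1 = x[0] + x[3], x[1] + x[2]   # even part
    o0, o1 = x[0] - x[3], x[1] - x[2]   # odd part
    y0 = mul64(e0 + e1)
    y2 = mul64(e0 - e1)
    y1 = mul83(o0) + mul36(o1)
    y3 = mul36(o0) - mul83(o1)
    return [y0, y1, y2, y3]

def dct4_reference(x):
    """Same transform computed as a plain matrix-vector product, for comparison."""
    return [sum(c * v for c, v in zip(row, x)) for row in HEVC_DCT4]
```

Calling `dct4_shift_add([1, 2, 3, 4])` and `dct4_reference([1, 2, 3, 4])` should give identical results. Smaller coefficients need fewer set bits and therefore fewer cascaded adders, which is the effect the abstract attributes to its approximated matrices.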
Source journal: IEEE Transactions on Consumer Electronics
CiteScore: 7.70
Self-citation rate: 9.30%
Articles published: 59
Review time: 3.3 months
Journal description: The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.