使用 CU-VGG 深度学习架构进行高速编码单元深度识别

IF 2.6 4区综合性期刊 Q2 MULTIDISCIPLINARY SCIENCES

Arabian Journal for Science and Engineering Pub Date : 2024-04-01 DOI:10.1007/s13369-024-08928-4

Hari Pattimi, B. K. N. Srinivasarao

{"title":"使用 CU-VGG 深度学习架构进行高速编码单元深度识别","authors":"Hari Pattimi, B. K. N. Srinivasarao","doi":"10.1007/s13369-024-08928-4","DOIUrl":null,"url":null,"abstract":"<div><p>The quadtree partition process involves major complexity in high-efficiency video coding (HEVC/H.265). It divides the coding tree units (CTUs) recursively into coding units (CUs). Determining the coding unit partition depth based on rate-distortion optimisation is computationally difficult in HEVC. This article proposes a system based on a deep learning architecture for determining the coding unit partition depth with less time in HEVC intra-prediction. The proposed system minimises computing complexity by removing the rate-distortion optimisation. The proposed system comprises two main blocks: the pre-processing block and the deep learning block. During the pre-processing phase, the spatial resolution of the input data is drastically reduced, enabling the neural network model to quickly adapt to the input sample and extract more meaningful feature data. This paper proposes two distinct deep learning architectures, CU-VGG16 and CU-VGG19. Pre-processed coding units (16 <span>\\(\\times \\)</span> 16) are the input for the deep learning architecture, and the corresponding coding units’ depths (0, 1, 2, 3) are the output. To compare the accuracy of coding unit depth prediction in the two proposed models, we have created a database with varying resolutions. The performance of the proposed models was observed by replacing the CU partition block of traditional HEVC with the proposed systems and comparing the bit rate and encoding time with traditional HEVC. The results demonstrated that the proposed architecture with CU-VGG16 and CU-VGG19 designs speeds up coding unit partitioning by 87.15% and 87.70%, respectively, as compared to standard HEVC.</p></div>","PeriodicalId":54354,"journal":{"name":"Arabian Journal for Science and Engineering","volume":"49 12","pages":"16287 - 16298"},"PeriodicalIF":2.6000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"High-Speed Coding Unit Depth Identifications Using CU-VGG Deep Learning Architectures\",\"authors\":\"Hari Pattimi, B. K. N. Srinivasarao\",\"doi\":\"10.1007/s13369-024-08928-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The quadtree partition process involves major complexity in high-efficiency video coding (HEVC/H.265). It divides the coding tree units (CTUs) recursively into coding units (CUs). Determining the coding unit partition depth based on rate-distortion optimisation is computationally difficult in HEVC. This article proposes a system based on a deep learning architecture for determining the coding unit partition depth with less time in HEVC intra-prediction. The proposed system minimises computing complexity by removing the rate-distortion optimisation. The proposed system comprises two main blocks: the pre-processing block and the deep learning block. During the pre-processing phase, the spatial resolution of the input data is drastically reduced, enabling the neural network model to quickly adapt to the input sample and extract more meaningful feature data. This paper proposes two distinct deep learning architectures, CU-VGG16 and CU-VGG19. Pre-processed coding units (16 <span>\\\\(\\\\times \\\\)</span> 16) are the input for the deep learning architecture, and the corresponding coding units’ depths (0, 1, 2, 3) are the output. To compare the accuracy of coding unit depth prediction in the two proposed models, we have created a database with varying resolutions. The performance of the proposed models was observed by replacing the CU partition block of traditional HEVC with the proposed systems and comparing the bit rate and encoding time with traditional HEVC. The results demonstrated that the proposed architecture with CU-VGG16 and CU-VGG19 designs speeds up coding unit partitioning by 87.15% and 87.70%, respectively, as compared to standard HEVC.</p></div>\",\"PeriodicalId\":54354,\"journal\":{\"name\":\"Arabian Journal for Science and Engineering\",\"volume\":\"49 12\",\"pages\":\"16287 - 16298\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Arabian Journal for Science and Engineering\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s13369-024-08928-4\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arabian Journal for Science and Engineering","FirstCategoryId":"103","ListUrlMain":"https://link.springer.com/article/10.1007/s13369-024-08928-4","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

在高效视频编码（HEVC/H.265）中，四叉树分割过程非常复杂。它将编码树单元（CTU）递归划分为编码单元（CU）。在 HEVC 中，基于速率-失真优化确定编码单元划分深度在计算上非常困难。本文提出了一种基于深度学习架构的系统，用于在 HEVC 内部预测中以更短的时间确定编码单元划分深度。该系统取消了速率-失真优化，从而最大限度地降低了计算复杂度。提议的系统包括两个主要模块：预处理模块和深度学习模块。在预处理阶段，输入数据的空间分辨率被大幅降低，从而使神经网络模型能够快速适应输入样本并提取更有意义的特征数据。本文提出了两种不同的深度学习架构：CU-VGG16 和 CU-VGG19。预处理后的编码单元（16 \(\times \) 16）是深度学习架构的输入，相应的编码单元深度（0、1、2、3）是输出。为了比较两种拟议模型中编码单元深度预测的准确性，我们创建了一个具有不同分辨率的数据库。将传统 HEVC 的 CU 分区块替换为建议的系统，并将比特率和编码时间与传统 HEVC 进行比较，从而观察建议模型的性能。结果表明，与标准 HEVC 相比，采用 CU-VGG16 和 CU-VGG19 设计的拟议架构将编码单元分区速度分别提高了 87.15% 和 87.70%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

High-Speed Coding Unit Depth Identifications Using CU-VGG Deep Learning Architectures

查看原文本刊更多论文

High-Speed Coding Unit Depth Identifications Using CU-VGG Deep Learning Architectures

The quadtree partition process involves major complexity in high-efficiency video coding (HEVC/H.265). It divides the coding tree units (CTUs) recursively into coding units (CUs). Determining the coding unit partition depth based on rate-distortion optimisation is computationally difficult in HEVC. This article proposes a system based on a deep learning architecture for determining the coding unit partition depth with less time in HEVC intra-prediction. The proposed system minimises computing complexity by removing the rate-distortion optimisation. The proposed system comprises two main blocks: the pre-processing block and the deep learning block. During the pre-processing phase, the spatial resolution of the input data is drastically reduced, enabling the neural network model to quickly adapt to the input sample and extract more meaningful feature data. This paper proposes two distinct deep learning architectures, CU-VGG16 and CU-VGG19. Pre-processed coding units (16 \(\times \) 16) are the input for the deep learning architecture, and the corresponding coding units’ depths (0, 1, 2, 3) are the output. To compare the accuracy of coding unit depth prediction in the two proposed models, we have created a database with varying resolutions. The performance of the proposed models was observed by replacing the CU partition block of traditional HEVC with the proposed systems and comparing the bit rate and encoding time with traditional HEVC. The results demonstrated that the proposed architecture with CU-VGG16 and CU-VGG19 designs speeds up coding unit partitioning by 87.15% and 87.70%, respectively, as compared to standard HEVC.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Arabian Journal for Science and Engineering MULTIDISCIPLINARY SCIENCES-

CiteScore

5.70

自引率

3.40%

发文量

993

期刊介绍： King Fahd University of Petroleum & Minerals (KFUPM) partnered with Springer to publish the Arabian Journal for Science and Engineering (AJSE). AJSE, which has been published by KFUPM since 1975, is a recognized national, regional and international journal that provides a great opportunity for the dissemination of research advances from the Kingdom of Saudi Arabia, MENA and the world.