基于早终止递阶CNN模型的H.266/VVC帧内快速QTMT预测

2021 International Conference on Advanced Technologies for Communications (ATC) Pub Date : 2021-10-14 DOI:10.1109/atc52653.2021.9598222

Xiem HoangVan, Sang NguyenQuang, Minh DinhBao, Minh DoNgoc, Dinh Trieu Duong

{"title":"基于早终止递阶CNN模型的H.266/VVC帧内快速QTMT预测","authors":"Xiem HoangVan, Sang NguyenQuang, Minh DinhBao, Minh DoNgoc, Dinh Trieu Duong","doi":"10.1109/atc52653.2021.9598222","DOIUrl":null,"url":null,"abstract":"Versatile Video Coding (VVC) has been standardization in July 2020. Compared to previous High Efficiency Video Coding (HEVC) standard, VVC saves up to 50% bitrate for equal perceptual video quality. To reach this efficiency, Joint Video Experts Team (JVET) has introduced a number of improvement techniques to VVC model. As a result, the complexity of VVC encoding also greatly increases. One of the new techniques affects to the growing of complexity is the quad-tree nested multi-type tree (QTMT) including binary split and ternary splits, which lead to a block in VVC with various shapes in both square and rectangle. Based on the aforementioned information we propose in this paper a new deep learning based fast QTMT method. We use a learned convolutional neural network (CNN) model namely Early-Terminated Hierarchical CNN to predict the coding unit map and then fed into the VVC encoder to early terminate the block partitioning process. Experimental results show that the proposed method can save 30.29% encoding time with a negligible BD-Rate increase.","PeriodicalId":196900,"journal":{"name":"2021 International Conference on Advanced Technologies for Communications (ATC)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Fast QTMT for H.266/VVC Intra Prediction using Early-Terminated Hierarchical CNN model\",\"authors\":\"Xiem HoangVan, Sang NguyenQuang, Minh DinhBao, Minh DoNgoc, Dinh Trieu Duong\",\"doi\":\"10.1109/atc52653.2021.9598222\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Versatile Video Coding (VVC) has been standardization in July 2020. Compared to previous High Efficiency Video Coding (HEVC) standard, VVC saves up to 50% bitrate for equal perceptual video quality. To reach this efficiency, Joint Video Experts Team (JVET) has introduced a number of improvement techniques to VVC model. As a result, the complexity of VVC encoding also greatly increases. One of the new techniques affects to the growing of complexity is the quad-tree nested multi-type tree (QTMT) including binary split and ternary splits, which lead to a block in VVC with various shapes in both square and rectangle. Based on the aforementioned information we propose in this paper a new deep learning based fast QTMT method. We use a learned convolutional neural network (CNN) model namely Early-Terminated Hierarchical CNN to predict the coding unit map and then fed into the VVC encoder to early terminate the block partitioning process. Experimental results show that the proposed method can save 30.29% encoding time with a negligible BD-Rate increase.\",\"PeriodicalId\":196900,\"journal\":{\"name\":\"2021 International Conference on Advanced Technologies for Communications (ATC)\",\"volume\":\"70 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Advanced Technologies for Communications (ATC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/atc52653.2021.9598222\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Advanced Technologies for Communications (ATC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/atc52653.2021.9598222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

摘要

多功能视频编码(VVC)已于2020年7月标准化。与之前的HEVC (High Efficiency Video Coding)标准相比，VVC在获得相同的感知视频质量的情况下，节省了高达50%的比特率。为了达到这种效率，联合视频专家小组(JVET)对VVC模型引入了许多改进技术。因此，VVC编码的复杂度也大大增加。四叉树嵌套多类型树(QTMT)是影响VVC复杂性增长的新技术之一，它包括二进制分割和三元分割，导致VVC中的块具有正方形和矩形的各种形状。基于上述信息，本文提出了一种新的基于深度学习的快速QTMT方法。我们使用一种习得的卷积神经网络(CNN)模型即early - ended Hierarchical CNN来预测编码单元映射，然后将其输入到VVC编码器中以提前终止块划分过程。实验结果表明，该方法可以节省30.29%的编码时间，而BD-Rate的提高可以忽略不计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Fast QTMT for H.266/VVC Intra Prediction using Early-Terminated Hierarchical CNN model

Versatile Video Coding (VVC) has been standardization in July 2020. Compared to previous High Efficiency Video Coding (HEVC) standard, VVC saves up to 50% bitrate for equal perceptual video quality. To reach this efficiency, Joint Video Experts Team (JVET) has introduced a number of improvement techniques to VVC model. As a result, the complexity of VVC encoding also greatly increases. One of the new techniques affects to the growing of complexity is the quad-tree nested multi-type tree (QTMT) including binary split and ternary splits, which lead to a block in VVC with various shapes in both square and rectangle. Based on the aforementioned information we propose in this paper a new deep learning based fast QTMT method. We use a learned convolutional neural network (CNN) model namely Early-Terminated Hierarchical CNN to predict the coding unit map and then fed into the VVC encoder to early terminate the block partitioning process. Experimental results show that the proposed method can save 30.29% encoding time with a negligible BD-Rate increase.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 International Conference on Advanced Technologies for Communications (ATC)

自引率

0.00%

发文量