S. Q. Nguyen, Tien Huu Vu, Duong Trieu Dinh, Minh Bao Dinh, Minh N. Do, X. Hoang
{"title":"Early CTU Termination and Three-steps Mode Decision Method for Fast Versatile Video Coding","authors":"S. Q. Nguyen, Tien Huu Vu, Duong Trieu Dinh, Minh Bao Dinh, Minh N. Do, X. Hoang","doi":"10.25073/2588-1086/vnucsce.375","DOIUrl":null,"url":null,"abstract":"Versatile Video Coding (VVC) has been recently becoming popular in coding videos due to its compression efficiency. To reach this performance, Joint Video Experts Team (JVET) has introduced a number of improvement techniques to VVC coding model. Among them, VVC Intra coding introduces a new concept of quad-tree nested multi-type tree (QTMT) and extends the predicted modes with up to 67 options. As a result, the complexity of the VVC Intra encoding also greatly increases. To make VVC Intra coding more feasible in real-time applications, we propose in this paper a novel deep learning based fast QTMT and an early mode prediction method. At the first stage, we use a learned convolutional neural network (CNN) model to predict the coding unit map and then fed into the VVC encoder to early terminate the block partitioning process. After that, we design a statistical model to predict a list of most probable modes (MPM) for each selected Coding using (CU) size. Finally, we employ a so-called three-steps mode decision algorithm to estimate the optimal directional mode without sacrificing the compression performance. The proposed early CU splitting and fast intra prediction are integrated into the latest VTM reference software. Experimental results show that the proposed method can save 50.2% of encoding time with a negligible BD-Rate increase.","PeriodicalId":416488,"journal":{"name":"VNU Journal of Science: Computer Science and Communication Engineering","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"VNU Journal of Science: Computer Science and Communication Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25073/2588-1086/vnucsce.375","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Versatile Video Coding (VVC) has been recently becoming popular in coding videos due to its compression efficiency. To reach this performance, Joint Video Experts Team (JVET) has introduced a number of improvement techniques to VVC coding model. Among them, VVC Intra coding introduces a new concept of quad-tree nested multi-type tree (QTMT) and extends the predicted modes with up to 67 options. As a result, the complexity of the VVC Intra encoding also greatly increases. To make VVC Intra coding more feasible in real-time applications, we propose in this paper a novel deep learning based fast QTMT and an early mode prediction method. At the first stage, we use a learned convolutional neural network (CNN) model to predict the coding unit map and then fed into the VVC encoder to early terminate the block partitioning process. After that, we design a statistical model to predict a list of most probable modes (MPM) for each selected Coding using (CU) size. Finally, we employ a so-called three-steps mode decision algorithm to estimate the optimal directional mode without sacrificing the compression performance. The proposed early CU splitting and fast intra prediction are integrated into the latest VTM reference software. Experimental results show that the proposed method can save 50.2% of encoding time with a negligible BD-Rate increase.