M. Lorkiewicz, O. Stankiewicz, M. Domański, H. Hang, Wen-Hsiao Peng
{"title":"基于人工神经网络的HEVC编码器内部CTU划分快速选择","authors":"M. Lorkiewicz, O. Stankiewicz, M. Domański, H. Hang, Wen-Hsiao Peng","doi":"10.1109/spsympo51155.2020.9593483","DOIUrl":null,"url":null,"abstract":"In the intra-frame video coding, an image is divided into small blocks, and the actual coding is performed individually in these blocks. In this paper, the process is considered in the context of the widely used HEVC compression, where the optimum choice of the division is crucial for the ratedistortion performance. Unfortunately, the search for such optimum division needs very many operations, and is done on the basis of “try and check” approach in the classic implementations. The idea of the paper is to replace this complex part of the encoder by a neural network, and some variants of the potential neural networks are studied and compared in the paper. For the chosen network, the complexity of the encoder is vastly reduced at the cost of negligible loss in the rate-distortion performance. These features are demonstrated using an extensive set of frames from many test video sequences.","PeriodicalId":380515,"journal":{"name":"2021 Signal Processing Symposium (SPSympo)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Fast Selection of INTRA CTU Partitioning in HEVC Encoders using Artificial Neural Networks\",\"authors\":\"M. Lorkiewicz, O. Stankiewicz, M. Domański, H. Hang, Wen-Hsiao Peng\",\"doi\":\"10.1109/spsympo51155.2020.9593483\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the intra-frame video coding, an image is divided into small blocks, and the actual coding is performed individually in these blocks. In this paper, the process is considered in the context of the widely used HEVC compression, where the optimum choice of the division is crucial for the ratedistortion performance. Unfortunately, the search for such optimum division needs very many operations, and is done on the basis of “try and check” approach in the classic implementations. The idea of the paper is to replace this complex part of the encoder by a neural network, and some variants of the potential neural networks are studied and compared in the paper. For the chosen network, the complexity of the encoder is vastly reduced at the cost of negligible loss in the rate-distortion performance. These features are demonstrated using an extensive set of frames from many test video sequences.\",\"PeriodicalId\":380515,\"journal\":{\"name\":\"2021 Signal Processing Symposium (SPSympo)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Signal Processing Symposium (SPSympo)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/spsympo51155.2020.9593483\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Signal Processing Symposium (SPSympo)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/spsympo51155.2020.9593483","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Fast Selection of INTRA CTU Partitioning in HEVC Encoders using Artificial Neural Networks
In the intra-frame video coding, an image is divided into small blocks, and the actual coding is performed individually in these blocks. In this paper, the process is considered in the context of the widely used HEVC compression, where the optimum choice of the division is crucial for the ratedistortion performance. Unfortunately, the search for such optimum division needs very many operations, and is done on the basis of “try and check” approach in the classic implementations. The idea of the paper is to replace this complex part of the encoder by a neural network, and some variants of the potential neural networks are studied and compared in the paper. For the chosen network, the complexity of the encoder is vastly reduced at the cost of negligible loss in the rate-distortion performance. These features are demonstrated using an extensive set of frames from many test video sequences.