A Macroblock Homogeneity Detection Method and its Application for Block Size Decision in H.264/AVC

G. Tian, T. Zhang, S. Goto
Journal: The Journal of the Institute of Image Electronics Engineers of Japan
Published: 2010-09-25
DOI: 10.11371/IIEEJ.39.672 (https://doi.org/10.11371/IIEEJ.39.672)
Citation count: 2

Abstract

The variable block sizes for intra and inter coding in H.264/AVC achieve significant coding gain compared with coding a macroblock (MB) at a fixed size. However, the computational burden is extremely heavy when the Rate Distortion Optimization (RDO) process selects the optimal coding block by brute-force search. This paper proposes an MB homogeneity detection method to accelerate H.264/AVC intra and inter coding. The luminance values of all pixels in an MB are used to calculate an entropy feature, which is defined as the MB's spatial homogeneity. Based on the homogeneity judgment, either the 16 × 16 or the 4 × 4 block size is selected for intra coding; for inter coding, either the large blocks in {16 × 16, 16 × 8, 8 × 16} or the sub-blocks in {8 × 8, 8 × 4, 4 × 8, 4 × 4} are chosen. In particular, a cost function is defined to select a near-optimal threshold for the block size decision. The proposed methods are verified on a wide range of video sequences with different spatial and motion characteristics. Simulations demonstrate that a consistent encoding gain is achieved for all tested videos, regardless of their motion and spatial features. Encoding complexity for intra coding alone can be reduced by 31%–34%, and the time savings for inter mode decision are 43.7%–58.7%, both with negligible loss in bitrate and PSNR.
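The homogeneity test described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact entropy formula or the threshold value, so everything below is an assumption: the entropy is taken to be the Shannon entropy of the MB's luminance histogram, an MB is treated as homogeneous when that entropy is at or below a threshold, and the names `luma_entropy`, `candidate_block_sizes`, and `EXAMPLE_THRESHOLD` are hypothetical. The paper selects its threshold with a cost function that is not reproduced here.

```python
import math
from collections import Counter

# Hypothetical threshold in bits; the paper derives its own value
# from a cost function that the abstract does not specify.
EXAMPLE_THRESHOLD = 3.0


def luma_entropy(mb):
    """Shannon entropy (in bits) of the luminance histogram of one MB.

    `mb` is a 2-D list of integer luma samples, e.g. 16 rows of 16 values.
    Assumption: the paper's "entropy feature" is modeled as plain
    Shannon entropy over the sample values.
    """
    values = [v for row in mb for v in row]
    n = len(values)
    counts = Counter(values)
    entropy = 0.0
    for c in counts.values():
        p = c / n
        entropy -= p * math.log2(p)
    return entropy


def candidate_block_sizes(mb, coding, threshold=EXAMPLE_THRESHOLD):
    """Return the block sizes left for RDO to evaluate.

    Low entropy is treated as "homogeneous" (assumption), which keeps
    the large partitions; otherwise the small partitions are kept.
    """
    homogeneous = luma_entropy(mb) <= threshold
    if coding == "intra":
        return ["16x16"] if homogeneous else ["4x4"]
    if coding == "inter":
        if homogeneous:
            return ["16x16", "16x8", "8x16"]
        return ["8x8", "8x4", "4x8", "4x4"]
    raise ValueError("coding must be 'intra' or 'inter'")


if __name__ == "__main__":
    flat_mb = [[128] * 16 for _ in range(16)]                       # one gray level
    busy_mb = [[r * 16 + c for c in range(16)] for r in range(16)]  # 256 distinct levels
    print(candidate_block_sizes(flat_mb, "intra"))
    print(candidate_block_sizes(busy_mb, "inter"))
```

With this definition a perfectly flat MB has an entropy of 0 bits and an MB with 256 distinct sample values has 8 bits, so any threshold between those extremes separates the two cases. The saving comes from RDO evaluating only the returned subset of block sizes instead of all of them.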