Performance comparison of two hardware implementations of the deblocking filter used in H.264 by changing the utilized data width

K. Messaoudi, E. Bourennane, S. Toumi, Gilberto Ochoa
DOI: 10.1109/WOSSPA.2011.5931411 (https://doi.org/10.1109/WOSSPA.2011.5931411)
Venue: International Workshop on Systems, Signal Processing and their Applications, WOSSPA
Publication date: 2011-05-09
Citations: 7

Abstract

The deblocking filter is more complex than the other modules in H.264 because it is highly adaptive, is applied to every boundary of every 4×4 block, and updates up to three pixels in each direction. After careful study and analysis of this filter, we conclude that its complexity lies in the data dependencies and in the control module of the elementary filters that compose it, not in the type of these filters. In this paper, we propose two hardware implementations of the deblocking filter that use the same memory-management strategy and differ only in the data width they use: the first uses 32 bits and the second uses 128 bits. The 128-bit data width is chosen to ensure a high degree of parallelism and to avoid transpose circuits and intermediate buffers between the elementary modules of the filter. Simulation and synthesis results for the two implementations are then compared. The number of LUTs consumed remains almost the same as in previous implementations, while the number of clock cycles required to process a macroblock is about 40% lower than that of the best competing proposal.
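The following is a minimal sketch of why a wider word can remove the need for a transpose step. It is an illustration under stated assumptions, not the authors' circuit: the bit layout, the 8-bit sample size, and the helper names (`pack_block`, `row`, `column`, `words_per_macroblock`) are all hypothetical. It assumes one 128-bit word carries a whole 4×4 block of 8-bit samples, so that the rows (needed when filtering across a vertical edge) and the columns (needed when filtering across a horizontal edge) can both be read from the same word, whereas a 32-bit word carries only a single row of four samples.

```python
# Illustrative only: the pixel layout and helper names are assumptions,
# not details taken from the paper.

PIXEL_BITS = 8  # assumed 8-bit samples
BLOCK = 4       # deblocking operates on 4x4 block boundaries


def pack_block(block):
    """Pack a 4x4 block (4 rows of 4 ints in 0..255) into one 128-bit integer.

    Pixel (r, c) occupies bits [8*(4*r + c), 8*(4*r + c) + 8).
    """
    word = 0
    for r in range(BLOCK):
        for c in range(BLOCK):
            word |= (block[r][c] & 0xFF) << (PIXEL_BITS * (BLOCK * r + c))
    return word


def pixel(word, r, c):
    """Extract pixel (r, c) from a packed 128-bit block word."""
    return (word >> (PIXEL_BITS * (BLOCK * r + c))) & 0xFF


def row(word, r):
    """Four samples of row r: the input when filtering across a vertical edge."""
    return [pixel(word, r, c) for c in range(BLOCK)]


def column(word, c):
    """Four samples of column c: the input when filtering across a horizontal edge."""
    return [pixel(word, r, c) for r in range(BLOCK)]


def words_per_macroblock(width_bits, mb_size=16):
    """Bus words needed to carry the luma samples of one 16x16 macroblock."""
    return (mb_size * mb_size * PIXEL_BITS) // width_bits


if __name__ == "__main__":
    block = [[4 * r + c for c in range(BLOCK)] for r in range(BLOCK)]
    word = pack_block(block)
    print(row(word, 1))     # one row, also available from a 32-bit word
    print(column(word, 2))  # one column, read from the same 128-bit word
    print(words_per_macroblock(32), words_per_macroblock(128))
```

In this model, selecting a row or a column is only a different choice of bit slices from the same word, which in hardware corresponds to wiring rather than a separate transpose buffer. The word counts cover data movement for luma samples only and say nothing about the cycle counts reported in the paper.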