An efficient pipeline execution of H.264/AVC intra 4×4 frame design

Smaoui Soulef, H. Loukil, A. Ben Atitallah, N. Masmoudi
{"title":"An efficient pipeline execution of H.264/AVC intra 4×4 frame design","authors":"Smaoui Soulef, H. Loukil, A. Ben Atitallah, N. Masmoudi","doi":"10.1109/SSD.2010.5585524","DOIUrl":null,"url":null,"abstract":"In this paper, we present an implementation of an optimized H.264 intra 4×4 algorithm in order to reduce the time of the intra 4×4 process. However the source of waste time in conventional architecture of intra 4×4 is the serialization of intra prediction and reconstruction of sixteen 4×4 blocks in one macroblock and the intra prediction of the current 4×4 block cannot be performed before the reconstruction of the previous 4×4 block. Therefore, for a high speed implementation we replaced the conventional one by a pipelined architecture while maintaining consistency with the standard. So we have studied ten alternative scanning orders based on rearranging order of intra 4×4 and we choose the best one which reduce dependencies between consecutively executed blocks without performance degradation. This order is implemented by a pipelined architecture using VHDL language. The VHDL code is verified to work at 100 MHz in an ALTERA Stratix II EP2S60F1020C3 FPGA. As a result, the processing time is reduced by 31.25% compared to the conventional implementation. So, it can be a good solution for real-time video application. The H.264 intra 4×4 hardware and software are demonstrated to work together on ALTERA NIOS-II development board with Stratix II EP2S60F1020C3 FPGA.","PeriodicalId":432382,"journal":{"name":"2010 7th International Multi- Conference on Systems, Signals and Devices","volume":"2018 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 7th International Multi- Conference on Systems, Signals and Devices","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSD.2010.5585524","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

In this paper, we present an implementation of an optimized H.264 intra 4×4 algorithm in order to reduce the time of the intra 4×4 process. However the source of waste time in conventional architecture of intra 4×4 is the serialization of intra prediction and reconstruction of sixteen 4×4 blocks in one macroblock and the intra prediction of the current 4×4 block cannot be performed before the reconstruction of the previous 4×4 block. Therefore, for a high speed implementation we replaced the conventional one by a pipelined architecture while maintaining consistency with the standard. So we have studied ten alternative scanning orders based on rearranging order of intra 4×4 and we choose the best one which reduce dependencies between consecutively executed blocks without performance degradation. This order is implemented by a pipelined architecture using VHDL language. The VHDL code is verified to work at 100 MHz in an ALTERA Stratix II EP2S60F1020C3 FPGA. As a result, the processing time is reduced by 31.25% compared to the conventional implementation. So, it can be a good solution for real-time video application. The H.264 intra 4×4 hardware and software are demonstrated to work together on ALTERA NIOS-II development board with Stratix II EP2S60F1020C3 FPGA.
高效流水线执行H.264/AVC帧内4×4帧设计
在本文中,我们提出了一个优化的H.264 intra 4×4算法的实现,以减少intra 4×4过程的时间。然而,在传统的intra 4×4架构中,浪费时间的根源是在一个宏块中序列化16个4×4块的内部预测和重建,并且不能在重建前一个4×4块之前执行当前4×4块的内部预测。因此,对于高速实现,我们在保持与标准的一致性的同时,用流水线架构取代了传统的实现。因此,我们研究了10种基于内部重新排列顺序4×4的扫描顺序,并选择了最佳的扫描顺序,减少了连续执行块之间的依赖关系,而不会降低性能。该命令是通过使用VHDL语言的流水线架构实现的。VHDL代码经验证可在ALTERA Stratix II EP2S60F1020C3 FPGA中工作在100 MHz。因此,与传统实现相比,处理时间减少了31.25%。因此,它可以成为实时视频应用的一个很好的解决方案。演示了H.264内部4×4硬件和软件在ALTERA NIOS-II开发板上与Stratix II EP2S60F1020C3 FPGA一起工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信