A hardware/software co-solution to achieving high throughput required by motion estimation part in H.264/AVC HDTV real-time application

Zhenxing Chen, T. Ikenaga, S. Goto
{"title":"A hardware/software co-solution to achieving high throughput required by motion estimation part in H.264/AVC HDTV real-time application","authors":"Zhenxing Chen, T. Ikenaga, S. Goto","doi":"10.1109/VDAT.2008.4542429","DOIUrl":null,"url":null,"abstract":"The high throughput required by Motion Estimation (ME) part in H.264/AVC High Definition TV (HDTV) real-time application is very difficult to achieve. Currently one kind of the solutions to this problem is multipling the processing element (PE) array to construct redundant PE array structure. Although redundant structure can take advantages of parallel processing to achieve high throughput, meanwhile it bring in linear increasing of hardware cost. In this paper, a hardware/software co-solusion is proposed to achieve the required throughput of ME part in H.264/AVC HDTV realtime application. In software side, one adaptive search range (ASR) algorithm which is previously proposed by us [12] is firstly introduced and then experimentally proved can improve the throughput 11.48 times averagely in HDTV1080p video sequence. In hardware side, a previously proposed architecture called SAD-tree [17] is firstly introduced. Then based on this architecture, optimization that increase the frequence is proposed. The hardware implementing result of the optimized architecture shows the proposed optimization can triple the frequence. Finally, it is illustrated that the hardware/software co-solution can help to achieve the required throughput.","PeriodicalId":156790,"journal":{"name":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE International Symposium on VLSI Design, Automation and Test (VLSI-DAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VDAT.2008.4542429","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

The high throughput required by Motion Estimation (ME) part in H.264/AVC High Definition TV (HDTV) real-time application is very difficult to achieve. Currently one kind of the solutions to this problem is multipling the processing element (PE) array to construct redundant PE array structure. Although redundant structure can take advantages of parallel processing to achieve high throughput, meanwhile it bring in linear increasing of hardware cost. In this paper, a hardware/software co-solusion is proposed to achieve the required throughput of ME part in H.264/AVC HDTV realtime application. In software side, one adaptive search range (ASR) algorithm which is previously proposed by us [12] is firstly introduced and then experimentally proved can improve the throughput 11.48 times averagely in HDTV1080p video sequence. In hardware side, a previously proposed architecture called SAD-tree [17] is firstly introduced. Then based on this architecture, optimization that increase the frequence is proposed. The hardware implementing result of the optimized architecture shows the proposed optimization can triple the frequence. Finally, it is illustrated that the hardware/software co-solution can help to achieve the required throughput.
为实现H.264/AVC高清电视实时应用中运动估计部分所需的高吞吐量,提出了一种软硬件协同解决方案
在H.264/AVC高清电视(HDTV)实时应用中,运动估计(ME)部分所要求的高吞吐量是很难实现的。目前解决这一问题的一种方法是将处理单元(PE)阵列相乘,构造冗余的PE阵列结构。冗余结构虽然可以利用并行处理实现高吞吐量,但同时也带来了硬件成本的线性增加。针对H.264/AVC高清电视实时应用中ME部分的吞吐量要求,提出了一种软硬件协同解决方案。在软件方面,首先介绍了我们[12]提出的一种自适应搜索范围(adaptive search range, ASR)算法,并通过实验证明,在HDTV1080p视频序列中,ASR算法的吞吐量平均提高了11.48倍。在硬件方面,首先介绍了先前提出的一种称为SAD-tree的架构[17]。在此基础上,提出了提高频率的优化方案。优化架构的硬件实现结果表明,所提出的优化可以使频率提高三倍。最后,说明了硬件/软件协同解决方案可以帮助实现所需的吞吐量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信