利用AVX2提高HEVC中切片和平铺并行度的加速

Dimitris Skoumpourdis, Panos K. Papadopoulos, M. Koziri, Nikos Tziritas, Thanasis Loukopoulos, Ioannis Anagnostopoulos
{"title":"利用AVX2提高HEVC中切片和平铺并行度的加速","authors":"Dimitris Skoumpourdis, Panos K. Papadopoulos, M. Koziri, Nikos Tziritas, Thanasis Loukopoulos, Ioannis Anagnostopoulos","doi":"10.1145/3139367.3139427","DOIUrl":null,"url":null,"abstract":"HEVC has emerged as the new video coding standard promising improved compression ratios (for the same quality) by up to 50% compared to H.264/AVC. To achieve this performance HEVC requires increased computational overhead compared to its predecessor. For this reason parallelism is used, usually at a coarse grained level, e.g., per slice or tile. In this paper we turn our attention towards further speeding up the HEVC encoding process by combining coarse grained parallelism with fine grained, in the form of AVX2 instructions implementing SIMD parallelism at SAD (Sum of Absolute Difference) and SSE (Sum of Squared Error) calculations. Experimental evaluation with common test video sequences illustrates that an additional reduction (in encoding time) of roughly 11% on average, compared to standalone coarse grained parallelism is achievable, leading in many cases to superlinear speedup.","PeriodicalId":436862,"journal":{"name":"Proceedings of the 21st Pan-Hellenic Conference on Informatics","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On Improving the Speedup of Slice and Tile Level Parallelism in HEVC Using AVX2\",\"authors\":\"Dimitris Skoumpourdis, Panos K. Papadopoulos, M. Koziri, Nikos Tziritas, Thanasis Loukopoulos, Ioannis Anagnostopoulos\",\"doi\":\"10.1145/3139367.3139427\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"HEVC has emerged as the new video coding standard promising improved compression ratios (for the same quality) by up to 50% compared to H.264/AVC. To achieve this performance HEVC requires increased computational overhead compared to its predecessor. For this reason parallelism is used, usually at a coarse grained level, e.g., per slice or tile. In this paper we turn our attention towards further speeding up the HEVC encoding process by combining coarse grained parallelism with fine grained, in the form of AVX2 instructions implementing SIMD parallelism at SAD (Sum of Absolute Difference) and SSE (Sum of Squared Error) calculations. Experimental evaluation with common test video sequences illustrates that an additional reduction (in encoding time) of roughly 11% on average, compared to standalone coarse grained parallelism is achievable, leading in many cases to superlinear speedup.\",\"PeriodicalId\":436862,\"journal\":{\"name\":\"Proceedings of the 21st Pan-Hellenic Conference on Informatics\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 21st Pan-Hellenic Conference on Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3139367.3139427\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21st Pan-Hellenic Conference on Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3139367.3139427","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

HEVC已经成为新的视频编码标准,与H.264/AVC相比,HEVC有望将压缩比(在相同质量下)提高50%。为了达到这种性能,HEVC需要比其前身增加计算开销。由于这个原因,通常在粗粒度级别上使用并行性,例如,每片或每块。在本文中,我们将注意力转向通过结合粗粒度并行性和细粒度并行性来进一步加快HEVC编码过程,以AVX2指令的形式在SAD(绝对差和)和SSE(平方误差和)计算中实现SIMD并行性。使用普通测试视频序列进行的实验评估表明,与独立的粗粒度并行性相比,可以平均减少大约11%的额外(编码时间),从而在许多情况下实现超线性加速。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
On Improving the Speedup of Slice and Tile Level Parallelism in HEVC Using AVX2
HEVC has emerged as the new video coding standard promising improved compression ratios (for the same quality) by up to 50% compared to H.264/AVC. To achieve this performance HEVC requires increased computational overhead compared to its predecessor. For this reason parallelism is used, usually at a coarse grained level, e.g., per slice or tile. In this paper we turn our attention towards further speeding up the HEVC encoding process by combining coarse grained parallelism with fine grained, in the form of AVX2 instructions implementing SIMD parallelism at SAD (Sum of Absolute Difference) and SSE (Sum of Squared Error) calculations. Experimental evaluation with common test video sequences illustrates that an additional reduction (in encoding time) of roughly 11% on average, compared to standalone coarse grained parallelism is achievable, leading in many cases to superlinear speedup.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信