Assessing the Overhead of Offloading Compression Tasks

L. Promberger, R. Schwemmer, H. Fröning
{"title":"Assessing the Overhead of Offloading Compression Tasks","authors":"L. Promberger, R. Schwemmer, H. Fröning","doi":"10.1145/3409390.3409405","DOIUrl":null,"url":null,"abstract":"Exploring compression is increasingly promising as trade-off between computations and data movement. There are two main reasons: First, the gap between processing speed and I/O continues to grow, and technology trends indicate a continuation of this. Second, performance is determined by energy efficiency, and the overall power consumption is dominated by the consumption of data movements. For these reasons there is already a plethora of related works on compression from various domains. Most recently, a couple of accelerators have been introduced to offload compression tasks from the main processor, for instance by AHA, Intel and Microsoft. Yet, one lacks the understanding of the overhead of compression when offloading tasks. In particular, such offloading is most beneficial for overlap with other tasks, if the associated overhead on the main processor is negligible. This work evaluates the integration costs compared to a solely software-based solution considering multiple compression algorithms. Among others, High Energy Physics data are used as a prime example of big data sources. The results imply that on average the zlib implementation on the accelerator achieves a comparable compression ratio to zlib level 2 on a CPU, while having up to 17 times the throughput and utilizing over 80 % less CPU resources. These results suggest that, given the right orchestration of compression and data movement tasks, the overhead of offloading compression is limited but present. 
Considering that compression is only a single task of a larger data processing pipeline, this overhead cannot be neglected.","PeriodicalId":350506,"journal":{"name":"Workshop Proceedings of the 49th International Conference on Parallel Processing","volume":"272 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Workshop Proceedings of the 49th International Conference on Parallel Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3409390.3409405","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Exploring compression is increasingly promising as a trade-off between computation and data movement. There are two main reasons: First, the gap between processing speed and I/O continues to grow, and technology trends indicate that this will continue. Second, performance is determined by energy efficiency, and overall power consumption is dominated by data movement. For these reasons, there is already a plethora of related work on compression from various domains. Most recently, a couple of accelerators have been introduced to offload compression tasks from the main processor, for instance by AHA, Intel, and Microsoft. Yet, the overhead of offloading compression tasks is not well understood. In particular, such offloading is most beneficial when it overlaps with other tasks, which requires the associated overhead on the main processor to be negligible. This work evaluates the integration costs of such accelerators, compared to a solely software-based solution, for multiple compression algorithms. Among others, High Energy Physics data are used as a prime example of big data sources. The results imply that on average the zlib implementation on the accelerator achieves a compression ratio comparable to zlib level 2 on a CPU, while delivering up to 17 times the throughput and utilizing over 80% less CPU resources. These results suggest that, given the right orchestration of compression and data movement tasks, the overhead of offloading compression is limited but present. Considering that compression is only a single task of a larger data processing pipeline, this overhead cannot be neglected.
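The abstract compares the accelerator against software zlib at specific compression levels. A minimal sketch (not from the paper) of how such a CPU-side baseline could be measured with Python's standard `zlib` module is shown below; the payload and level choices are illustrative assumptions:

```python
# Hypothetical sketch: measure compression ratio and throughput of
# software zlib at several levels, i.e. the CPU-side baseline the
# accelerator results are compared against. Payload is a stand-in.
import time
import zlib

data = b"high energy physics event record " * 100_000  # illustrative payload

for level in (1, 2, 6):
    start = time.perf_counter()
    compressed = zlib.compress(data, level)
    elapsed = time.perf_counter() - start
    ratio = len(data) / len(compressed)          # input bytes per output byte
    throughput = len(data) / elapsed / 1e6       # MB/s of uncompressed input
    print(f"level {level}: ratio {ratio:.2f}, throughput {throughput:.1f} MB/s")
```

In a real benchmark, representative data sets would replace the synthetic payload, and CPU utilization would be sampled alongside wall-clock time to reproduce the kind of comparison the paper makes.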