Extending Sparse Tensor Accelerators to Support Multiple Compression Formats

2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS) Pub Date : 2021-03-18 DOI:10.1109/IPDPS49936.2021.00110

Eric Qin, Geonhwa Jeong, William Won, Sheng-Chun Kao, Hyoukjun Kwon, S. Srinivasan, Dipankar Das, G. Moon, S. Rajamanickam, T. Krishna

引用次数: 8

Abstract

Sparsity, which occurs in both scientific applications and Deep Learning (DL) models, has been a key target of optimization within recent ASIC accelerators due to the potential memory and compute savings. These applications use data stored in a variety of compression formats. We demonstrate that both the compactness of different compression formats and the compute efficiency of the algorithms enabled by them vary across tensor dimensions and amount of sparsity. Since DL and scientific workloads span across all sparsity regions, there can be numerous format combinations for optimizing memory and compute efficiency. Unfortunately, many proposed accelerators operate on one or two fixed format combinations. This work proposes hardware extensions to accelerators for supporting numerous format combinations seamlessly and demonstrates $\sim 4 \times$ speedup over performing format conversions in software.

查看原文本刊更多论文

扩展稀疏张量加速器以支持多种压缩格式

稀疏性在科学应用和深度学习(DL)模型中都存在，由于潜在的内存和计算节省，它一直是最近ASIC加速器优化的关键目标。这些应用程序使用以各种压缩格式存储的数据。我们证明了不同压缩格式的紧凑性和它们所支持的算法的计算效率在张量维度和稀疏度上都是不同的。由于DL和科学工作负载跨越所有稀疏性区域，因此可以有多种格式组合来优化内存和计算效率。不幸的是，许多被提议的加速器都是在一种或两种固定格式组合上运行的。这项工作提出了加速器的硬件扩展，以无缝地支持多种格式组合，并演示了在软件中执行格式转换时的4倍加速。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

自引率

0.00%

发文量