Instruction Offloading with HMC 2.0 Standard: A Case Study for Graph Traversals

Proceedings of the 2015 International Symposium on Memory Systems Pub Date : 2015-10-05 DOI:10.1145/2818950.2818982

Lifeng Nai, Hyesoon Kim

引用次数: 32

Abstract

Processing in Memory (PIM) was first proposed decades ago for reducing the overhead of data movement between core and memory. With the advances in 3D-stacking technologies, recently PIM architectures have regained researchers' attentions. Several fully-programmable PIM architectures as well as programming models were proposed in previous literature. Meanwhile, memory industry also starts to integrate computation units into Hybrid Memory Cube (HMC). In HMC 2.0 specification, a number of atomic instructions are supported. Although the instruction support is limited, it enables us to offload computations at instruction granularity. In this paper, we present a preliminary study of instruction offloading on HMC 2.0 using graph traversals as an example. By demonstrating the programmability and performance benefits, we show the feasibility of an instruction-level offloading PIM architecture.

查看原文本刊更多论文

指令卸载与HMC 2.0标准:图遍历的案例研究

内存中处理(PIM)是几十年前首次提出的，目的是减少内核和内存之间数据移动的开销。随着3d叠加技术的进步，近年来PIM结构重新引起了研究人员的关注。在以前的文献中提出了几种完全可编程的PIM体系结构以及编程模型。与此同时，存储行业也开始将计算单元集成到混合存储立方体(HMC)中。在HMC 2.0规范中，支持许多原子指令。尽管指令支持是有限的，但它使我们能够在指令粒度上卸载计算。本文以图遍历为例，对hmc2.0上的指令卸载进行了初步研究。通过演示可编程性和性能优势，我们展示了指令级卸载PIM体系结构的可行性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2015 International Symposium on Memory Systems

自引率

0.00%

发文量