HeteroSync: A benchmark suite for fine-grained synchronization on tightly coupled GPUs

2017 IEEE International Symposium on Workload Characterization (IISWC) Pub Date: 2017-10-01 DOI: 10.1109/IISWC.2017.8167781

Matthew D. Sinclair, Johnathan Alsop, S. Adve

Citations: 22

Abstract

Traditionally, GPUs focused on streaming, data-parallel applications, with little data reuse or sharing and coarse-grained synchronization. However, the rise of general-purpose GPU (GPGPU) computing has made GPUs desirable for applications with more general sharing patterns and fine-grained synchronization, especially for recent GPUs that have a unified address space and coherent caches. Prior work has introduced microbenchmarks to measure the impact of these changes, but each paper uses its own set of microbenchmarks. In this work, we combine several of these sets into a single suite, HeteroSync. HeteroSync includes several synchronization primitives, data sharing at different levels of the memory hierarchy, and relaxed atomics. We characterize the scalability of HeteroSync for different coherence protocols and consistency models on modern, tightly coupled CPU-GPU systems and show that certain algorithms, coherence protocols, and consistency models scale better than others.
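To make the terms "synchronization primitive" and "relaxed atomics" concrete, the following is a minimal CPU-side sketch in standard C++. It is not code from HeteroSync, which targets GPUs. The names `SpinMutex`, `run_mutex_counter`, and `run_relaxed_counter` are hypothetical and introduced only for illustration. The first function guards a plain counter with a test-and-set spin mutex that uses acquire/release ordering. The second updates an atomic counter with relaxed ordering, which is sufficient when only the final total matters.

```cpp
#include <atomic>
#include <cassert>
#include <thread>
#include <vector>

// Hypothetical test-and-set spin mutex (illustrative, not from HeteroSync).
class SpinMutex {
public:
    void lock() {
        // exchange returns the previous value; spin until it was false.
        // Acquire ordering makes the previous holder's writes visible.
        while (locked_.exchange(true, std::memory_order_acquire)) {
            std::this_thread::yield();
        }
    }
    void unlock() {
        // Release ordering publishes writes made inside the critical section.
        locked_.store(false, std::memory_order_release);
    }
private:
    std::atomic<bool> locked_{false};
};

// Each thread increments a plain counter inside the mutex.
long run_mutex_counter(int num_threads, int iters) {
    SpinMutex mutex;
    long counter = 0;  // non-atomic; protected only by the mutex
    std::vector<std::thread> workers;
    for (int t = 0; t < num_threads; ++t) {
        workers.emplace_back([&] {
            for (int i = 0; i < iters; ++i) {
                mutex.lock();
                ++counter;
                mutex.unlock();
            }
        });
    }
    for (auto& w : workers) w.join();
    return counter;
}

// Each thread increments an atomic counter with relaxed ordering.
// Relaxed atomics guarantee atomicity of each update but impose no
// ordering on surrounding memory operations.
long run_relaxed_counter(int num_threads, int iters) {
    std::atomic<long> counter{0};
    std::vector<std::thread> workers;
    for (int t = 0; t < num_threads; ++t) {
        workers.emplace_back([&] {
            for (int i = 0; i < iters; ++i) {
                counter.fetch_add(1, std::memory_order_relaxed);
            }
        });
    }
    for (auto& w : workers) w.join();  // join orders the final read after all updates
    return counter.load();
}
```

Calling `run_mutex_counter(4, 1000)` or `run_relaxed_counter(4, 1000)` should return 4000 in both cases. The difference lies in how much ordering each approach demands from the memory system, which is the kind of cost that varies across coherence protocols and consistency models.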


