Hierarchical Page Eviction Policy for Unified Memory in GPUs

2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) Pub Date : 2019-03-24 DOI:10.1109/ISPASS.2019.00027

Qi Yu, B. Childers, Libo Huang, Cheng Qian, Zhiying Wang

引用次数: 5

Abstract

The introduction of unified memory in discrete GPUs not only improves programmability but also enables oversubscription. However, it introduces high overhead when page faults occur. Therefore, when GPU memory is full, how to select eviction candidates becomes an important issue. The widely used policy LRU performs poorly for workloads with thrashing access patterns, and the advanced cache replacement policy RRIP incurs thrashing when directly applied to GPU memory. In this paper, we propose hierarchical page eviction policy for GPU memory, which relies on a software-managed page set chain to select eviction candidates. Results show that for 15 selected applications, our policy achieves an average speedup of 1.44 and 1.2 over LRU when the oversubscription rate is 75% and 50 %, respectively.

查看原文本刊更多论文

gpu统一内存的分层页面清除策略

在离散gpu中引入统一内存不仅提高了可编程性，而且还实现了超额订阅。但是，当出现页面错误时，它会带来很高的开销。因此，当GPU内存已满时，如何选择驱逐候选对象就成为一个重要的问题。广泛使用的LRU策略对于具有抖动访问模式的工作负载表现不佳，而高级缓存替换策略RRIP直接应用于GPU内存时会导致抖动。在本文中，我们提出了GPU内存的分层页面移除策略，该策略依赖于软件管理的页面集链来选择移除候选对象。结果表明，对于15个选定的应用程序，当超额认购率为75%和50%时，我们的策略分别比LRU实现了1.44和1.2的平均加速。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

自引率

0.00%

发文量