Exploring DRAM cache architectures for CMP server platforms

2007 25th International Conference on Computer Design Pub Date : 2007-10-01 DOI:10.1109/ICCD.2007.4601880

Li Zhao, R. Iyer, R. Illikkal, D. Newell

引用次数: 85

Abstract

As dual-core and quad-core processors arrive in the marketplace, the momentum behind CMP architectures continues to grow strong. As more and more cores/threads are placed on-die, the pressure on the memory subsystem is rapidly increasing. To address this issue, we explore DRAM cache architectures for CMP platforms. In this paper, we investigate the impact of introducing a low latency, large capacity and high bandwidth DRAM-based cache between the last level SRAM cache and memory subsystem. We first show the potential benefits of large DRAM caches for key commercial server workloads. As the primary hurdle to achieving these benefits with DRAM caches is the tag space overheads associated with them, we identify the most efficient DRAM cache organization and investigate various options. Our results show that the combination of 8-bit partial tags and 2-way sectoring achieves the highest performance (20% to 70%) with the lowest tag space (<25%) overhead.

查看原文本刊更多论文

探索CMP服务器平台的DRAM缓存架构

随着双核和四核处理器进入市场，CMP架构背后的势头继续强劲增长。随着越来越多的内核/线程被放置在芯片上，内存子系统的压力正在迅速增加。为了解决这个问题，我们探索了CMP平台的DRAM缓存架构。在本文中，我们研究了在最后一级SRAM缓存和内存子系统之间引入低延迟，大容量和高带宽基于dram的缓存的影响。我们首先展示了大型DRAM缓存对关键商业服务器工作负载的潜在好处。由于使用DRAM缓存实现这些好处的主要障碍是与之相关的标签空间开销，因此我们确定了最有效的DRAM缓存组织并研究了各种选项。我们的结果表明，8位部分标签和双向分界的组合以最低的标签空间开销(<25%)实现了最高的性能(20%到70%)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2007 25th International Conference on Computer Design

自引率

0.00%

发文量