H-NMRU: A Low Area, High Performance Cache Replacement Policy for Embedded Processors

2009 22nd International Conference on VLSI Design Pub Date : 2009-01-05 DOI:10.1109/VLSI.Design.2009.32

Sourav Roy

引用次数: 9

Abstract

We propose a low area, high performance cache replacement policy for embedded processors called Hierarchical Non-Most-Recently-Used (H-NMRU). The H-NMRU is a parameterizable policy where we can trade-off performance with area. We extended the Dinero cache simulator [1] with the H-NMRU policy and performed architectural exploration with a set of cellular and multimedia benchmarks. On a 16 way cache, a two level H-NMRU policy where the first and second levels have 8 and 2 branches respectively, performs as good as the Pseudo-LRU (PLRU) policy with storage area saving of 27%. Compared to true LRU, H-NMRU on a 16 way cache saves huge amount of area (82%) with marginal increase of cache misses (3%). Similar result was also noticed on other cache like structures like branch target buffers. Therefore the two level H-NMRU cache replacement policy (with associativity/2 and 2 branches on the two levels) is a very attractive option for caches on embedded processors with associativities greater than 4.

查看原文本刊更多论文

H-NMRU:嵌入式处理器的低面积、高性能缓存替换策略

我们提出了一种低面积、高性能的嵌入式处理器缓存替换策略，称为分层非最近使用(H-NMRU)。H-NMRU是一个可参数化的策略，我们可以在性能和面积之间进行权衡。我们使用H-NMRU策略扩展了Dinero缓存模拟器[1]，并使用一组蜂窝和多媒体基准进行了架构探索。在16路缓存上，两级H-NMRU策略(第一级和第二级分别有8个和2个分支)的性能与伪lru (PLRU)策略一样好，存储面积节省27%。与真正的LRU相比，H-NMRU在16路高速缓存上节省了大量的面积(82%)，而缓存丢失的边际增加(3%)。类似的结果也出现在其他缓存结构上，比如分支目标缓冲区。因此，对于关联度大于4的嵌入式处理器上的缓存来说，两级H-NMRU缓存替换策略(在两个级别上具有关联性/2和2分支)是一个非常有吸引力的选择。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 22nd International Conference on VLSI Design

自引率

0.00%

发文量